Frozen AI News archive

GPT4o August + 100% Structured Outputs for All (GPT4o mini edition)

**Stability.ai** users are leveraging **LoRA** and **ControlNet** for enhanced line art and artistic style transformations, while facing challenges with **AMD GPUs** due to the discontinuation of **ZLUDA**. Community tensions persist around the **r/stablediffusion** subreddit moderation. **Unsloth AI** users report fine-tuning difficulties with **LLaMA3** models, especially with PPO trainer integration and prompt formatting, alongside anticipation for **multi-GPU** support and cost-effective cloud computing on **RunPod**. **Google** released the lightweight **Gemma 2 2B** model optimized for on-device use with **2.6B** parameters, featuring safety and sparse autoencoder tools, and announced **Diffusers** integration for efficient text-to-image generation on limited resources.

Canonical issue URL


PART 1: High level Discord summaries

Stability.ai (Stable Diffusion) Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


LM Studio Discord


CUDA MODE Discord


Nous Research AI Discord


Latent Space Discord


OpenAI Discord


Perplexity AI Discord


Eleuther Discord


LangChain AI Discord


OpenRouter (Alex Atallah) Discord


LlamaIndex Discord


Cohere Discord


Modular (Mojo 🔥) Discord


LAION Discord


tinygrad (George Hotz) Discord


DSPy Discord


OpenAccess AI Collective (axolotl) Discord


Torchtune Discord


OpenInterpreter Discord


Mozilla AI Discord


MLOps @Chipro Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Stability.ai (Stable Diffusion) ▷ #general-chat (459 messages🔥🔥🔥):

  • Stable Diffusion and LoRA
  • ControlNet usage
  • Issues with AMD GPUs
  • r/stablediffusion drama
  • Image generation techniques

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (105 messages🔥🔥):

  • Unsloth Fine-tuning Issues
  • LLaMA Model Training
  • Multi-GPU Support
  • Inference Optimization
  • Resource Guides

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (10 messages🔥):

  • BigLlama-3.1-1T-Instruct
  • ChatGPT Pokémon Prompts
  • Gaming Discussions

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (162 messages🔥🔥):

  • Llama-3 Model Training
  • Colab Usage and Limitations
  • Model Deployment with Ollama
  • GGUF Model Conversion
  • Error Handling in Fine-Tuning

Links mentioned:


Unsloth AI (Daniel Han) ▷ #community-collaboration (1 messages):

  • LLaMA3 model configuration
  • Cost-effective cloud computing

Unsloth AI (Daniel Han) ▷ #research (1 messages):

vvelo: https://fxtwitter.com/reach_vb/status/1820493688377643178


HuggingFace ▷ #announcements (1 messages):

  • Gemma 2 2B Release
  • Diffusers Integration for FLUX
  • Magpie Ultra Dataset
  • Whisper Generations with Medusa Heads
  • llm-sagemaker Terraform Module

Links mentioned:


HuggingFace ▷ #general (239 messages🔥🔥):

  • Hugging Face Resources
  • Datasets Issues
  • Summer Plans
  • AI and School Experiences
  • Model Development and Filtering Techniques

Links mentioned:


HuggingFace ▷ #today-im-learning (3 messages):

  • Linear Algebra for 3D Video Analysis
  • Blog Article Recommendations

HuggingFace ▷ #cool-finds (4 messages):

  • Image Synthesis with Transformers
  • Integrating Graphs into LLMs

HuggingFace ▷ #i-made-this (5 messages):

  • SAC agent training
  • Embodied agent platform
  • Talking head synthesis
  • BiRefNet segmentation
  • 3D voxel environments

Links mentioned:


HuggingFace ▷ #reading-group (5 messages):

  • Structured Outputs in OpenAI API
  • LLMs and Reasoning Limitations
  • Scratchpad Theory for LLMs
  • Attention Mechanisms in LLMs

HuggingFace ▷ #computer-vision (4 messages):

  • Depth Estimation
  • Code Implementations for Research Papers

HuggingFace ▷ #NLP (2 messages):

  • NER Annotated CVs Dataset
  • Identifying Relevant JSON Files
  • Keyword and Semantic Search Techniques
  • Using S-BERT for Embedding
  • Feedback on Dataset Utilization

Link mentioned: NER Annotated CVs: This dataset includes 5029 annotated curriculum vitae (CV), marked with IT skill


LM Studio ▷ #general (157 messages🔥🔥):

  • AnythingLLM Setup
  • Model Performance and Comparison
  • TTS and STT Integrations
  • Updates in LM Studio and Popular Models
  • Phi-3 Model Support Issues

Links mentioned:


LM Studio ▷ #hardware-discussion (59 messages🔥🔥):

  • Performance of 8700G/780m IGP
  • Upcoming hardware rumors
  • Comparative GPU performance
  • P40 pricing trends
  • VRAM requirements for larger models

CUDA MODE ▷ #general (5 messages):

  • PufferLib Gameboy Emulator
  • Reinforcement Learning Stream
  • GPUDrive Multi-Agent Simulator
  • Mojo Talk Proposal

Links mentioned:


CUDA MODE ▷ #torch (17 messages🔥):

  • PyTorch 2.4 with CUDA 12.4 issues
  • Windows compatibility for torch cublas hgemm
  • FP16 accumulate performance
  • Performance benchmarking results

Link mentioned: GitHub - aredden/torch-cublas-hgemm: PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu: PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu - aredden/torch-cublas-hgemm


CUDA MODE ▷ #algorithms (3 messages):

  • Model tuning
  • Quantization bits

CUDA MODE ▷ #jobs (7 messages):

  • Hudson River Trading internships
  • GPU job roles
  • Application process for internships

Links mentioned:


CUDA MODE ▷ #torchao (34 messages🔥):

  • INT8 symmetric quantization
  • Quantized training
  • Installation errors with torchao
  • Hardware compatibility issues
  • GPTQ refactor progress

Links mentioned:


CUDA MODE ▷ #off-topic (7 messages):

  • Llama 3 Dataset Analysis
  • Prefix Chunk LLM Paper
  • SARATHI Inference Techniques
  • Spector CTF Challenge

Links mentioned:


CUDA MODE ▷ #llmdotc (99 messages🔥🔥):

  • Ragged Attention Challenges
  • Tokenization Issues in Llama Models
  • Training Stability with Batch Size
  • Instruct Model Newline Formatting
  • Benchmarking PyTorch 2.4 with Nvidia

Links mentioned:


CUDA MODE ▷ #rocm (9 messages🔥):

  • ZLUDA 3 Removal
  • AMD's Permission Dispute
  • Employment Contract Clauses

Links mentioned:


CUDA MODE ▷ #cudamode-irl (2 messages):

  • Project Timelines
  • Google Form Proposals

Nous Research AI ▷ #datasets (1 messages):

  • UltraSteer-V0 Dataset
  • Nvidia Llama2-13B-SteerLM-RM
  • Fine-Grained Dialogue Labels
  • De-duplication Process
  • Initial Version Release

Link mentioned: Avelina/UltraSteer-v0 · Datasets at Hugging Face: no description found


Nous Research AI ▷ #off-topic (1 messages):

vikings7699: Has anyone here ever worked on fine tuning a model specifically for insurance sector?


Nous Research AI ▷ #general (129 messages🔥🔥):

  • Model Training Issues
  • Open Medical Reasoning Tasks
  • New Model Releases
  • Flux AI Capabilities
  • Hugging Face Developments

Links mentioned:


Nous Research AI ▷ #ask-about-llms (19 messages🔥):

  • Library Usage for Fine-tuning
  • Inference Stack Resources
  • Insurance Sector Fine-tuning
  • Pay-as-you-go Llama 450b Hosting
  • Memory Bottlenecks in Inference

Nous Research AI ▷ #reasoning-tasks-master-list (7 messages):

  • Open Medical Reasoning Tasks
  • Synthetic Task Generation
  • LLMs potential limitations

Links mentioned:


Latent Space ▷ #ai-general-chat (128 messages🔥🔥):

  • Web Developer to AI Engineer Pipeline
  • OpenAI Departures
  • Generative AI in Retail
  • Structured Outputs in GPT-4o
  • Energy-Based Language Modeling

Links mentioned:


OpenAI ▷ #annnouncements (1 messages):

  • OpenAI DevDay
  • Hands-on sessions
  • Developer meetups

OpenAI ▷ #ai-discussions (86 messages🔥🔥):

  • ChatGPT App Release
  • DALL-E 3 Usage
  • Llama Model API
  • OpenAI Structured Outputs
  • Video Analysis Capabilities

Link mentioned: Assistant GPT - Can I perform knowledge retrieval from a cloud storage?: I have some files that are on my cloud storage (onedrive) and would like to perform knowledge retrieval on them. Is it possible to integrate an assistant to perform knowledge retrieval directly fro...


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

  • Search GPT availability
  • Upload limits for members
  • Generative AI in gaming
  • GPT-4o updates
  • Model response changes

OpenAI ▷ #prompt-engineering (1 messages):

darthgustav.: Use the python tool and import data from uploads.


OpenAI ▷ #api-discussions (1 messages):

darthgustav.: Use the python tool and import data from uploads.


Perplexity AI ▷ #general (82 messages🔥🔥):

  • Perplexity AI updates
  • Feedback on Model Performance
  • Technical Issues with Uploading
  • Content Sorting and Recommendation Systems
  • User Experience with Pro Features

Links mentioned:


Perplexity AI ▷ #sharing (7 messages):

  • NVIDIA Blackwell GPU Delays
  • Google Legal Setback
  • Market Jitters
  • Warhol's Digital Portrait
  • Llama 3 Performance

Links mentioned:


Perplexity AI ▷ #pplx-api (8 messages🔥):

  • Perplexity API Issues
  • Model Availability
  • API Errors
  • Testing on Labs
  • Status Updates

Links mentioned:


Eleuther ▷ #announcements (1 messages):

  • Mechanistic Anomaly Detection Methods
  • Anomaly Detection Performance
  • Eliciting Latent Knowledge from Language Models
  • Detection of Adversarial Examples

Links mentioned:


Eleuther ▷ #general (36 messages🔥):

  • Open Letter Against SB1047
  • Disputes on AI Safety Act
  • Anthropic's Response to SB1047
  • Philosophical Divide on AI Regulation
  • Research Practices in AI Accountability

Links mentioned:


Eleuther ▷ #research (40 messages🔥):

  • Distributed AI Training
  • Latent Space Search
  • In-Context Learning
  • Evaluation Function Challenges
  • Self-Taught Evaluation

Links mentioned:


Eleuther ▷ #scaling-laws (4 messages):

  • Training Instability
  • Double Descent
  • Learning Rate Adjustment

Eleuther ▷ #interpretability-general (5 messages):

  • SAE Developments
  • Transformer Circuits
  • SAELens Library
  • Scaling Monosemanticity
  • SAE Landscape Overview

Links mentioned:


Eleuther ▷ #lm-thunderdome (8 messages🔥):

  • lm-eval-harness usage
  • Huggingface model compatibility
  • Batch size in loglikelihood_rolling
  • Evaluation harness special tokens
  • Accessing benchmark names in JSON output

Link mentioned: mamba/evals/lm_harness_eval.py at main · state-spaces/mamba: Mamba SSM architecture. Contribute to state-spaces/mamba development by creating an account on GitHub.


LangChain AI ▷ #general (83 messages🔥🔥):

  • Ollama Memory Issues
  • LangGraph Courses
  • Mood2Music App Introduction
  • SQL Chat Agent Assistance
  • Automatic Code Reviews Challenges

Links mentioned:


LangChain AI ▷ #share-your-work (2 messages):

  • Agentgenesis
  • AI code snippets
  • Open source contributions

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

  • GPT-4o-2024-08-06 Release
  • Structured Outputs Issues

Link mentioned: GPT-4o (2024-08-06) - API, Providers, Stats: The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai. Run GPT-4o (2024-08...


OpenRouter (Alex Atallah) ▷ #general (62 messages🔥🔥):

  • Gemini Pro 1.5 Performance
  • New Pricing for Google Gemini
  • OpenRouter API Access
  • Structured Outputs in GPT Models
  • Model Misconfigurations and Limitations

Links mentioned:


LlamaIndex ▷ #announcements (1 messages):

  • CodiumAI Webinar
  • RAG-augmented coding assistants
  • Context-aware code generation

Link mentioned: LlamaIndex Webinar: Using RAG with LlamaIndex for Large-Scale Generative Coding · Zoom · Luma: Retrieval-Augmented Generation (RAG) plays a central role in achieving contextual awareness in AI-generated code, which is crucial for enterprises adopting…


LlamaIndex ▷ #blog (4 messages):

  • Local Multi-Agent System
  • Second RAG-a-thon
  • LlamaIndex Workflows
  • Documentation for llama-agents

LlamaIndex ▷ #general (49 messages🔥):

  • HuggingFace Inference API for embeddings
  • LlamaParse Arabic language parsing
  • SimpleDirectoryReader PDF loading behaviors
  • Vector DB comparison sharing
  • LlamaIndex function calling issue

Links mentioned:


Cohere ▷ #discussions (29 messages🔥):

  • Hallucination Index
  • Command R Plus licensing debate
  • Open source vs Open weights
  • Mistral models

Links mentioned:


Cohere ▷ #questions (3 messages):

  • Contacting Dennis Padilla
  • Lauren's absence

Cohere ▷ #cohere-toolkit (1 messages):

  • Cohere Toolkit
  • LLM with RAG
  • 3rd Party API integration
  • Command Models

Modular (Mojo 🔥) ▷ #mojo (30 messages🔥):

  • InlineList functionality
  • Small buffer optimization in Lists
  • Using custom accelerators with Mojo
  • Integration of CXL with FPGA
  • Future of compiler support for RISC-V

Links mentioned:


LAION ▷ #general (18 messages🔥):

  • John Schulman's move to Anthropic
  • Open-source AI challenges
  • Meta's JASCO disappearance
  • Nullbulge dox controversy
  • School BUD-E voice assistant

Links mentioned:


LAION ▷ #research (8 messages🔥):

  • 273k Model Experimentation
  • CIFAR Images in FTT Analysis

Link mentioned: The Matrix Laurence Fishburne GIF - The matrix Laurence fishburne Morpheus - Discover & Share GIFs: Click to view the GIF


tinygrad (George Hotz) ▷ #general (8 messages🔥):

  • Tinygrad on Aurora
  • XMX Support
  • FP8 NVIDIA Bounty

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (16 messages🔥):

  • Contiguous Buffers
  • JIT and Batch Sizes
  • Computer Algebra Study Notes
  • CLANG and LLVM Threading

Link mentioned: computer-algebra-study-notes/README.md at main · mesozoic-egg/computer-algebra-study-notes: Contribute to mesozoic-egg/computer-algebra-study-notes development by creating an account on GitHub.


DSPy ▷ #show-and-tell (6 messages):

  • Wiseflow
  • HybridAGI
  • Dynamic Knowledge Base

Links mentioned:


DSPy ▷ #papers (2 messages):

  • Language Models in Software Engineering
  • Inference Compute Scaling
  • LLM-based Agents
  • DeepSeek Performance

Links mentioned:


DSPy ▷ #general (7 messages):

  • MIPRO vs BootstrapFewShotWithRandomSearch
  • MIPROv2 assertions
  • Complexity in approaches

DSPy ▷ #colbert (1 messages):

gamris: Would you recommend FastEmbed by Qdrant instead? https://github.com/qdrant/fastembed


OpenAccess AI Collective (axolotl) ▷ #general (7 messages):

  • Synthetic Data Generation
  • SQL Examples with Llama Index
  • Lora Adapter MD5 Consistency
  • Bitsandbytes Multi Backend Refactor

Link mentioned: (WIP) Multi backend refactor -> main (full diff of all already merged PRs) by Titus-von-Koeller · Pull Request #1220 · bitsandbytes-foundation/bitsandbytes: This PR to main serves the purpose to keep an overview of all the extensive changes that have been introduced to multi-backend-refactor to the iterative PRs around this topic. We will eventually me...


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (5 messages):

  • Gemma 2 27B QLoRA tweaks
  • L40S GPU training performance
  • Astral UV package

Link mentioned: GitHub - astral-sh/uv: An extremely fast Python package installer and resolver, written in Rust.: An extremely fast Python package installer and resolver, written in Rust. - astral-sh/uv


OpenAccess AI Collective (axolotl) ▷ #general-help (3 messages):

  • Context Length Adjustment
  • RoPE Scaling for Fine-Tuning

OpenAccess AI Collective (axolotl) ▷ #announcements (1 messages):

caseus_: Office hours kicks off in an hour in <#1268285745555308649>.


Torchtune ▷ #announcements (1 messages):

  • PPO training recipe
  • Qwen2 support
  • Feature requests

Torchtune ▷ #general (9 messages🔥):

  • Llama 3 Model Support
  • Model Downloading Issues
  • Prompt Formatting with Llama 3
  • Llama 3 Instruct vs. Base Models

Torchtune ▷ #dev (6 messages):

  • Model Index Page
  • Refactored PreferenceDataset

Link mentioned: [4/n] Refactor preference dataset with transforms design by RdoubleA · Pull Request #1276 · pytorch/torchtune: Context Following the RFC in #1186, we will use the unified message_transform -> template -> tokenization data pipeline in all our datasets. This PR updates PreferenceDataset to follow t...


OpenInterpreter ▷ #general (9 messages🔥):

  • Open Interpreter Setup
  • Open Interpreter Security
  • Python Compatibility
  • Error Resolution
  • Open Source Vision Models

OpenInterpreter ▷ #O1 (2 messages):

  • Ollama Model List
  • Deepgram Support

Link mentioned: open-interpreter/docs/language-models/local-models/ollama.mdx at main · OpenInterpreter/open-interpreter: A natural language interface for computers. Contribute to OpenInterpreter/open-interpreter development by creating an account on GitHub.


Mozilla AI ▷ #announcements (2 messages):

  • Llamafile project updates
  • Community survey for feedback
  • sqlite-vec release party
  • Machine Learning Paper Talks
  • AMA with Local AI maintainer

Link mentioned: Discover Typeform, where forms = fun): Create a beautiful, interactive form in minutes with no code. Get started for free.


MLOps @Chipro ▷ #events (1 messages):

  • LinkedIn Engineering
  • ML Platform Transformation




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}