Frozen AI News archive

not much happened today

**Geoffrey Hinton** and **John Hopfield** won the **Nobel Prize in Physics** for foundational work on neural networks linking AI and physics. **Meta AI** introduced a **13B-parameter audio generation model** as part of Meta Movie Gen for video-synced audio. **Anthropic** launched the **Message Batches API**, which enables asynchronous processing of up to 10,000 queries at half the cost. **Together Compute** released **Flux Schnell**, free to use for 3 months. **rohanpaul_ai** highlighted new techniques such as **PrefixQuant** quantization and **Prompt Caching** for low-latency inference. **LangGraph** added long-term memory support for persistent document storage. The **Hex-LLM** framework was introduced for low-cost, high-throughput, TPU-based serving of LLMs from Hugging Face models. Discussions on AI safety emphasized gender equality in science, and some raised concerns about premature AI regulation by media and Hollywood.

Canonical issue URL

AI News for 10/8/2024-10/9/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (228 channels, and 1872 messages) for you. Estimated reading time saved (at 200wpm): 222 minutes. You can now tag @smol_ai for AINews discussions!

Just a smattering of smol stories today:


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Advancements and Industry News

AI Engineering and Development

AI Ethics and Societal Impact

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Continuous Finetuning: A Novel Approach to Enhancing LLM Performance

Theme 2. vLLM Outperforms llama.cpp in Distributed Inference Benchmarks

Theme 3. Microsoft's Differential Transformer: A Breakthrough in LLM Attention

Theme 4. Inflection AI Expands with New Models and Enterprise Offerings

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Research and Breakthroughs

AI Model Releases and Improvements

Industry Developments

Expert Opinions and Predictions

AI-Generated Content and Tools


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. Advanced AI Model Performance and Optimization

Theme 2. Infrastructure and Hardware Support for AI

Theme 3. Challenges in Training and Fine-tuning AI Models

Theme 4. Data Management, Security, and Scalability

Theme 5. AI Tools, Integrations, and Community Research

O1-preview

Theme 1. AI Model Advancements and Releases

Theme 2. AI Tools and Integration Challenges

Theme 3. AI in Research and Recognition

Theme 4. AI for Creative and Emotional Engagement

Theme 5. Technical Challenges and Solutions in AI Development


PART 1: High level Discord summaries

LM Studio Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


Cohere Discord


aider (Paul Gauthier) Discord


Interconnects (Nathan Lambert) Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


GPU MODE Discord


OpenAI Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


LlamaIndex Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


OpenAccess AI Collective (axolotl) Discord


Torchtune Discord


LangChain AI Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


LAION Discord


DSPy Discord


DiscoResearch Discord


Gorilla LLM (Berkeley Function Calling) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LM Studio ▷ #general (204 messages🔥🔥):

  • Llama 3.2 Inquiry
  • MLX Model Issues
  • Model Accessibility
  • New Features in LM Studio 0.3.4
  • Quantization Model Concerns


LM Studio ▷ #hardware-discussion (30 messages🔥):

  • Using dual GPUs
  • Performance of RTX 3060 and RX 6600
  • R9 7900X CPU performance
  • AVX2 support in VMs
  • Difference in GPU architectures

Unsloth AI (Daniel Han) ▷ #general (200 messages🔥🔥):

  • Model Merging at Scale
  • Fine-tuning Qwen 2.5
  • Unsloth AI and Dataset Formats
  • Instruct vs. Base Models
  • Hugging Face Datasets


Unsloth AI (Daniel Han) ▷ #help (19 messages🔥):

  • Colab gguf file download struggles
  • Using logits with Ollama Llama
  • Continued pretraining of Llama 3.2 3b
  • AMD GPU limitations with Unsloth
  • Fine-tuning with Unsloth FastLanguageModel

HuggingFace ▷ #announcements (1 message):

  • Nvidia models
  • Meta's VLMs
  • Hugging Face Accelerate 1.0
  • ColPali multimodal retrieval
  • Paper Central


HuggingFace ▷ #general (134 messages🔥🔥):

  • Model Performance Comparison
  • Mira Network Discussion
  • Model Specificity in Use Cases
  • TensorFlow Issues
  • Python Community Q&A


HuggingFace ▷ #today-im-learning (9 messages🔥):

  • Hierarchical Generation
  • Image Autoencoder Integration
  • Differences in Model Types
  • Hugging Face Metrics Implementation

Link mentioned: coupling generation and compression


HuggingFace ▷ #cool-finds (1 message):

  • Scade tools
  • Comfy-Flux integration
  • Custom image generations


HuggingFace ▷ #i-made-this (4 messages):

  • VividNode Updates
  • Burnout in Tech Creators
  • Fine-tuning Whisper for ATC
  • FluxBooru-CFG3.5

Link mentioned: Release v1.4.0 · yjg30737/pyqt-openai: What's Changed Add is_g4f and g4f_platform fields to message table, remove old text Fix problems related to recent update, rename file Move every function in globals.py to utils.py for better org...


HuggingFace ▷ #NLP (8 messages🔥):

  • ONNX conversion of T5 models
  • Exploratory analysis of legal documents
  • Big data technologies discussion
  • Validation of LLM outputs
  • Server setup for Hugging Face pipelines


HuggingFace ▷ #diffusion-discussions (8 messages🔥):

  • Image Quality in Diffusion Models
  • Flux Loras and GGUF Training
  • Training Diffusion Models with Diffusers

Cohere ▷ #discussions (38 messages🔥):

  • Temperature settings for CMD-R
  • JSON schema discussions
  • Introduction of new members
  • Nobel Prize speculation
  • HumanEval and QA processes

Link mentioned: Tweet from Omar Sanseviero (@osanseviero): BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Literature to the Attention Is All You Need authors. Their work has made thousands cry, laugh, or ric...


Cohere ▷ #questions (36 messages🔥):

  • Cohere API Issues
  • System Role Formatting
  • ETL Pipeline for RAG
  • Zero Data Retention for LLMs


Cohere ▷ #api-discussions (46 messages🔥):

  • Cohere API usage
  • RAG solution for banks
  • Embedding model performance
  • Trial key limitations

Cohere ▷ #projects (29 messages🔥):

  • Emotional State Machine
  • Emotion Propagation
  • AI and Mental Health
  • Emotion in Voice AI

aider (Paul Gauthier) ▷ #general (70 messages🔥🔥):

  • Aider and Model Integration
  • Using OpenRouter with Aider
  • Feedback on Architect Mode
  • Community Discussions on LLMs
  • Issues with Aider Functionality


aider (Paul Gauthier) ▷ #questions-and-tips (53 messages🔥):

  • Aider Usage Queries
  • Model and Mode Configuration
  • File Handling in Aider
  • Architect Mode Feedback
  • Performance Optimizations


aider (Paul Gauthier) ▷ #links (7 messages):

  • OpenAI Realtime Console
  • Cline AI Assistant v2.0
  • Firefox Security Update


Interconnects (Nathan Lambert) ▷ #news (44 messages🔥):

  • 2024 Nobel Prize in Chemistry
  • Nato's academic background
  • LMSYS becoming a company
  • Editing Google Scholar
  • Challenges in energy sciences


Interconnects (Nathan Lambert) ▷ #ml-questions (27 messages🔥):

  • PRMs/Verifiers in ML
  • Patents in ML space
  • Alternatives to PRMs
  • Transparency in ML research

Interconnects (Nathan Lambert) ▷ #ml-drama (16 messages🔥):

  • AI risks discussion featuring prominent figures
  • Nobel Prize controversy in AI research
  • ICLR 2025 review process changes
  • Schmidhuber's critique on attribution in AI
  • Social media reactions to discipline-related insights


Interconnects (Nathan Lambert) ▷ #memes (16 messages🔥):

  • ButtBench Alignment Project
  • SuperAlignment lead at AI2
  • Industry vs Community
  • Lucas Beyer's PR Tactics
  • Allennlp account management


Interconnects (Nathan Lambert) ▷ #reads (3 messages):

  • Data Wall in AI
  • Brute-force approach for AI development
  • Human reasoning vs AI data requirements
  • Efficiency in AI models

Link mentioned: The real data wall is billions of years of evolution: Careful with those human analogies


Interconnects (Nathan Lambert) ▷ #posts (15 messages🔥):

  • RoboNato vs RealNato
  • OLMo fine-tuning for content
  • NotebookLM concept

Perplexity AI ▷ #general (100 messages🔥🔥):

  • Perplexity AI Profitability
  • Complexity Extension for Perplexity
  • Changes in Perplexity AI Responses
  • Future of Collections and Spaces
  • Access to Pro Features


Perplexity AI ▷ #sharing (12 messages🔥):

  • Meta's Movie Generator
  • Nobel Prize in Physics
  • AI Automation in Ports
  • AI Evaluation by Braintrust
  • 2024 Summer Olympics


Perplexity AI ▷ #pplx-api (4 messages):

  • Citation API Whitelisting
  • API Credit Purchase Issues
  • Invoice Company Details Update
  • Declined Card for API Access

Stability.ai (Stable Diffusion) ▷ #general-chat (108 messages🔥🔥):

  • ControlNet Models
  • Flux Inpainting
  • Kaggle Notebooks for Automatic1111
  • Distilled CFG Explained
  • Deforum Usage Alternatives


Eleuther ▷ #general (86 messages🔥🔥):

  • Nobel Prizes in AI and Chemistry
  • PhD Course Competition and Metrics
  • Web3 and Web5 Discussions
  • Publications and Research Collaboration
  • Current Topics in Chess and AI


Eleuther ▷ #research (7 messages):

  • Weight Normalization in Models
  • Gradient Initialization Techniques
  • Power-Laws in Gradient Descent


Eleuther ▷ #scaling-laws (6 messages):

  • Scaling Laws Overview
  • Kaplan's Scaling Laws
  • Data and Model Size Relationship

Eleuther ▷ #lm-thunderdome (3 messages):

  • 0-shot COT model releases
  • Evaluation implementation details
  • JAX libraries and implementations

Eleuther ▷ #multimodal-general (1 message):

tensor_kelechi: What are the best lightweight VLMs?


GPU MODE ▷ #general (7 messages):

  • HBM vs SRAM scaling
  • 3D Stacking Solutions
  • Memory Architecture in AI
  • Manufacturing Difficulties
  • Rotary Embeddings CUDA Kernel

GPU MODE ▷ #triton (4 messages):

  • Triton source files
  • GitHub repository structure

Link mentioned: triton/python/triton/ops/blocksparse/matmul.py at 5b29da719daeb3566bfc95b7d02f3561e505bcaf · triton-lang/triton: Development repository for the Triton language and compiler - triton-lang/triton


GPU MODE ▷ #torch (1 message):

  • PyTorch API changes
  • torch._dynamo migration
  • GitHub issue suggestions

Link mentioned: mace/mace/tools/compile.py at 118a514efde34d963666118ce45360e94d648ef5 · ACEsuit/mace: MACE - Fast and accurate machine learning interatomic potentials with higher order equivariant message passing. - ACEsuit/mace


GPU MODE ▷ #beginner (1 message):

vayuda: do macs with m series chips use arm sve instructions


GPU MODE ▷ #pmpp-book (3 messages):

  • 5th Edition Release
  • Special Offers for Existing Users

GPU MODE ▷ #torchao (39 messages🔥):

  • torchao Integration with ComfyUI
  • Float8 Quantization Performance
  • Row-wise vs Column-wise Scaling in FSDP2
  • Quantization Issues on Windows
  • torch.inference_mode Limitations

Link mentioned: GitHub - huggingface/optimum-quanto: A pytorch quantization backend for optimum


GPU MODE ▷ #off-topic (1 message):

vayuda: apparently hinton is the first "pure cs" nobel prize winner


GPU MODE ▷ #llmdotc (15 messages🔥):

  • GPT2 Training Issues
  • Understanding Dependencies in Coding
  • floatX Definition and Usage
  • Using IDE Features Effectively

GPU MODE ▷ #rocm (2 messages):

  • Raspberry Pi 5
  • External GPU setup
  • amdgpu Linux kernel patch
  • 4K gaming performance

Link mentioned: Use an External GPU on Raspberry Pi 5 for 4K Gaming | Jeff Geerling


GPU MODE ▷ #bitnet (1 message):

tiendung: how good is it compare to original method? (need CPU)


GPU MODE ▷ #webgpu (2 messages):

  • Testing WebGPU
  • Browser Automation vs Native Development
  • Resource Management in Playwright

GPU MODE ▷ #liger-kernel (2 messages):

  • FusedLinearJSD Implementation
  • Performance Metrics in High BT

Link mentioned: Add FusedLinearJSD by Tcc0403 · Pull Request #300 · linkedin/Liger-Kernel: Summary similar to the fuse linear CE. It handle the forward and backward pass of the final linear layer via JSD by avoiding the materialization of the large logits tensor. Since JSD is the last la...


GPU MODE ▷ #metal (2 messages):

  • GPU integer operations
  • bfloat16 support on M2

OpenAI ▷ #ai-discussions (75 messages🔥🔥):

  • ChatGPT vs. Claude Subscriptions
  • O1 and O1 Mini Models
  • AI Evolution and Consciousness
  • Routing Models in AI
  • Challenges in AI Development

Link mentioned: Tweet from Mark Johns / Doomlaser (@Doomlaser): My poem about AI, in the form of a nonet. I am not an AI hater, AI is hated and feared by many, Cherished by others who know it well, Which side will win the battle, Of what it is to be, To live diff...


OpenAI ▷ #prompt-engineering (2 messages):

  • ChatGPT rewriting responses
  • Dall-E prompts
  • Canvas feature

OpenAI ▷ #api-discussions (2 messages):

  • ChatGPT Rewriting Response Issue
  • DALL-E Prompts
  • Canvas Feature

Nous Research AI ▷ #general (67 messages🔥🔥):

  • Free compute offer
  • Nobel Prize in Chemistry
  • Lm Studio updates and MLX
  • New pre-training dataset by LLM360
  • Job recruitment practices


Nous Research AI ▷ #ask-about-llms (3 messages):

  • Llama Stack
  • Fast Inference with Llama 3.1-8B
  • Meta's GitHub Releases


Nous Research AI ▷ #research-papers (4 messages):

  • Text to Video Models
  • O1 Replication Journey
  • Model Merging at Scale


Nous Research AI ▷ #interesting-links (1 message):

  • VLM performance timeline
  • Vision-language models
  • Parameter count comparison

Nous Research AI ▷ #research-papers (4 messages):

  • Free text-to-video models
  • O1 Replication Journey
  • Model merging at scale


OpenRouter (Alex Atallah) ▷ #general (71 messages🔥🔥):

  • Prompt Caching
  • Inflection 3.0 and Enterprise
  • OpenRouter API Rate Limits
  • NotebookLM Deep Dive Podcast
  • User Concerns about Gemini Moderation


LlamaIndex ▷ #blog (4 messages):

  • LlamaIndex Workflows tutorial
  • LlamaCloud and LlamaParse demo
  • SFTechWeek meetup
  • OpenAI Realtime API Client demo


LlamaIndex ▷ #general (45 messages🔥):

  • Semantic chunking in TypeScript
  • PropertyGraphIndex extractors
  • Integration issues with LlamaIndex
  • Context chat engine and reranking
  • RAG reducing hallucinations


Latent Space ▷ #ai-general-chat (39 messages🔥):

  • AI girlfriend data breach
  • Sequoia's 3rd annual AI essay
  • Nobel Prize in Chemistry
  • Palmyra X 004 release
  • ChatGPT search rollout


Latent Space ▷ #ai-announcements (1 message):

  • Molmo
  • Pixmo

Link mentioned: Join our Cloud HD Video Meeting: Zoom is the leader in modern enterprise video communications, with an easy, reliable cloud platform for video and audio conferencing, chat, and webinars across mobile, desktop, and room systems. Zoom ...


Modular (Mojo 🔥) ▷ #general (2 messages):

  • DOM Data Attributes
  • WebAssembly Component Model

Link mentioned: GitHub - WebAssembly/component-model: Repository for design and specification of the Component Model


Modular (Mojo 🔥) ▷ #mojo (24 messages🔥):

  • Mojo and Scikit-learn
  • Mojo GPU Support
  • Running ONNX Models in Mojo
  • Drivers for Mojo GPU Usage

Link mentioned: GitHub - yetalit/Mojmelo: Machine Learning algorithms in pure Mojo 🔥


Modular (Mojo 🔥) ▷ #max (5 messages):

  • Performance of Mojo graphs
  • Pre-compiling graphs
  • Reuse of inference sessions
  • mojo run vs compiled binaries
  • Graph input types

LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 message):

  • Lab Assignments Released
  • Course Sign Up
  • Discord for Collaboration
  • Lab Completion Criteria

Link mentioned: Large Language Model Agents


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (16 messages🔥):

  • Lab 1 File Issues
  • Quiz Submission Concerns
  • Course Offering Next Semester
  • Ninja & Legendary Tier Requirements
  • Agent Definition Discussion

LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (10 messages🔥):

  • Role of Reinforcement Learning in AGI
  • Session Q&A Clarifications
  • Live Session Video Confusions
  • Collaborative Assignment Brainstorming


OpenAccess AI Collective (axolotl) ▷ #runpod-help (26 messages🔥):

  • Training Vicuna-7B model
  • CUDA out of memory errors
  • DeepSpeed configuration issues
  • Runpod instance usage


Torchtune ▷ #general (9 messages🔥):

  • Model Scalability Concerns
  • P-value Reporting in ML
  • Implementation of L-mul
  • RL Algorithm Seed Impact
  • Signal vs. Noise in Research

Torchtune ▷ #dev (1 message):

  • SOAP optimizer
  • AdamW learning rate issues
  • NanoGPT speedrunning achievements

Link mentioned: Tweet from Keller Jordan (@kellerjordan0): NanoGPT speedrunning update: Using the SOAP optimizer (https://arxiv.org/abs/2409.11321), @vyasnikhil96 has achieved a new sample efficiency record of 3.28 Fineweb validation loss in 3.25B training to...


Torchtune ▷ #papers (3 messages):

  • Diff Transformer
  • L-Mul Algorithm
  • Floating Point Multiplication Replacement


LangChain AI ▷ #general (9 messages🔥):

  • Memcached support in LangChain
  • LiteLLM prompt caching and streaming
  • Natural language to SQL query limitations
  • SQL chain with models other than GPT 3.5
  • Integrating Livekit with LangChain


OpenInterpreter ▷ #general (8 messages🔥):

  • Mozilla AI open source talk
  • Using --stdin flag confusion
  • LLMs and deterministic outputs
  • Impact of model updates
  • Code outcome variability

OpenInterpreter ▷ #ai-content (1 message):

8i8__papillon__8i8d1tyr: https://www.youtube.com/watch?v=kNj0O7cKCU4


tinygrad (George Hotz) ▷ #general (3 messages):

  • exo on Linux with clang backend
  • Nix package issues
  • Tinygrad debug mode observations
  • Pull Request #6945 for clang
  • auto-casting bf16 to float32

Link mentioned: WIP: autocast bf16 to float32 for clang by 1ntEgr8 · Pull Request #6945 · tinygrad/tinygrad: I hooked the rewriter using the extra_matcher field of the renderer (mimicking PTX). The rewrite rules are not correct (does not perform the shift), will fix soon. I was able to compile and run the...


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

  • Fashion MNIST PR
  • Dataset Suggestions
  • Learning Resources

Link mentioned: added beautiful fashion mnist and example by Kinvert · Pull Request #6961 · tinygrad/tinygrad: People learning tinygrad might want a step in difficulty between MNIST and CIFAR-10. This is what I personally did here to keep learning tinygrad. Might be useful to others. Up to you guys if you w...


LAION ▷ #general (1 message):

  • Hierarchical Generation
  • Stable Cascade Models

Link mentioned: coupling generation and compression


LAION ▷ #research (3 messages):

  • o1-preview Generalization
  • o1-mini Performance
  • AIW Task Issues
  • TruthfulQA Success

DSPy ▷ #show-and-tell (3 messages):

  • The Cat API
  • Cat image fetching tools
  • Cat breeds data


DiscoResearch ▷ #general (1 message):

  • Whisper Turbo German Model
  • Speech Recognition Optimization

Link mentioned: primeline/whisper-large-v3-turbo-german · Hugging Face


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (1 message):

  • Writer's Palmyra-X-004 model
  • DevRel inquiries





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}