State of AI 2024

**Nathan Benaich's State of AI Report** in its 7th year provides a comprehensive overview of AI research and industry trends, including highlights like **BitNet** and the synthetic data debate. **Cerebras** is preparing for an IPO, reflecting growth in AI compute. A hackathon hosted by **Daily** and the **Pipecat** community focuses on conversational voice AI and multimodal experiences with $20,000 in prizes. Nobel Prizes in Physics and Chemistry were awarded for AI research: **Geoffrey Hinton** and **John Hopfield** for neural networks and statistical mechanics, and **Demis Hassabis**, **John Jumper**, and **David Baker** for AlphaFold and protein structure prediction. **Meta** released **Llama 3.2** with multimodal capabilities, accompanied by educational resources and performance updates. *"This recognizes the impact of deep neural networks on society"* and *"tremendous impact of AlphaFold and ML-powered protein structure prediction"* were noted by experts.

AI News for 10/9/2024-10/10/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (231 channels, and 2109 messages) for you. Estimated reading time saved (at 200wpm): 267 minutes. You can now tag @smol_ai for AINews discussions!

It is the season of annual perspectives, whether it is SWE-bench's first year (celebrated by MLE-bench today) or Sequoia's third year, or a16z's 2nd anniversary of being cooked by roon, but the big dog here is Nathan Benaich's State of AI Report, now in year 7.

https://www.youtube.com/watch?v=CyOL_4K2Nyo

AI Engineers will probably want to skip the summaries and go straight to the slides, which recap topics we cover in this newsletter but in one place, though you'll have to dig a little to find references:

image.png

The Research and Industry sections will be most relevant, with useful 1-slide summaries of the must-know research of the year, like BitNet (our coverage here):

image.png

and even-handed presentations of both sides of the synthetic data debate:

image.png

Some of the coverage, though, is perhaps too uncritical, accepting bold claims at face value.

As Cerebras shapes up for IPO, the Compute Index is a nice proxy for why this is the first of its cohort to finally emerge:

image.png

as well as good recaps of the funding landscape:

image.png

image.png


Brought to you by Daily: If you’re interested in conversational voice AI (and video, too), join the team at Daily and the Open Source Pipecat community for a hackathon in San Francisco on October 19th and 20th. $20,000 in prizes for the best voice AI agents, virtual avatar experiences, UIs for multi-modal AI, art projects, and whatever else we dream up together.

Swyx commentary: They just announced that Cartesia (my fave TTS recently) and GCP have joined as sponsors AND Product Hunt is hosting a remote track as well! If you ever wanted to do anything voice + video this is the place to be next weekend!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Nobel Prizes in Physics and Chemistry Awarded for AI Research

New AI Model Releases and Updates

AI Development and Research

AI Tools and Applications

AI Industry and Market Trends

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Large Language Model Releases: Behemoth 123B

Theme 2. Nvidia RTX 5090: Pricing Strategy and VRAM Concerns

Theme 3. Voice Assistant Development with Llama 3

Theme 4. ARIA: New Open Multimodal Native Mixture-of-Experts Model

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Research and Breakthroughs

AI Safety and Ethics Concerns

AI Industry Developments

Broader Implications and Discussions


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. Fine-Tuning Models Enhances Performance

Theme 2. Launch and Integration of New AI Models

Theme 3. Advancements in AI Audio and Podcast Creation

Theme 4. Hardware Optimization for Enhanced AI Performance

Theme 5. AI Ethics and Community Trends


PART 1: High level Discord summaries

Notebook LM Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


GPU MODE Discord


Eleuther Discord


Perplexity AI Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Cohere Discord


Interconnects (Nathan Lambert) Discord


Stability.ai (Stable Diffusion) Discord


Modular (Mojo 🔥) Discord


LM Studio Discord


OpenAccess AI Collective (axolotl) Discord


LlamaIndex Discord


Latent Space Discord


LLM Agents (Berkeley MOOC) Discord


Torchtune Discord


OpenInterpreter Discord


LAION Discord


DSPy Discord


LangChain AI Discord


tinygrad (George Hotz) Discord


Mozilla AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


AI21 Labs (Jamba) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Notebook LM Discord ▷ #use-cases (81 messages🔥🔥):

  • NotebookLM audio generation
  • User experiences with NotebookLM
  • Research applications of NotebookLM
  • Critical perspectives on AI-generated content
  • Podcast content creation with NotebookLM



Notebook LM Discord ▷ #general (296 messages🔥🔥):

  • Audio Podcast Issues
  • Generating Content with Sources
  • NotebookLM Features and Functionality
  • User Experiences with NotebookLM
  • AI and Language Support



Unsloth AI (Daniel Han) ▷ #general (193 messages🔥🔥):

  • Fine-tuning models
  • RAM usage
  • Handling long prompts
  • Support for multimodal models
  • User interfaces for LLMs



Unsloth AI (Daniel Han) ▷ #off-topic (1 messages):

dr13x3: https://karpathy.ai/blog/calculator.html


Unsloth AI (Daniel Han) ▷ #help (126 messages🔥🔥):

  • Qwen 2.5 fine-tuning
  • Eval memory issues
  • Dataset context lengths
  • RAG for LLMs
  • Model evaluation strategies



Unsloth AI (Daniel Han) ▷ #research (6 messages):

  • Chain-of-Thought (CoT) reasoning
  • Decoding processes in LLMs
  • Speed of evolution in AI research

Link mentioned: Chain-of-Thought Reasoning Without Prompting: In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT) promptin...


OpenAI ▷ #ai-discussions (128 messages🔥🔥):

  • AGI Development
  • OpenAI's Advanced Voice Model
  • AI Evolution and Training
  • OpenAI and Apple Partnership
  • Mistral AI's Advances

OpenAI ▷ #gpt-4-discussions (6 messages):

  • Vision O1 Release
  • Custom GPT Models
  • Chat Popup Notifications

OpenAI ▷ #prompt-engineering (37 messages🔥):

  • Character Development and AI Interaction
  • Community Support in AI Prompting
  • Academic Research on RAG Systems

OpenAI ▷ #api-discussions (37 messages🔥):

  • ChatGPT prompt improvement
  • User ban from server
  • Academic research insights
  • Roleplaying character descriptions

OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

  • MythoMax API
  • MythoMax performance

Link mentioned: Tweet from OpenRouter (@OpenRouterAI): Just launched a free API endpoint for MythoMax 🎁 Powered by @togethercompute Lite (with int4 quantization). Why MythoMax is great 👇 Quoting OpenRouter (@OpenRouterAI) A reminder that MythoMax, ...


OpenRouter (Alex Atallah) ▷ #general (162 messages🔥🔥):

  • NotebookLM Deep Dive podcast
  • Gemini moderation concerns
  • Automated podcast apps
  • Claude model issues
  • Grok model integration



HuggingFace ▷ #announcements (1 messages):

  • TTS Spaces Arena
  • Llama 3.1 Benchmarking
  • FluxBooru Demo
  • Fine-tuning Whisper
  • 7 Million Wikipedia Images

HuggingFace ▷ #general (127 messages🔥🔥):

  • Model Card Transparency
  • Image Interpretation Models
  • Continuous Fine-tuning Techniques
  • Role of RAG in Chatbots
  • Inference Speed Optimization



HuggingFace ▷ #today-im-learning (8 messages🔥):

  • LoRA Training
  • Deep Learning Refresh
  • Maintaining Consistency
  • Self-Created Review Questions

Link mentioned: The Novice's LLM Training Guide: Written by Alpin. Inspired by /hdg/'s LoRA train rentry. This guide is being slowly updated. We've already moved to the axolotl trainer. The Basics The Transformer architecture Training Basics Pre-train...


HuggingFace ▷ #cool-finds (3 messages):

  • Scade Forms UI
  • Microsoft Collection Inquiry
  • Masakhane Dataset Release

Link mentioned: masakhane/afrimmlu · Datasets at Hugging Face


HuggingFace ▷ #i-made-this (13 messages🔥):

  • Llama 3.1 Inference Benchmark
  • TTS Arena API Integration
  • RTL SDR with Whisper
  • Reinforcement Learning for Crypto Market Making
  • Pre-Commit Hooks for IPython Notebooks



HuggingFace ▷ #core-announcements (1 messages):

  • CogVideoX-Factory
  • Memory Efficient Training
  • Finetuning Scripts

Link mentioned: GitHub - a-r-r-o-w/cogvideox-factory: Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed


HuggingFace ▷ #NLP (4 messages):

  • Using Intel CPUs for NLP
  • BentoML for Pipelines
  • Hosting Open Source LLMs

Link mentioned: GitHub - bentoml/BentoTGI: Contribute to bentoml/BentoTGI development by creating an account on GitHub.


HuggingFace ▷ #diffusion-discussions (3 messages):

  • SDXL Color ControlNet
  • T2I Adapter - Color
  • Img2Img Pipeline with SDXL
  • High Denoising Strength

Link mentioned: TencentARC/t2iadapter_color_sd14v1 · Hugging Face


GPU MODE ▷ #triton (9 messages🔥):

  • TMA interface refactoring
  • Issue with GEMM implementation
  • Passing descriptors
  • Compatibility with torch.compile



GPU MODE ▷ #torch (4 messages):

  • TorchDynamo APIs
  • torch.compile modes
  • Triton TMA descriptors

Link mentioned: TorchDynamo APIs for fine-grained tracing — PyTorch 2.4 documentation


GPU MODE ▷ #algorithms (1 messages):

  • Scaling Inference-Time Computation
  • LLM Performance Improvement

Link mentioned: Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters: Enabling LLMs to improve their outputs by using more test-time computation is a critical step towards building generally self-improving agents that can operate on open-ended natural language. In this ...


GPU MODE ▷ #cool-links (1 messages):

majormelancholy: https://github.com/microsoft/vptq


GPU MODE ▷ #youtube-recordings (1 messages):

marksaroufim: https://www.youtube.com/watch?v=BmJSIDLoP4s


GPU MODE ▷ #torchao (69 messages🔥🔥):

  • torch.compile issues
  • int8 quantization performance
  • ComfyUI limitations
  • torch.export errors
  • comparison with diffusers



GPU MODE ▷ #off-topic (3 messages):

  • Recovery from Injury
  • Creative Military Ration Recipes

GPU MODE ▷ #llmdotc (17 messages🔥):

  • Understanding floatX
  • Dependencies in programming
  • Learning GitHub
  • Editor functionalities
  • Training file development

GPU MODE ▷ #intel (5 messages):

  • Intel's collaboration with external companies
  • Coldplay acquisition by Intel

GPU MODE ▷ #bitnet (1 messages):

  • Llama3.2-1B
  • Finetuning Evaluation
  • Fineweb Dataset

GPU MODE ▷ #liger-kernel (5 messages):

  • Merge CI changes
  • Ignore index functionality

GPU MODE ▷ #self-promotion (5 messages):

  • Llama 3.1 Benchmark
  • GEMM Kernel Development
  • Matrix Multiplication Performance
  • Hierarchical Tiling Techniques



GPU MODE ▷ #smol-binaries (3 messages):

  • Removing libtorch dependency
  • Lean library for torch inductor AOT
  • Weekend project excitement

Eleuther ▷ #general (57 messages🔥🔥):

  • Transition from Crypto to AI
  • Web5 Discussions
  • Paper Writing Resources
  • Recruiter Trends in Tech
  • LUMI Performance Queries



Eleuther ▷ #research (34 messages🔥):

  • Llama 8bn 3.1 performance on MATH dataset
  • Tuning and training techniques
  • Understanding Rotary Positional Encodings
  • Image encoders in Llama 3.2 vision models
  • Benchmarks for reasoning improvement in LLMs



Eleuther ▷ #lm-thunderdome (6 messages):

  • Model Inference Abstraction
  • Dependency-Agnostic Design
  • Framework Diversity
  • Evaluation Code Isolation
  • JAX Dependencies Concerns

Perplexity AI ▷ #general (84 messages🔥🔥):

  • Changes in Perplexity Response Quality
  • AI Video Generation Expectations
  • Perplexity API and Cost Considerations
  • User Experience Issues
  • AI Models for Various Tasks



Perplexity AI ▷ #sharing (4 messages):

  • Fraud Detection Framework
  • Lun Nituite Jia
  • AI Reasoning Framework
  • GPU Driver Updates

Perplexity AI ▷ #pplx-api (6 messages):

  • Return Citations
  • Account Details Request
  • Exa vs. Perplexity AI
  • API Documentation

aider (Paul Gauthier) ▷ #general (53 messages🔥):

  • SambaNova Integration with Aider
  • Architect Prompt Updates
  • AI Code Optimization Challenges
  • Benchmarking Aider
  • Palmyra X 004 Release



aider (Paul Gauthier) ▷ #questions-and-tips (28 messages🔥):

  • Aider Workflow Issues
  • Configuring Aider
  • Model Limitations in Aider
  • Using Aider with Git
  • Multithreading in Aider



aider (Paul Gauthier) ▷ #links (7 messages):

  • Function Calling in Smaller Models
  • Deno 2 Announcements
  • Jupyter Support in Deno 2
  • JavaScript and TypeScript Ecosystem
  • Deno Notebook Features



Nous Research AI ▷ #general (38 messages🔥):

  • O1 Replication Journey
  • New Training Strategy Proposal
  • Computational Aesthetics in Neural Networks
  • Dynamic Regularization through Cellular Automata
  • Innovations in LLM Performance



Nous Research AI ▷ #ask-about-llms (19 messages🔥):

  • Getting Started with RAG
  • Understanding Entropix Weights
  • RoPE Application in Attention
  • Embed Chain Recommendations
  • RAGtouille Library

Nous Research AI ▷ #research-papers (9 messages🔥):

  • Pyramid Flow Video Generation
  • Recurrent Neural Networks and Long Context Processing
  • Model Merging Challenges
  • Chain-of-Thought Reasoning
  • O1 Replication Journey



Nous Research AI ▷ #interesting-links (1 messages):

teknium: https://github.com/huggingface/trl/issues/2175


Nous Research AI ▷ #research-papers (9 messages🔥):

  • Pyramid Flow Video Generation
  • Model Merging at Scale
  • Chain-of-Thought Reasoning
  • Long Context RNNs



Cohere ▷ #discussions (15 messages🔥):

  • 2024 Nobel Prize in Literature
  • Attention Is All You Need
  • Community Reactions
  • Twitter Rumors

Link mentioned: Tweet from Omar Sanseviero (@osanseviero): BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Literature to the Attention Is All You Need authors. Their work has made thousands cry, laugh, or ric...


Cohere ▷ #questions (2 messages):

  • Google Drive connection issues
  • Support request

Cohere ▷ #projects (58 messages🔥🔥):

  • AI Emotional Processing
  • Therapeutic AI Tools
  • Challenges in AI Conversations
  • Ethical Concerns of Companion AIs
  • Personal AI Projects

Interconnects (Nathan Lambert) ▷ #news (13 messages🔥):

  • LMSYS becomes a company
  • Aria - Multimodal MoE release
  • Comparisons with Molmo paper

Link mentioned: Tweet from Vaibhav (VB) Srivastav (@reach_vb): 🚨 @rhymes_ai_ released Aria - Multimodal MoE (3.9B active), 64K tokens, caption 256 frames in 10 sec, Apache 2.0 licensed! Beats GPT4o & Gemini Flash ⚡ > 3.9B Active, 25.3B Total parameters >...


Interconnects (Nathan Lambert) ▷ #ml-questions (8 messages🔥):

  • o1 reasoning tree
  • Research Paths in AI
  • Robotics
  • World Models
  • Brain MRI Decoding

Interconnects (Nathan Lambert) ▷ #ml-drama (7 messages):

  • Blocked UX
  • Social Media Dynamics
  • Notable Figures in AI
  • Anonymous Contributions

Interconnects (Nathan Lambert) ▷ #memes (19 messages🔥):

  • ButtBench Alignment Project
  • SuperAlignment lead at ai2
  • Posting limits in industry
  • Public Relations strategies
  • Social Media Engagement



Interconnects (Nathan Lambert) ▷ #posts (5 messages):

  • Setting Up Systems
  • Andrew's Influence

Stability.ai (Stable Diffusion) ▷ #general-chat (47 messages🔥):

  • Deforum usage alternatives
  • CogVideoX for video generation
  • Flux model installation
  • Product recreation with AI
  • KJNodes for Comfy UI



Modular (Mojo 🔥) ▷ #mojo (46 messages🔥):

  • Rust Provenance APIs
  • io_uring and legacy issues
  • Distributed Shared Memory
  • CXL and memory interconnects
  • RDMA mechanisms



LM Studio ▷ #general (23 messages🔥):

  • LMX vs. GGUF Performance
  • GPU Acceleration Configuration
  • Llama 3.2 Model Support
  • Old CUDA Versions
  • Model Rankings

Link mentioned: Tweet from ollama (@ollama): We'll be showing an early preview of Meta's llama 3.2 vision tonight! 😍 https://lu.ma/h9i1lkh5


LM Studio ▷ #hardware-discussion (21 messages🔥):

  • NVIDIA RTX 4000 Series
  • AVX2 CPU Usage in VMs
  • M3 Pro vs PC Performance
  • RAM Limitations in Laptops
  • MacBook Pricing in EU vs US

OpenAccess AI Collective (axolotl) ▷ #general (1 messages):

aleksagordic: <@257999024458563585> dm-ed you something 🙌


OpenAccess AI Collective (axolotl) ▷ #general-help (2 messages):

  • 10xA100 rental
  • 10xH100 rental
  • GPU host options

OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (39 messages🔥):

  • Hugging Face Authentication
  • Axolotl Multi-GPU Usage
  • Config File Issues
  • Hugging Face CLI in Jupyter
  • Using Environment Variables for Tokens



LlamaIndex ▷ #blog (3 messages):

  • LlamaIndex voice agent demo
  • Argilla's data quality tool
  • AI Builders Night event
  • Zoom Developers meetup
  • Multi-agent systems



LlamaIndex ▷ #general (24 messages🔥):

  • AWS Bedrock
  • Qdrant Database Ingestion
  • Hugging Face Inference API
  • Embedding Issues in Documentation



Latent Space ▷ #ai-general-chat (21 messages🔥):

  • Sierra's Valuation
  • UGround Visual Grounding
  • State of AI Report 2024
  • AMD AI Chips
  • New AI Model from Writer



Latent Space ▷ #ai-announcements (1 messages):

  • Molmo
  • Pixmo
  • Zoom meetings

Link mentioned: Join our Cloud HD Video Meeting: Zoom is the leader in modern enterprise video communications, with an easy, reliable cloud platform for video and audio conferencing, chat, and webinars across mobile, desktop, and room systems. Zoom ...


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (9 messages🔥):

  • Hackathon Submissions
  • Lab Download Issues
  • RAG Framework Preferences
  • Web Browser Agents

LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (2 messages):

  • Brainstorming
  • Channel Collaboration

Torchtune ▷ #general (1 messages):

  • Small Model Training
  • Mixed Optimizations
  • Seed Stage Ideas

Torchtune ▷ #dev (10 messages🔥):

  • SOAP Optimizer
  • Distributed Optimizers
  • Entropix



OpenInterpreter ▷ #general (6 messages):

  • OS Mode for MacOS
  • AI Agent in Terminal
  • Calypso AI Streamer



OpenInterpreter ▷ #O1 (1 messages):

  • ElevenLabs Creator Plan Costs

LAION ▷ #research (4 messages):

  • Vision-Language Intelligence
  • Visual Autoregressive Modeling
  • Matryoshka Diffusion Models
  • Coarse to Fine Autoregression
  • Image-to-Vector-to-Image Autoencoder

DSPy ▷ #general (3 messages):

  • DOTS algorithm
  • Dynamic reasoning
  • DSPy framework
  • 24 game script
  • Reasoning actions in LLMs

Link mentioned: DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search: Enhancing the capability of large language models (LLMs) in reasoning has gained significant attention in recent years. Previous studies have demonstrated the effectiveness of various prompting strate...


LangChain AI ▷ #share-your-work (1 messages):

  • AI Chat Assistant
  • Piñata Challenge



tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

msd6921: Anyone here ever used diffusers from Hugging Face here?


Mozilla AI ▷ #announcements (1 messages):

  • Llama 3.2 Release
  • Mozilla Accelerator Program
  • Lumigator MVP

Link mentioned: Mozilla’s new accelerator aims to support small, open-source AI: The org wants to highlight models that might not otherwise see funding.


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

  • BFCL-V3
  • Gorilla LLM optimization
  • multi-round conversations

AI21 Labs (Jamba) ▷ #jamba (1 messages):

  • Hugging Face model AI21-Jamba-1.5-Mini
  • CUDA error in Docker
  • torch.multiprocessing




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}