Frozen AI News archive

The AI Nobel Prize

**Geoff Hinton** and **John Hopfield** won the **Nobel Prize in Physics** for their work on **Artificial Neural Networks**. The award citation spans **14 pages** highlighting their contributions. **Zep** released a new community edition of their low-latency memory layer for AI agents, emphasizing knowledge graphs for memory. At OpenAI's DevDay, new features like real-time voice API, vision model fine-tuning, and prompt caching with a **50% discount** on reused tokens were introduced. **Anthropic's Claude 3.5 Sonnet** was recognized as the best model currently. **Reka AI Labs** updated their **Reka Flash** model with enhanced multimodal and function calling capabilities. The **GOT (Generic OCR Transformer)** achieved **98.79% accuracy** on OCR benchmarks. Discussions on open-source AI models highlighted their role in fostering competition and decentralization. Software development insights included the importance of Single Sign-On (SSO), thorough testing, and AI-assisted coding workflows. Ethical and societal topics covered critiques of tax policies and the appointment of France's first Minister of AI.

Canonical issue URL

AI News for 10/7/2024-10/8/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (226 channels, and 2556 messages) for you. Estimated reading time saved (at 200wpm): 277 minutes. You can now tag @smol_ai for AINews discussions!

We could talk about the new Differential Transformer paper, or the new AdderLM paper, but who are we kidding, the big story of the day is Geoff Hinton and John Hopfield's Nobel Prize in Physics.

image.png

The 14 page citation covers their greatest hits, while the memes from AI people and reaction from career physicists has been... interesting.

https://youtu.be/dR1ncz-Lozc?feature=shared

Of course, Hopfield is not new to physics prizes.


[Sponsored by Zep]: Zep is a low-latency memory layer for AI agents and assistants. They continuously update their internal graph of user interactions to deliver fast, deterministic fact retrieval. They just released their new community edition; check it out on GitHub!

Swyx commentary: The use of Knowledge Graphs for Memory was one of the hottest topics at the AI Engineer conference - other popular frameworks are also launching "long term memory" support, but this is an open source solution that isn't tied to LangChain, Autogen, et al. Readme includes a lovely FAQ which we love to see. Memory layers seem to be as hot in 2024 as Vector databases were in 2023.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI and Language Models

Software Development and Engineering

AI Ethics and Societal Impact

AI Research and Development

AI Tools and Applications

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Energy-Efficient AI: Addition-Based Algorithm Claims 95% Reduction

Theme 2. Zamba 2: New Mamba-based Models Outperform Larger Competitors

Theme 3. Open WebUI 0.3.31: New Features Rivaling Commercial AI Providers

Theme 4. AntiSlop Sampler: Reducing Repetitive Language in LLM Outputs

Theme 5. Optimizing AI Agents: DSPy and Argilla for Improved Search and Prompts

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Releases and Improvements

AI Research and Techniques

AI Capabilities and Impact

AI Image Generation Techniques


AI Discord Recap

A summary of Summaries of Summaries to us by O1-preview

Theme 1. Cutting-Edge AI Models Unveiled and Explored

Theme 2. Nobel Prize Controversy: AI Meets Physics

Theme 3. Fine-Tuning Frenzy and Optimization Obstacles

Theme 4. GPU Gossip: Hardware Headaches and Hints

Theme 5. AI Tools and APIs: User Triumphs and Trials


PART 1: High level Discord summaries

HuggingFace Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


Eleuther Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Cohere Discord


Latent Space Discord


Nous Research AI Discord


GPU MODE Discord


LlamaIndex Discord


Modular (Mojo 🔥) Discord


Interconnects (Nathan Lambert) Discord


Perplexity AI Discord


DSPy Discord


OpenInterpreter Discord


LLM Agents (Berkeley MOOC) Discord


tinygrad (George Hotz) Discord


LangChain AI Discord


Torchtune Discord


LAION Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The OpenAccess AI Collective (axolotl) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #announcements (1 messages):

  • Nvidia models
  • Meta's VLMs
  • Open Source Hugging Face Accelerate 1.0
  • Video language models
  • ColPali multimodal retrieval

Links mentioned:


HuggingFace ▷ #general (566 messages🔥🔥🔥):

  • LLMs Performance & Limitations
  • Tokenization Importance
  • AI Advancements & Research
  • Connectome Replication in ML
  • GPU Use and Compatibility

Links mentioned:


HuggingFace ▷ #today-im-learning (7 messages):

  • Jailbreaking LLMs
  • Alpaca Dataset and Fine-tuning
  • Model Merging at Scale
  • Google's Research Contributions

Links mentioned:


HuggingFace ▷ #i-made-this (1 messages):

pelolisu: Diffusers Logo


HuggingFace ▷ #computer-vision (2 messages):

  • Extending Character Set in TrOCR

HuggingFace ▷ #NLP (1 messages):

  • T5 model ONNX files
  • Conversion methods to ONNX
  • ONNX export with torch

Link mentioned: Convert Transformers to ONNX with Hugging Face Optimum: no description found


HuggingFace ▷ #diffusion-discussions (6 messages):

  • Image Model Identification
  • Diffusion Models

LM Studio ▷ #announcements (2 messages):

  • LM Studio 0.3.4 release
  • New features in LM Studio
  • Bug fixes in LM Studio

Links mentioned:


LM Studio ▷ #general (227 messages🔥🔥):

  • LM Studio updates
  • MLX Engine introduction
  • Performance comparisons of models
  • Issues with LM Studio
  • User experiences with LLM models

Links mentioned:


LM Studio ▷ #hardware-discussion (81 messages🔥🔥):

  • Linux Resource Usage
  • GPU VRAM Options
  • Multi-GPU Configurations
  • Performance of AMD vs NVIDIA
  • Stable Diffusion Model Efficiency

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (246 messages🔥🔥):

  • Unsloth Studio Release
  • Fine-Tuning Models
  • Model Merging Research
  • Performance of LLMs
  • RAG and Fine-Tuning

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (60 messages🔥🔥):

  • Unsloth functionality on Windows
  • Fine-tuning LLMs for content moderation
  • ShareGPT format in Colab
  • Prompt design for completion tasks
  • Colab resources for model training

Links mentioned:


aider (Paul Gauthier) ▷ #general (157 messages🔥🔥):

  • Aider's Features
  • Embeddings and Semantic Search
  • Message Batches API
  • Free LLM Options
  • Cost Estimations in Aider

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (101 messages🔥🔥):

  • Aider Confusion on API Key Usage
  • Command Line Options for Aider
  • Context Management in Aider
  • Deepseek Model Usage
  • Feedback and Feature Requests

Links mentioned:


aider (Paul Gauthier) ▷ #links (4 messages):

  • Python 3.13 Release
  • Google NotebookLM Podcast

Links mentioned:


Eleuther ▷ #general (36 messages🔥):

  • Nobel Prize in Physics
  • AI research and recognition
  • Hinton and Hopfield's contributions
  • Physics community reactions
  • Model merging research

Links mentioned:


Eleuther ▷ #research (188 messages🔥🔥):

  • Normalized Transformer (nGPT)
  • MuP and Initialization
  • Diff Transformer
  • Gradient Descent Behavior
  • Generative Reward Models

Links mentioned:


Eleuther ▷ #multimodal-general (1 messages):

zackt1234: https://discord.com/channels/729741769192767510/1214931475850469426/1292977027254583397


OpenAI ▷ #ai-discussions (138 messages🔥🔥):

  • Document Categorization AI
  • AI Tools for File Management
  • Cloud vs Local AI Costs
  • AVM and Multi-modal AI
  • AI Subscriptions Comparison

Link mentioned: 70 Best Automated File Organization AI tools - 2024: Discover the best 70 paid and free AI Automated File Organization, and find their features and pricing. Find the best AI tools for Automated File Organization.


OpenAI ▷ #prompt-engineering (23 messages🔥):

  • Learning Styles in Dog Training
  • Prompt Leaderboard Discussion
  • Curiosity About Prompt Creation
  • Gemini Advanced Prompt Success

OpenAI ▷ #api-discussions (23 messages🔥):

  • Learning Styles in Training
  • Prompt Engineering Queries
  • Interest in Prompt Leaderboards
  • Limitations of Prompt Length
  • Collaboration for Prompt Creation

OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • OpenAI prompt caching
  • Prompt caching audits
  • Cost savings with caching
  • Updates on model endpoints
  • Anthropic beta endpoints

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (112 messages🔥🔥):

  • OpenRouter Performance Issues
  • Anthropic API Usage
  • Prompt Caching Details
  • Model Provider Selection
  • Rate Limits

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (95 messages🔥🔥):

  • Stable Diffusion WebUI setup
  • GPU performance comparison
  • Image generation and modification tips
  • ControlNet models
  • General help and resource sharing

Link mentioned: GitHub - lllyasviel/ControlNet: Let us control diffusion models!: Let us control diffusion models! Contribute to lllyasviel/ControlNet development by creating an account on GitHub.


Cohere ▷ #discussions (19 messages🔥):

  • Cohere API Performance
  • Cohere Dark Mode
  • Data Retention Settings
  • AI Club Collaboration
  • Cohere Outage

Links mentioned:


Cohere ▷ #questions (25 messages🔥):

  • Fine Tuning Cohere API
  • Commercial Use of Cohere APIs
  • Changing Frequency and Presence Penalties
  • Data Usage and Privacy Controls
  • Crafting Effective Prompts

Links mentioned:


Cohere ▷ #api-discussions (22 messages🔥):

  • Rerank API with Semi-structured Data
  • Python SDK issues
  • API v1 and v2 differences
  • Janitor AI Proxy link
  • Advanced settings in documentation

Links mentioned:


Cohere ▷ #projects (9 messages🔥):

  • Companion Discord Bot
  • Moderation Tools
  • Identifying Proper Models
  • Hugging Face Resources

Link mentioned: GitHub - rapmd73/Companion: A discord chat bot utilizing AI in a fun and whimsical way. Provides some moderation tools as well.: A discord chat bot utilizing AI in a fun and whimsical way. Provides some moderation tools as well. - GitHub - rapmd73/Companion: A discord chat bot utilizing AI in a fun and whimsical way. Provid...


Latent Space ▷ #ai-general-chat (58 messages🔥🔥):

  • Advanced voice mode frustrations
  • Nobel Prize for Hinton and Hopfield
  • Anthropic Message Batches API
  • Salesforce Generative Lightning UX
  • Cursor tips and tricks

Links mentioned:


Nous Research AI ▷ #general (39 messages🔥):

  • Knowledge Graphs in AI
  • Hermes Model Datasets
  • Free Compute for Competitions
  • LLM Evaluation Services
  • Prototyping with LLMs

Links mentioned:


Nous Research AI ▷ #ask-about-llms (4 messages):

  • Nous masking attention
  • Creating eval datasets
  • LLM Judge experience
  • Llama file location
  • Llama-stack

Nous Research AI ▷ #research-papers (6 messages):

  • Diff Transformer
  • Model Merging at Scale
  • Text to Video Models

Links mentioned:


Nous Research AI ▷ #interesting-links (3 messages):

  • OpenAI o1 model
  • Open O1 project
  • Large-scale model merging

Links mentioned:


Nous Research AI ▷ #research-papers (6 messages):

  • Diff Transformer
  • Model Merging at Scale
  • Text to Video Models

Links mentioned:


GPU MODE ▷ #general (7 messages):

  • Inference optimisation
  • HBM concerns
  • SRAM scaling issues
  • DGX-1 performance comparison
  • 3D stacking solutions

Link mentioned: Don’t Move The Data! | : no description found


GPU MODE ▷ #triton (3 messages):

  • TMA Descriptor Initialization
  • Batch Matrix Multiplication
  • Compilation Issues with tl.dot

Link mentioned: tl.dot with 3D shapes compilation error · Issue #4867 · triton-lang/triton: This code import triton import triton.language as tl from torch import Tensor @triton.jit def get_three_d_weights( currents, # [B_M, B_N] weight_block, # [B_K, B_N] BLOCK_SIZE_K: tl.constexpr, ): p...


GPU MODE ▷ #torch (19 messages🔥):

  • GPU Acceleration for DataLoaders
  • DALI Challenges
  • CUDA Operations and Performance
  • PyTorch Conference Insights
  • SPDL for Efficient Data Loading

Links mentioned:


GPU MODE ▷ #beginner (1 messages):

vayuda: do macs with m series chips use arm sve instructions


GPU MODE ▷ #pmpp-book (4 messages):

  • CUDA architecture details
  • Persistent threads
  • Occupancy and register usage

GPU MODE ▷ #youtube-recordings (3 messages):

  • dtype Clarification
  • Quantized Training
  • Mixed-Precision Training
  • INT8 Speedup on 4090 GPU

GPU MODE ▷ #torchao (5 messages):

  • ViT Sparsity Experiment
  • WeightNormSparsifier
  • Model Inference Time

GPU MODE ▷ #off-topic (5 messages):

  • Raspberry Pi compatibility
  • Nobel Prize in Physics
  • Hinton's Nobel Prize win

Link mentioned: Tweet from jack morris @ COLM (@jxmnop): BREAKING: The Nobel Prize in Physics has been awarded to ptrblock for “fundamental contributions to physics”


GPU MODE ▷ #rocm (2 messages):

  • Raspberry Pi 5
  • External GPU gaming
  • Amdgpu Linux kernel patch
  • GLmark2 performance

Link mentioned: Use an External GPU on Raspberry Pi 5 for 4K Gaming | Jeff Geerling: no description found


GPU MODE ▷ #webgpu (1 messages):

  • ORT Min JS examples
  • WebGPU backend

GPU MODE ▷ #metal (3 messages):

  • BFloat16 conversion
  • Model performance on Mac
  • GPU integer shifts

GPU MODE ▷ #avx (2 messages):

  • vpternlogd instruction
  • AVX-512 ISA
  • Logic design
  • Amiga programming

Link mentioned: AVX Bitwise ternary logic instruction busted!: How a modern AVX instruction shares a similar design with a 1985 blitter chip, by Arnaud Carré


LlamaIndex ▷ #blog (5 messages):

  • LlamaIndex Hackathon
  • LlamaParse premium
  • Oracle integrations
  • LlamaIndex Workflows Tutorial

Links mentioned:


LlamaIndex ▷ #general (49 messages🔥):

  • Docstore Functionality
  • Contextual Retrieval from Anthropic
  • Ingestion Pipeline for Qdrant
  • DuckDB Vector Store Limitations
  • RAG Pipeline Query Handling

Links mentioned:


Modular (Mojo 🔥) ▷ #general (13 messages🔥):

  • TIOBE Index
  • Mojo Programming Language
  • WebAssembly
  • Rust Frontend Frameworks
  • Data Attributes in DOM

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (19 messages🔥):

  • Mojo Keywords Reevaluation
  • ECS Implementation Challenges
  • Feedback on Mojo Proposal
  • Display of Keywords for Beginners
  • Game Development Discussions

Links mentioned:


Modular (Mojo 🔥) ▷ #max (13 messages🔥):

  • Max Inference Engine Issues
  • Custom Operations Tutorial
  • PyTorch Version Compatibility
  • Graph Compilation Times

Interconnects (Nathan Lambert) ▷ #news (5 messages):

  • 2024 Nobel Prize in Physics
  • OpenAI's Compute Capacity
  • Microsoft Competition

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (14 messages🔥):

  • Llama 3.2 11B vs 8B performance
  • Vision integration in text models
  • Research on PRMs/Verifiers
  • State-of-the-art audio models

Link mentioned: Tweet from Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z): LRM-Modulo? Our initial experiments with o1 showed that while LRMs like it do seem to lift the floor for performance on planning problems, they are still far from robust. One idea is to view LRMs as ...


Interconnects (Nathan Lambert) ▷ #ml-drama (14 messages🔥):

  • Discrediting Claims
  • Energy Use in AI Research
  • Internal Disputes
  • History with Emma and Jeff

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (6 messages):

  • Toy features
  • Sampling techniques
  • Explainability in AI

Links mentioned:


Perplexity AI ▷ #general (31 messages🔥):

  • Discord Experience Issues
  • Merch Announcement for Referrals
  • Image Generation in Perplexity
  • Perplexity Web vs Mobile Performance
  • Perplexity Profit Concerns

Perplexity AI ▷ #sharing (6 messages):

  • China's Sound Laser
  • 5MVA Design
  • Small Circuit Design
  • Cerebras IPO Challenges
  • Generating Descriptions

Perplexity AI ▷ #pplx-api (2 messages):

  • Rate Limit Increase Request
  • Support Response Issues

DSPy ▷ #show-and-tell (1 messages):

  • Tool Creation
  • Assistants Development

DSPy ▷ #general (13 messages🔥):

  • Custom LM vs Custom Adapter
  • Deprecation of LM Clients
  • LM Configuration and Adapters
  • Optimizer and Adapter Issues
  • Communication in DSPy Community

Link mentioned: Creating a Custom Local Model (LM) Client | DSPy: ---


OpenInterpreter ▷ #general (13 messages🔥):

  • Open-Interpreter Tool Calling
  • Structured Output for Tools
  • Mozilla AI Talk Announcement

LLM Agents (Berkeley MOOC) ▷ #mooc-questions (6 messages):

  • In-person attendance at lectures
  • AI agent startups using Autogen
  • Building frameworks with Redis

LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

  • DSPy Lecture by Omar
  • Contributions to DSPy

tinygrad (George Hotz) ▷ #general (5 messages):

  • tinygrad Website Navigation
  • Exo Bounty Challenge
  • Tinygrad Documentation

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

  • Buffer Count Workaround
  • Inefficiencies in Tensor Operations

LangChain AI ▷ #general (3 messages):

  • Travel Concerns
  • ChatPromptTemplate Usage
  • Invalid JSON Formatting

LangChain AI ▷ #langchain-templates (1 messages):

  • Escaping Quotes in Messages
  • ChatPromptTemplate Usage
  • FewShotChatMessagePromptTemplate

LangChain AI ▷ #share-your-work (1 messages):

gustaf_81960_10487: <@387269332142391298> update your certs!


Torchtune ▷ #general (4 messages):

  • BF16 Training Challenges
  • Learning Rate Adjustments
  • Stochastic Rounding in Optimizers

LAION ▷ #general (2 messages):

  • Geoffrey Hinton Nobel Comparison
  • Model Merging at Scale

Links mentioned:


LAION ▷ #research (2 messages):

  • Model Merging at Scale
  • Autoarena Tool

Links mentioned:









{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}