Frozen AI News archive

not much happened today

**Sebastien Bubeck** introduced **REINFORCE++**, enhancing classical REINFORCE with **PPO-inspired techniques** for **30% faster training**. **AI21 Labs** released **Phi-4** under the **MIT License**, accessible via **Ollama**. **François Chollet** announced plans for **ARC-AGI-2** and a next-generation **AGI benchmark**. **LangChain** launched **10 new integration packages** to boost **LLM application development**. **Tom Doerr** introduced **Ollama-OCR**, a Python package for **text extraction** using **vision language models**. **Arohan** optimized **Shampoo** for **memory efficiency**, reducing usage from **20 to 6 bytes per parameter**. **Bindu Reddy** showcased **CodeLLM's v1** for **frontend code generation** and highlighted **LlamaIndex Workflows** for **academic summarization** and **slide generation**. **Hwchase17** collaborated with **Together Compute** to enhance **WebDev Arena** with **complex coding agents** for **LLM coding evaluations**. **Jonathan Ross** detailed **Groq's** mission to reduce **compute costs by 1000x** amid rising **generative AI** spending. **Clement Delangue** warned about **scam alerts** involving false claims of association with **AI21**. **Vikhyat K** raised concerns about the **ethical implications** and **trade-offs** of **AGI**. Memes and humor included creative AI prompts and critiques of **LLM behaviors**.

Canonical issue URL

AI News for 1/7/2025-1/8/2025. We checked 7 subreddits, 433 Twitters and 32 Discords (218 channels, and 2346 messages) for you. Estimated reading time saved (at 200wpm): 278 minutes. You can now tag @smol_ai for AINews discussions!

Traditionally, the industry wakes up on the Ides of the month. We have a week to go.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Research & Models

AI Development Tools & Frameworks

AI Applications & Use Cases

AI Business & Industry

AI Policy & Ethics

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. HP's Innovative AMD AI Machine with Unified RAM

Theme 2. Phi-4 by Microsoft: Released and Analyzed

Theme 3. DeepSeek V3 GGUF: 2-bit Quantization Success

Theme 4. NVIDIA Cosmos: Foundation Model for Virtual Worlds

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. 25% of Google's Code Generated by AI

Theme 2. Elon Musk's AI Launch Promises


AI Discord Recap

A summary of Summaries of Summaries by o1-mini-2024-09-12

Theme 1. New AI Models Surge Forward

Theme 2. AI Tools and API Integrations Expand

Theme 3. Community Support and Technical Hurdles

Theme 4. GPU Optimizations and Hardware Discussions

Theme 5. AI Applications in Creative and Technical Domains


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Codeium (Windsurf) Discord


LM Studio Discord


Stability.ai (Stable Diffusion) Discord


Stackblitz (Bolt.new) Discord


aider (Paul Gauthier) Discord


Cursor IDE Discord


Notebook LM Discord Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


Nomic.ai (GPT4All) Discord


Nous Research AI Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


OpenAI Discord


Perplexity AI Discord


GPU MODE Discord


Cohere Discord


Latent Space Discord


LlamaIndex Discord


AI21 Labs (Jamba) Discord


LLM Agents (Berkeley MOOC) Discord


DSPy Discord


OpenInterpreter Discord


LAION Discord


Torchtune Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Axolotl AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (407 messages🔥🔥🔥):

Finetuning Phi-4, Unsloth API, CUDA on TPUs, Deepseek V3, Training Distinct LLMs

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

Job Search


Unsloth AI (Daniel Han) ▷ #help (30 messages🔥):

Unsloth multi-GPU support, Training loss iteration spikes, DeepSeek GUFF file concerns, Avoiding overfitting in datasets, RAG and fine-tuning discussions


Codeium (Windsurf) ▷ #discussion (66 messages🔥🔥):

Codeium Chat Issues, Windsurf Performance, Authentication Problems, Billing and Credits, Google Signup Only


Codeium (Windsurf) ▷ #windsurf (300 messages🔥🔥):

Windsurf Performance Issues, User Support and Feedback, Integration with Python Linters, Account and Billing Problems, AI Model Capabilities

Links mentioned:


LM Studio ▷ #general (76 messages🔥🔥):

Performance of Phi-4 model, Issues with LM Studio model loading, Deepseek-V3 compatibility, Qwen2 model functionality, LM Studio as a server and frontend connection

Links mentioned:


LM Studio ▷ #hardware-discussion (113 messages🔥🔥):

Speculative Decoding, Nvidia Digits Performance, 7900XT Comparison, LPDDR5X vs M2 Ultra, Recent Nvidia GPU Releases

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (187 messages🔥🔥):

NVIDIA 5090 Graphics Card, Commercial Use of Stable Diffusion Models, Lora Training Techniques, Creating Realistic Monsters with AI, Image-to-Image Generation Techniques

Links mentioned:


Stackblitz (Bolt.new) ▷ #prompting (6 messages):

Bolt's capabilities, UI Design Prompts, Prompting Techniques


Stackblitz (Bolt.new) ▷ #discussions (180 messages🔥🔥):

Rate Limiting and Token Management, Complex Project Development Tips, Deployment Issues, Supabase Connection Challenges, Use of Different Tools with Bolt

Links mentioned:


aider (Paul Gauthier) ▷ #general (101 messages🔥🔥):

Sonnet vs O1 Pro performance, Aider usage tips, DeepSeek model performance, Sudoku solving discussion, Clickbait video frustration

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (49 messages🔥):

Aider Usage Issues, Litellm Custom Model Setup, Ollama Model Interaction, Deepseek Configuration on OpenRouter, Message Formatting Errors

Links mentioned:


aider (Paul Gauthier) ▷ #links (7 messages):

LLM Interviewing, SynthLang, Gemini 2.0 Flash Experimental

Link mentioned: SynthLang - Prompt Generator & Tester: no description found


Cursor IDE ▷ #general (153 messages🔥🔥):

Cursor IDE Bugs, Composer Functionality, Flutter Development, Technical Debt in Coding, User Experience Issues in Cursor

Links mentioned:


Notebook LM Discord ▷ #use-cases (23 messages🔥):

System Prompt for Quoting, Language Settings in NotebookLM, Repurposing Content, Business Use Cases for AI, Video Content Analysis

Links mentioned:


Notebook LM Discord ▷ #general (86 messages🔥🔥):

NotebookLM Plus Access Issues, Using NotebookLM for Education, Podcast Features, Customization Challenges, General Usage Feedback

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Model Context Protocol, Agents Base launch, Marketing automation

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (60 messages🔥🔥):

LLM Game Development, Azure Model Integration, AI Model Conversation Preferences, Bug Reports on Llama Models, API Call Timeout Issues

Links mentioned:


Modular (Mojo 🔥) ▷ #general (12 messages🔥):

Feedback on Current State, Font Weight Adjustment, CPU/GPU Pairing, AMD vs Nvidia Performance

Link mentioned: Reaction My Eyes GIF - Reaction My Eyes Cant Unsee - Discover & Share GIFs: Click to view the GIF


Modular (Mojo 🔥) ▷ #mojo (47 messages🔥):

Indexing static lists in Mojo, Difference between ListLiteral and VariadickPack, Traits development in Mojo, Overloads and polymorphism proposals, Static analysis methods in Mojo

Links mentioned:


Nomic.ai (GPT4All) ▷ #general (56 messages🔥🔥):

Model Performance and Quantization, GPU Support Issues, Hiring Opportunities in AI, Q4_0 Model Issues, GPT4All Community Contributions

Links mentioned:


Nous Research AI ▷ #general (40 messages🔥):

Networking Solutions for Budget, Phi-4 Model Technical Insights, USB Networking Capabilities, Job Opportunities in Web Development

Link mentioned: microsoft/phi-4 · Hugging Face: no description found


Nous Research AI ▷ #ask-about-llms (3 messages):

Zero Trust in Development, Using Placeholder Data, MVP Development Environment, Solutions for Early Development


Nous Research AI ▷ #research-papers (1 messages):

craftycannon_98161: Any progress?


Nous Research AI ▷ #interesting-links (3 messages):

Structure of Neural Embeddings, MiniMind Lightweight Language Model, Training Pipeline for LLMs

Links mentioned:


Nous Research AI ▷ #research-papers (1 messages):

craftycannon_98161: Any progress?


Eleuther ▷ #general (15 messages🔥):

Pythia Evaluation, Learning AI Tools, Supervised Fine-Tuning Libraries

Links mentioned:


Eleuther ▷ #research (11 messages🔥):

Cross-Entropy Memory Optimization, SD3 Paper Discussion, HunyuanProver for Theorem Proving

Links mentioned:


Eleuther ▷ #lm-thunderdome (1 messages):

teknium: woops dno how that image got there


Eleuther ▷ #gpt-neox-dev (21 messages🔥):

OOM Issues with 6.7B model, DeepSpeed Pipe Module Performance, AdamW Optimizer Details, Batch Size Behavior in Training, BF16 Loss Scaling Discussion

Link mentioned: <a href="https://api.wandb.ai",">no title found: no description found


Interconnects (Nathan Lambert) ▷ #events (2 messages):

Thursday Meeting, Shack15 Venue


Interconnects (Nathan Lambert) ▷ #news (13 messages🔥):

01.AI Rumors and Valuation, Institutional Data Initiative, AI for Good: Omdena, Hugging Face and Phi-4

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (11 messages🔥):

MoE models efficiency, Expert weight loading, OlMoE in vLLM, Transformer architectures, Peak performance in MoEs


Interconnects (Nathan Lambert) ▷ #ml-drama (7 messages):

ChatGPT Versions, Token Usage Concerns, OpenAI Executive Predictions, Community Dynamics

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (7 messages):

NVIDIA's Performance, Orin in Robotics, Community Support in Open Source, Anthropic Research on AI Alignment

Link mentioned: - YouTube: no description found


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

Nextcloud Community Support, Ai2's Work, Open Source Contribution


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

SnailBot News: <@&1216534966205284433>


OpenAI ▷ #ai-discussions (20 messages🔥):

LLaMA Fine-Tuning, Censorship in AI Responses, Corporate Influence in Politics, Modern Guilds Concept, Custom GPT Model Behaviors


OpenAI ▷ #gpt-4-discussions (7 messages):

Ubuntu 24.04.1, ROCm 6.3.1, Ollama 3.2 Vision, O1 Pro upgrade, Concept clarifier GPT


OpenAI ▷ #prompt-engineering (7 messages):

Prompt Instruction Vague, Style Naming in Prompts, Completion Quality Concerns


OpenAI ▷ #api-discussions (7 messages):

Prompt Engineering, Instruction Clarity, Completion Rates


Perplexity AI ▷ #announcements (1 messages):

CSV Download Feature


Perplexity AI ▷ #general (23 messages🔥):

Subscription Options, Performance Issues, Application Integration, File Upload Errors, Voice Functionality

Link mentioned: Youzu.ai: Where AI Interior Design Meets Real-World Shopping: Introducing the world’s first Design-to-Buy platform, powered by AI✨


Perplexity AI ▷ #sharing (15 messages🔥):

AI Superintelligence, Nuclear Power Purchases, Nvidia's Personal AI, Healthiest Cooking Oils, React JS Learning Resources

Link mentioned: YouTube: no description found


GPU MODE ▷ #general (3 messages):

NCU profile comparison, Community welcome


GPU MODE ▷ #triton (9 messages🔥):

Using wgmma for MMAs, GPU warmup importance, Benchmark timing, Fused MLP implementations, On-chip MLP usage

Link mentioned: GitHub - NVlabs/tiny-cuda-nn: Lightning fast C++/CUDA neural network framework: Lightning fast C++/CUDA neural network framework. Contribute to NVlabs/tiny-cuda-nn development by creating an account on GitHub.


GPU MODE ▷ #cuda (2 messages):

Cutlass Kernel Performance, Diffing Generated PTX and SASS


GPU MODE ▷ #cool-links (1 messages):

drisspg: https://hipscript.lights0123.com/


GPU MODE ▷ #off-topic (3 messages):

Compact PC Benefits, Gaming Laptop vs Desktop Size, Thermal Performance Concerns


GPU MODE ▷ #webgpu (1 messages):

iron_bound: https://hipscript.lights0123.com/


GPU MODE ▷ #🍿 (11 messages🔥):

Discord based leaderboard, Alpha users recruitment, Fastest softmax kernel competition, GPU Glossary materials, Kernel coding


GPU MODE ▷ #thunderkittens (4 messages):

Thunderkittens vs Flash Attention 3, Reproducing plots, Collaboration on kernels

Links mentioned:


GPU MODE ▷ #edge (1 messages):

Shard counts adjustment, File generation process


Cohere ▷ #discussions (3 messages):

Community Check-in


Cohere ▷ #questions (2 messages):

Token Usage Export


Cohere ▷ #api-discussions (7 messages):

Cohere LLM API, Token Budget Concerns, Model Specifications, Recursive Loop Issue, Max Token Configuration


Cohere ▷ #cmd-r-bot (23 messages🔥):

Exporting Token Usage, Cohere Documentation Search


Latent Space ▷ #ai-general-chat (30 messages🔥):

FP4 Wars, State of the Art Open Source TTS, Omi Wearable Technology, Salesforce Hiring Freeze, New Directions in LLM Products

Links mentioned:


LlamaIndex ▷ #blog (3 messages):

Cohere integration with LlamaIndex, LlamaIndex Workflows in AI, GitHub Event on AI Agents

Link mentioned: LlamaIndex — Cohere: Learn how to use Cohere and LlamaIndex together to generate responses based on data.


LlamaIndex ▷ #general (17 messages🔥):

Metadata Management in LlamaIndex, Evaluation Times for FaithfulnessEvaluator, API Token Sharing, Python Dependency Conflicts


AI21 Labs (Jamba) ▷ #general-chat (13 messages🔥):

AI21 Labs and crypto, Using Jamba for coding assistance, AI's coding capabilities, Podcast app development, Exploring programming with Jamba


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (12 messages🔥):

Certificate Declaration Form, Email Consistency for Certificates, Spring 2025 Course Details, Twitter Account Verification for Certificates, Certificate Availability Timeline


DSPy ▷ #general (6 messages):

Long context issues, Hide demo fields parameter, Framework improvements


DSPy ▷ #examples (2 messages):

Vertex AI models, DSPy integration, Inference processes


OpenInterpreter ▷ #general (6 messages):

Open Interpreter Production Setup, Prompting Techniques for Code Generation, Custom Instructions for Model Performance, NVIDIA Grace Blackwell AI Supercomputer

Link mentioned: NVIDIA Project DIGITS: The World’s Smallest AI Supercomputer. : Reserve yours today.


OpenInterpreter ▷ #O1 (1 messages):

davidlandstarop1: God bless us all <@1075395291869614122>


OpenInterpreter ▷ #ai-content (1 messages):

davidlandstarop1: Safety first <@1221270473355038720>


LAION ▷ #general (1 messages):

Dual 3090 Setup, Fine-tuning LLM on Music Notation


LAION ▷ #research (1 messages):

rom1504: Is there any good open tool registry for building agents ?


Torchtune ▷ #general (1 messages):

jovial_lynx_74856: Anyone here tried finetuning ModernBERT?







{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}