Frozen AI News archive

not much happened today

**Helium-1 Preview** by **kyutai_labs** is a **2B-parameter multilingual base LLM** outperforming **Qwen 2.5**, trained on **2.5T tokens** with a **4096 context size** using token-level distillation from a **7B model**. **Phi-4 (4-bit)** was released in **lmstudio** on an **M4 max**, noted for speed and performance. **Sky-T1-32B-Preview** is a **$450 open-source reasoning model** matching **o1's performance** with strong benchmark scores. **Codestral 25.01** by **mistralai** is a new SOTA coding model supporting **80+ programming languages** and offering **2x speed**.

Canonical issue URL

AI News for 1/10/2025-1/13/2025. We checked 7 subreddits, 433 Twitters and 32 Discords (219 channels, and 2928 messages) for you. Estimated reading time saved (at 200wpm): 312 minutes. You can now tag @smol_ai for AINews discussions!

Welcome to Codestral, but for the frontier model labs, releases happen closer to the 15th of every month. Not long now.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Releases & Benchmarks

AI Research & Innovations

AI Applications & Tools

AI Infrastructure & Hardware

AI Safety, Ethics & Policies

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Criticism of 'Gotcha' tests to determine LLM intelligence

Theme 2. Kokoro TTS Achieves High Performance with Limited Parameters

Theme 3. Sky-T1: Open-Source AI Model Training for $450

Theme 3. Hugging Face Unveils Agent Course for AI Developers

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. UC Berkeley's Sky-T1 Outperforms OpenAI-o1 with Budget Training


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. New Models and Surprising Stats

Theme 2. HPC Tuning and Memory Moves

Theme 3. Building Agents and Custom Bots

Theme 4. Fine-Tuning, LoRA, and Data Delights

Theme 5. Privacy, Caching, and Extended Context


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Eleuther Discord


Codeium (Windsurf) Discord


Cursor IDE Discord


LM Studio Discord


Nous Research AI Discord


Stackblitz (Bolt.new) Discord


OpenAI Discord


Notebook LM Discord Discord


Stability.ai (Stable Diffusion) Discord


Latent Space Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


Perplexity AI Discord


Interconnects (Nathan Lambert) Discord


Cohere Discord


GPU MODE Discord


Modular (Mojo 🔥) Discord


DSPy Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


OpenInterpreter Discord


LAION Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Axolotl AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (1120 messages🔥🔥🔥):

Fine-tuning Llama 3.3, Using Unsloth for AI Models, Performance Metrics, GPUs and Cloud Solutions, Chat Templates and Tokenization

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (23 messages🔥):

AI Chatbot Creation, Transcribing Videos, AI for Language Learning, Llama Model Usage, Voice Modes in AI

Link mentioned: DuckDuckGo AI Chat at DuckDuckGo: no description found


Unsloth AI (Daniel Han) ▷ #help (236 messages🔥🔥):

Fine-tuning LLMs, Data Preparation and Augmentation, LoRA for Style Transfer, Challenges in AI and NLP, Using Pre-trained Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

Maya: Multilingual Vision-Language Model


Unsloth AI (Daniel Han) ▷ #research (11 messages🔥):

DataCollatorForPromptCompletion, Unsloth Training Speed, Cybersecurity LLM for Deception, Fine-tuning in Unsloth, Research Submission Inquiry

Links mentioned:


Eleuther ▷ #general (68 messages🔥🔥):

SmolLM-Corpus release, Grouped-Query Attention analysis, VLMs with multiple image input, Context Attribution research, User introductions and expertise

Links mentioned:


Eleuther ▷ #research (738 messages🔥🔥🔥):

Latro Model, Process Reward Models (PRMs), Chain-of-Thought (CoT) Reasoning, VinePPO Algorithm, Reinforcement Learning (RL)

Links mentioned:


Eleuther ▷ #interpretability-general (5 messages):

Mechanistic Interpretability Audio Content, Neel Nanda Podcast on SAEs, Weekly Mechanistic Interpretability Reading Groups

Link mentioned: Neel Nanda - Mechanistic Interpretability (Sparse Autoencoders): Machine Learning Street Talk (MLST) · Episode


Eleuther ▷ #lm-thunderdome (19 messages🔥):

Goodfire API Implementation, MLQA Benchmark Clarity, Dataset Issues, GPT-4o Usage, Pre-commit Line Ending Issues

Links mentioned:


Eleuther ▷ #gpt-neox-dev (5 messages):

Slurm CPU memory issues, Pretraining resource recommendations


Codeium (Windsurf) ▷ #discussion (141 messages🔥🔥):

Windsurf Login Issues, Codeium Pricing and Subscription, Feature Requests and Feedback, Technical Errors and Troubleshooting, User Experience and Support Concerns

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (592 messages🔥🔥🔥):

Windsurf functionality issues, Cascade performance, Subscription model concerns, User experiences with AI tools, Feature requests for Windurf

Links mentioned:


Cursor IDE ▷ #general (553 messages🔥🔥🔥):

Cursor IDE Performance, New AI and Tools, Collaboration Projects, AI Rules and Guidelines, Cursor Extension Issues

Links mentioned:


LM Studio ▷ #general (317 messages🔥🔥):

LM Studio capabilities, Model performance comparisons, AI hardware discussions, Quantization effects on models, User experiences with coding models

Links mentioned:


LM Studio ▷ #hardware-discussion (186 messages🔥🔥):

PowerMac G3 Build, Llama Model Loading Issues, RTX Graphics Cards, NVIDIA DIGITS Launch, Dual GPU Setup Considerations

Links mentioned:


Nous Research AI ▷ #general (390 messages🔥🔥):

Claude Model Discussions, Hyperparameter Search as a Service, Twitter Experience and Features, Models and Quantization, GitHub Downtime Concerns

Links mentioned:


Nous Research AI ▷ #ask-about-llms (15 messages🔥):

LLM for medical advice, Privacy concerns with AI, Audiobook text extraction


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/prajdabre1/status/1877720543933370418?s=46


Nous Research AI ▷ #interesting-links (59 messages🔥🔥):

Qwen 0.5B Model Performance, Generative Knowledge Distillation (GKD), Synthetic Data Usage in AI, MobileLLM Research Insights, Improvements in Attention Mechanisms

Links mentioned:


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/prajdabre1/status/1877720543933370418?s=46


Stackblitz (Bolt.new) ▷ #announcements (1 messages):

katetra: https://x.com/stackblitz/status/1878818461905739994


Stackblitz (Bolt.new) ▷ #prompting (27 messages🔥):

Stripe Integration, Prompting Techniques, Building AI Apps, User Experiences with Bolt, Webinar Announcement

Links mentioned:


Stackblitz (Bolt.new) ▷ #discussions (386 messages🔥🔥):

Token Management on Bolt, Integrating with Supabase and Netlify, Usage Issues with AI Prompts, CORS Handling in API Requests, Using Stripe with Bolt

Links mentioned:


OpenAI ▷ #ai-discussions (264 messages🔥🔥):

AI productivity in the UK, Embedded AI agents for customer service, Comparison of AI models, New AI model releases, The future of coding with AI

Link mentioned: OpenAI Status: no description found


OpenAI ▷ #gpt-4-discussions (25 messages🔥):

Canvas Issues, Code Output Concerns, Location Usage, Team Account Problems, Custom GPT Functionality


OpenAI ▷ #prompt-engineering (48 messages🔥):

GPT table interpretation issues, OCR vs AI for table reading, Improving table accuracy, Lateral thinking for data format, Reliability of AI models


OpenAI ▷ #api-discussions (48 messages🔥):

GPT interpreting tables, Real OCR performance, Complex table structures, Improving AI accuracy, Lateral thinking for data formats


Notebook LM Discord ▷ #announcements (2 messages):

NotebookLM Mobile Experience Study, Feedback on Audio Overviews, Participant Incentives, User Experience Research

Links mentioned:


Notebook LM Discord ▷ #use-cases (46 messages🔥):

Notebook LM capabilities, Podcast sharing platforms, D&D resources with AI, Audio overview feedback, AI in education

Links mentioned:


Notebook LM Discord ▷ #general (289 messages🔥🔥):

NotebookLM Features and Limitations, Using NotebookLM for Research, Podcast Customization, Embedding NotebookLM, User Onboarding and Support

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (313 messages🔥🔥):

Pony Models vs Illustrious, Dreambooth and Training Loras, High-Resolution Generation Techniques, Extensions and Tools for Stable Diffusion, Generating Images with AI

Links mentioned:


Latent Space ▷ #ai-general-chat (138 messages🔥🔥):

AI Model Cost and Performance, Ciortation and Training of Models, AI Services and Tools, Generative AI in Retail, New AI Research and Technologies

Links mentioned:


Latent Space ▷ #ai-announcements (25 messages🔥):

New Podcast Episode, O1 Guest Post Discussion, User Experiences with O1 Pro, Article Featured on HN, Dynamic Use of O1

Links mentioned:


Latent Space ▷ #ai-in-action-club (116 messages🔥🔥):

Claude Projects, AI Tools in Development, Ruby for Prototyping, Mob Coding, AI Applications in Interior Design

Links mentioned:


aider (Paul Gauthier) ▷ #announcements (5 messages):

Aider v0.71.0, Chat mode switching, DeepSeek prompts, Pretty output in editing, Release history insights

Link mentioned: Release history: Release notes and stats on aider writing its own code.


aider (Paul Gauthier) ▷ #general (182 messages🔥🔥):

DeepSeek Model Performance, Model Configuration in Aider, Quantization for Neural Networks, AI Coding Tools Improvement, Polyglot Benchmark Issues

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (84 messages🔥🔥):

Aider configuration, Prompt caching in Aider, Editing files in Aider, Using models from Hyperbolic, Handling suggestions in Aider

Links mentioned:


aider (Paul Gauthier) ▷ #links (4 messages):

browser-use, CodeGate, Deepseek AI Assistant, Always On AI Assistant

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

louisgv: Phi 4 is now available: https://openrouter.ai/microsoft/phi-4


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Friday Agents, Telegram LLM Interface, DeVries AI Chatbot

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (212 messages🔥🔥):

OpenRouter usage, Deepseek model performance, Launching of Mistral's Codestral model, Comparison of different LLMs, Provider deployment on OpenRouter

Links mentioned:


Perplexity AI ▷ #general (190 messages🔥🔥):

Perplexity Subscription Concerns, Comparison of AI Models, User Experience Issues, Image Generation Features, API Usage and Costs

Links mentioned:


Perplexity AI ▷ #sharing (14 messages🔥):

Anthropic valuation, Roman Empire lead poisoning, AI Chips, Bitcoin Recovery, Spotify CEO cashout

Links mentioned:


Perplexity AI ▷ #pplx-api (11 messages🔥):

Sonar 3.3 API Availability, Citations in Llama-3.1-Sonar, Future Model Releases, Changelog Confusion, Model Deprecation Notice

Link mentioned: no title found: no description found


Interconnects (Nathan Lambert) ▷ #news (38 messages🔥):

Codestral 25.01, Helium-1 Model Launch, CC-BY License Discussions, Qwen 2.5-Math Models, OpenAI and OSS Contributions

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (9 messages🔥):

Qwen Instruct Model Training, LoRa Fine-Tuning, Generative Agents and Environment Navigation


Interconnects (Nathan Lambert) ▷ #ml-drama (1 messages):

420gunna: https://x.com/aidan_mclau/status/1878944278782890158


Interconnects (Nathan Lambert) ▷ #random (38 messages🔥):

Learning about AI, Local AI Models, Meta Ray-Bans, CIOs and AI Talks, VITURE Pro Neckband

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (58 messages🔥🔥):

Sky-T1-32B-Preview, Reinforcement learning vs. Supervised fine-tuning, Generative AI for talks, Challenges in academic talks, Process Reward Models

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

Dr. Huberman's Insights, Mental Health Support, Impact of Discussions


Interconnects (Nathan Lambert) ▷ #retort-podcast (6 messages):

Forum Channel Suggestion, Website for Podcast Listings


Interconnects (Nathan Lambert) ▷ #policy (11 messages🔥):

U.S. AI Economic Blueprint, AI Diffusion Controls, National Security & Economic Strength, Export Controls, AI Leadership

Links mentioned:


Cohere ▷ #discussions (13 messages🔥):

Community Chat Etiquette, Command R+ Capabilities, North Waitlist Interest


Cohere ▷ #questions (53 messages🔥):

Cohere Datasets Bug, Command R+ Benchmarks, Dataset Upload Issues, API Response Error, User Communication Concerns

Links mentioned:


Cohere ▷ #api-discussions (4 messages):

API Issues, Trial Account Problems


Cohere ▷ #cmd-r-bot (46 messages🔥):

Cohere's command functionalities, Theological programming, LLM code generation, Bot interaction guidelines


GPU MODE ▷ #general (3 messages):

O1 Workflow, Claude's Role, Interface Directives


GPU MODE ▷ #triton (13 messages🔥):

Triton Puzzles Optimization, Autotuning GPU Models, CUDA Block Assignment, Num Stages Impact, Cross Entropy Kernel Improvement

Links mentioned:


GPU MODE ▷ #cuda (7 messages):

CUDA Installation on Ubuntu, CUDA in Visual Studio Code, Blackwell GeForce GPU support, FA3 Profiling on H200 vs H100

Links mentioned:


GPU MODE ▷ #torch (11 messages🔥):

Profiler UTF-8 decode issue, Using Flash Attention with Transformers, Challenges with Data Parallelism Strategies, Inference Pipeline for Large Models, NNSight for Memory Efficiency

Links mentioned:


GPU MODE ▷ #announcements (1 messages):

Upcoming Talks, Flash Infer, Mosaic GPU, Turing int8 matmul, Profiling at NVIDIA


GPU MODE ▷ #jobs (4 messages):

GPU expertise hiring at Meta, GenAI inference acceleration, Depth of technical work at Meta

Link mentioned: Software Engineer, Systems ML - HPC Specialist: Meta's mission is to build the future of human connection and the technology that makes it possible.


GPU MODE ▷ #beginner (7 messages):

Importing CUDA to Visual Studio Code, CUDA Toolkit Installation, Building Copilot with Llama 3.2, CUDA Atomic Functions for Doubles, Using Integer Functions for Doubles

Links mentioned:


GPU MODE ▷ #off-topic (3 messages):

DGX H100 concerns, Sonoma AI Speaker Series, Fundraising ideas for server

Link mentioned: Sonoma AI with Wine · Luma: This is an in-person event! Registration required in order to get in.Topic: Sonoma AI (and wine) Meetups for remote tech workersWhat we’ll do:Have some food…


GPU MODE ▷ #lecture-qa (5 messages):

Learning CUDA and GPU Programming, Completing Lecture Exercises, Forming Study Groups


GPU MODE ▷ #liger-kernel (6 messages):

Qwen2-VL issues, Error with Liger Kernel, Downgrading Transformers

Link mentioned: IndexError: The shape of the mask [7387] at index 0 does not match the shape of the indexed tensor [1] at index 0 · Issue #515 · linkedin/Liger-Kernel: 🐛 Describe the bug The error exists when I try to use the qwen2-vl with qwen2-vl liger kernel to generate text. The following code got the following error. But the same code if I change the liger k.....


GPU MODE ▷ #self-promotion (2 messages):

ASRG Season 1 Pilot, Maya Multilingual Vision-Language Model

Link mentioned: Tweet from Systems Reading Group (@asrg_gg): EP0: Pilot of ASRG Season 1 is tomorrow!We’ll read The Linux Kernel Module Programming Guide in C. Make sure to have an x86-64 VM with Ubuntu 22.04 or just use multipass like @nanod1jkstra does.


GPU MODE ▷ #edge (5 messages):

Vulkan on Raspberry Pi 5, Nvidia Cosmos on Jetson, Transformer Engine Porting, 3D Vision Stack Libraries

Link mentioned: [ET-VK] Request VMA_ALLOCATION_CREATE_HOST_ACCESS_SEQUENTIAL_WRITE_BIT, not RANDOM by swolchok · Pull Request #7615 · pytorch/executorch: SummaryIt looks like we are careful to use only copy_from and copy_to with StagingBuffer on CPU, in which case we only need SEQUENTIAL_WRITE.This matters on Raspberry Pi 5, where there appears (f...


Modular (Mojo 🔥) ▷ #general (4 messages):

2025 Community Meeting, MAX GPU Benchmarking, MAX-CV, Meeting Video Upload, Attendance Concerns


Modular (Mojo 🔥) ▷ #mojo (23 messages🔥):

Testing Mojo Code on macOS, Nightly Documentation for Mojo/Max, Async Proposals for Mojo, Compiler Issues with Mojo, Int8 to String Conversion in Mojo

Links mentioned:


DSPy ▷ #papers (1 messages):

wiltonb: Happy reading!

https://kanesimms.substack.com/p/what-agentic-ai-actually-is-a-deeply


DSPy ▷ #general (19 messages🔥):

AzureOpenAI Integration, dspy.react with phi-4 Functionality, Getting Started with DSPy, Optimizing LLMs, Prompt Performance Across Models

Links mentioned:


Torchtune ▷ #general (20 messages🔥):

Phi-4 Models, Adaptive Batching, Using Instruct Models for Medical Training, Quality over Quantity in Training Data

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (16 messages🔥):

MOOC Enrollment, Final Project Results, Weekly Lectures Start Date, Assignment Submission Process, Course Difficulty

Link mentioned: Quizzes Archive - LLM Agents MOOC: NOTE: The correct answers are in the black boxes (black text on black background). Highlight the box with your cursor to reveal the correct answer (or copy the text into a new browser if it’s hard to ...


LlamaIndex ▷ #blog (2 messages):

AI Builders Summit, AutoRAG Framework, RAG Techniques, Small Language Models


LlamaIndex ▷ #general (12 messages🔥):

LlamaIndex Engineer Search, GraphRAG Visualization Issue, OpenAI Model Prompt Caching, Dynamic Variables in Prompt Templates

Links mentioned:


Nomic.ai (GPT4All) ▷ #general (14 messages🔥):

EPUB file support, LLama model prompt templates, AI context length limitations, Exporting chat history, Running GPT4All remotely

Links mentioned:


tinygrad (George Hotz) ▷ #general (11 messages🔥):

Tinygrad Tensor Compiler, Meeting #53 Agenda, Stale PR Closure, FSDP Bounty Lock Discussion, Understanding Tinygrad

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

Activation Checkpointing, Memory Management in Tinygrad


OpenInterpreter ▷ #general (10 messages🔥):

Installing Open Interpreter, Homebrew and pipx, Open Interpreter functionality


LAION ▷ #general (8 messages🔥):

Stable Audio 3 Open Source Announcement, Hypertension Recognition Dataset Request

Link mentioned: Tweet from undefined: no description found


LAION ▷ #research (1 messages):

Megatron Checkpoint Conversion, Evaluation Scripts, NVIDIA MegaTron-LM

Link mentioned: sciebo - www.hochschulcloud.nrw: C4_50B_cosine_bs-4M_lr-6e-3_warmup-1000 is publicly shared







{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}