Frozen AI News archive

not much happened today

**Google DeepMind** announced updates to **Gemini 2.0**, including an upgraded **Flash Thinking model** with stronger reasoning and native image generation capabilities. **Cohere** launched **Command A**, a **111B** parameter dense model with a **256K context window** and competitive pricing, available on **Hugging Face**. **Meta AI** proposed **Dynamic Tanh (DyT)** as a replacement for normalization layers in Transformers, supported by **Yann LeCun**. **Alibaba** released **QwQ-32B**, a **32.5B** parameter model excelling in math and coding, fine-tuned with reinforcement learning and freely available under **Apache 2.0 license**. **Google DeepMind** also released **Gemma 3** models ranging from **1B to 27B** parameters with a **128K token context window** and over **140 language** support, plus **ShieldGemma 2**, an image safety checker. Benchmarking shows **Gemma 3 27B** has strong vision and memory efficiency but is outperformed by larger models like **Llama 3.3 70B** and **DeepSeek V3 671B**. The **Hugging Face LLM leaderboard** history was shared by @_lewtun.

Canonical issue URL

AI News for 3/14/2025-3/15/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (222 channels, and 2399 messages) for you. Estimated reading time saved (at 200wpm): 240 minutes. You can now tag @smol_ai for AINews discussions!

Happy 2nd birthday to GPT4 and Claude 1. Few would have guessed the tremendous market share shifts that have happened in the past year.

image.png


SPECIAL NOTE: We are launching the 2025 State of AI Engineering Survey today in preparation for the AI Eng World's Fair in Jun 3-5. Please fill it out to have your voice heard!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Language Models and Model Updates

Model Performance and Benchmarking

AI Applications and Tools

AI and Hardware

AI Conferences and Events

Other

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Gemma 3 Fine-Tuning Revolution: Performance and Efficiency in Unsloth

Theme 2. Sesame CSM 1B Voice Cloning: Expectations vs. Reality

Theme 3. QwQ's Rise: Dominating Benchmarks and Surpassing Expectations

Theme 4. Decentralized LLM Deployment: Akash, IPFS & Pocket Network Challenges

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Advanced AI Video Generation with SDXL, Wan2.1, and Long Context Tuning

Theme 2. OpenAI's Sora: Transforming Cityscapes into Dystopias

Theme 3. OpenAI and DeepSeek: The Open Source Showdown


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Google's Gemma 3 Takes Center Stage Across Tools

Theme 2. New Models Emerge: OLMo 2, Command A, Jamba 1.6, PaliGemma 2 Mix

Theme 3. Coding Tools and IDEs Evolve with AI Integration

Theme 4. Training and Optimization Techniques Advance

Theme 5. Infrastructure and Access: H100s, VRAM, and API Pricing


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Cursor IDE Discord


Eleuther Discord


HuggingFace Discord


Perplexity AI Discord


aider (Paul Gauthier) Discord


Latent Space Discord


LM Studio Discord


Nous Research AI Discord


MCP (Glama) Discord


Interconnects (Nathan Lambert) Discord


OpenRouter (Alex Atallah) Discord


Yannick Kilcher Discord


GPU MODE Discord


OpenAI Discord


Notebook LM Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


Cohere Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


AI21 Labs (Jamba) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (301 messages🔥🔥):

Gemma 3 Support in Unsloth, Multi-GPU Training, Dynamic Quantization vs GGUF, GRPO and Reasoning, Vision Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #announcements (1 messages):

Gemma 3 models, Unsloth support for models, GRPO for reasoning models, QwQ-32B bugfixes, New model uploads

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (5 messages):

Gemma 3, Ollama, Phi Vision, GGUFs vision


Unsloth AI (Daniel Han) ▷ #help (51 messages🔥):

Gemma-3 GGUF and Ollama, Llama 3.2 inference cancellation, Phi-4-mini support, Gemma finetuning error, TurboML Continual Pre-Training

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

Gemma SFT, Maximum Context Length, Memory Usage Calculation

Links mentioned:


Cursor IDE ▷ #general (263 messages🔥🔥):

Cursor performance issues on Linux and Windows, Issues with Claude 3.7, Custom modes in Cursor, Gemini API key issues, Cursor agent spawning terminals

Links mentioned:


Eleuther ▷ #general (2 messages):

LM Studio, SMILES string encoding, ChemDraw


Eleuther ▷ #research (255 messages🔥🔥):

Diffusion Models for Generative Tasks, Search-R1: RL for Autonomous Search Query Generation, Spectral Analysis of Latent Spaces, Noise Sensitivity in Diffusion Models, Inductive Moment Matching (IMM) for Fast Sampling

Links mentioned:


HuggingFace ▷ #general (195 messages🔥🔥):

Ministral 8B, Exaone 8B, Jungle Chess AI, Stable Diffusion, Gemini 2.0

Links mentioned:


HuggingFace ▷ #today-im-learning (1 messages):

ilyachua: Hi all. I am starting on the CV course from hugging face


HuggingFace ▷ #i-made-this (2 messages):

Awesome Vibe Coding, mahimairaja/awesome-csm-1b

Links mentioned:


HuggingFace ▷ #reading-group (1 messages):

generate_without_kv_cache function


HuggingFace ▷ #computer-vision (2 messages):

PaliGemma 2 Mix, smolVLM2, QwenVL, Llama 3.2 Multimodal

Links mentioned:


HuggingFace ▷ #agents-course (25 messages🔥):

SerpAPI Key Errors, Deep RL Course, Interactive IDEs for Agent Code, Image to Video Loops, Gemma3 Issues with SmolAgents

Links mentioned:


Perplexity AI ▷ #general (213 messages🔥🔥):

Complexity Extension issues, Kernel locking, Perplexity context window sizes, Grok 3 bugs, Gemini's deep research

Link mentioned: Ten Thousand: no description found


Perplexity AI ▷ #sharing (3 messages):

OpenAI custom agent, Airpods Live Translation, Anthropic CEO AI quit-button


aider (Paul Gauthier) ▷ #general (133 messages🔥🔥):

Claude with Aider, Rust for Aider, Claude Desktop on Linux, Aider MCP Server, Anthropic Status

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (44 messages🔥):

DeepSeek models configuration, Aider's architect mode behavior, Modifying Aider's completion endpoint, Aider configuration files

Links mentioned:


Latent Space ▷ #ai-general-chat (17 messages🔥):

OLMo 2 32B, AI Engineer Singapore 2025, AI Game Generation, Gemini DeepResearch with 2.0, Claude's Birthday

Links mentioned:


Latent Space ▷ #ai-announcements (3 messages):

Snipd Podcast, AI Podcast App, Latent Space Podcast

Link mentioned: Tweet from Latent.Space (@latentspacepod): 🆕 Snipd: The AI Podcast App for Learninghttps://youtu.be/FNRO_SYx68QOur first ever OUTDOOR podcast! @swyx and @KevinBenSmith chat about @aidotengineer NYC, switching from Finance to Tech, how AI can ...


Latent Space ▷ #ai-in-action-club (120 messages🔥🔥):

Cursor vs Claude, Levelsio flight sim, GitDoc VS Code extension, Vibe Coding IDE UI, Auto-git commit

Links mentioned:


LM Studio ▷ #general (92 messages🔥🔥):

Download LM Studio runtimes, Snapdragon X Plus support, Gemini Vision Capabilities, AI Chess Tournament, VRAM usage for Gemma 3

Links mentioned:


LM Studio ▷ #hardware-discussion (44 messages🔥):

memtest_vulkan, H100 rental t/s, Corsair product quality, 4090 vs A6000, RTX8000

Link mentioned: GitHub - GpuZelenograd/memtest_vulkan: Vulkan compute tool for testing video memory stability: Vulkan compute tool for testing video memory stability - GpuZelenograd/memtest_vulkan


Nous Research AI ▷ #general (120 messages🔥🔥):

ElizaOs API framework, Helius API key pricing, Quicknode API key pricing, DeepHermes-3-Mistral-24B-Preview-4bit MLX, Hermes-3-Llama-3.1-70B-FP8 vllm args

Links mentioned:


MCP (Glama) ▷ #general (90 messages🔥🔥):

MCP for Astro clients, MCP Servers & Architecture, Gitlab MCP server on Windows 11, Agentic Coder Conversion to MCP, Multi-Agent Systems (Swarm vs Mesh vs Sequence)

Links mentioned:


MCP (Glama) ▷ #showcase (3 messages):

MCP server management, Awesome Vibe Coding

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (57 messages🔥🔥):

ZIRP Era Regret, AI Startup Valuations, DeepSeek Passport Confiscation, Long Context Evaluation Challenges, Xet Data Chunking Technology

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (16 messages🔥):

Invasion of Privacy, Claude's Birthday, Claude Code Vim mode, Gemma 3 licensing issues

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (5 messages):

Mid-Training Analysis, SF Compute H100s, SF Compute CLI

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rl (11 messages🔥):

GRPO implementation, KL penalty, RLHF algorithms

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Cohere Command A, Jamba 1.6 Large, Jamba 1.6 Mini, Gemma 3 models, Anthropic incident

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (67 messages🔥🔥):

OR ChatGPT model, OpenRouter model icons, Deepseek v3 issues, OLMO-2, Cohere repetition penalties

Links mentioned:


Yannick Kilcher ▷ #general (59 messages🔥🔥):

Rust vs. Go for porting, DeepSeek Hype, OLMo 2 32B, ChatGPT Overrated, Code generation quality: Grok 3 vs Mistral vs OpenAI

Links mentioned:


Yannick Kilcher ▷ #ml-news (1 messages):

@erkinalp:

.ogeneral: I would say neither


GPU MODE ▷ #general (3 messages):

Speech-to-Speech Generation, Moshi by Kyutai Labs, Hertz-dev by Standard-Intelligence

Links mentioned:


GPU MODE ▷ #triton (3 messages):

tl.int1 masks in Triton, tl.advance negative offsets, Triton Windows upgrade to 3.2

Link mentioned: no title found: no description found


GPU MODE ▷ #cuda (6 messages):

cuda::memcpy_async, A100, global vs shared memory


GPU MODE ▷ #off-topic (1 messages):

Block Diffusion, Autoregressive Models, Diffusion Models, ICLR 2025

Link mentioned: SOCIAL MEDIA TITLE TAG: SOCIAL MEDIA DESCRIPTION TAG TAG


GPU MODE ▷ #tilelang (2 messages):

Dynamic Shapes, Segmentation Fault

Link mentioned: segmentation fault with dynamic shapes · Issue #215 · tile-ai/tilelang: # Copyright (c) Microsoft Corporation. # Licensed under the MIT License. from tilelang import tvm as tvm import tilelang.language as T import tilelang.testing import tilelang import torch def matmu...


GPU MODE ▷ #liger-kernel (1 messages):

Gemma3, LigerKernel, RMSNorm

Link mentioned: Adding Support for Gemma3 by DRXD1000 · Pull Request #606 · linkedin/Liger-Kernel: SummaryGemma3 has high similarities to Gemma2 with some differences in RMSNorm CallsThis change enables patching the Text Parts of Gemma3 with Liger kernels.Testing DoneHardware Type: AMD ...


GPU MODE ▷ #self-promotion (2 messages):

Triton bitpacking, Gemlite, GTC CUDA content

Links mentioned:


GPU MODE ▷ #reasoning-gym (31 messages🔥):

Gemma 3 support in vLLM, Group Relative Policy Optimization (GRPO), veRL Training for reasoning-gym, composite configurations in reasoning-gym, curriculum training

Links mentioned:


GPU MODE ▷ #general (7 messages):

verl session, tilelang submission, pip install tilelang


GPU MODE ▷ #submissions (3 messages):

Leaderboard Submissions, Grayscale Leaderboard, Conv2d Leaderboard, H100 GPUs, Modal Runners


OpenAI ▷ #ai-discussions (46 messages🔥):

Declining intelligence, impact of technology and smartphones, food additives and cognitive decline, Deepseek models distillation, ADHD diagnosis rates


OpenAI ▷ #gpt-4-discussions (1 messages):

Claude Sonnet 3.7, o3-mini-high vs o1


Notebook LM ▷ #use-cases (4 messages):

Gemini 2.0 Deep Research, NotebookLM, PhytoIntelligence framework


Notebook LM ▷ #general (41 messages🔥):

Image and table recognition in Notebook LM, Notebook LM Mobile App, Notebook LM Language Settings, Public Notebook Sharing, Google Sheets Integration

Link mentioned: Cat Wait GIF - Cat Wait Im - Discover & Share GIFs: Click to view the GIF


LlamaIndex ▷ #blog (1 messages):

Google Gemini, Google Vertex AI, Unified @googleai integration, Streaming, Async


LlamaIndex ▷ #general (8 messages🔥):

LlamaIndex vs Langchain, OpenAI delta events for tool calling, Agentic RAG applications


Nomic.ai (GPT4All) ▷ #general (7 messages):

Gemma 3 12B, Qwen 2.5 Coder, LM Studio, Multimodal Models, Water Freezing Experiment


DSPy ▷ #general (6 messages):

Explicit Feedback in dspy.Refine, Manual Feedback Implementation in Refine, Reward Function Return Value for Feedback


Cohere ▷ #「💬」general (5 messages):

Command A, OpenRouter Integration, Prime Number Bug, Local API Performance

Link mentioned: Command A - API, Providers, Stats: Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases.Compared to other leading propri...


Cohere ▷ #「🔌」api-discussions (1 messages):

michael: it does, use the https://api.cohere.com/compatibility/v1/chat/completions base_url


Modular (Mojo 🔥) ▷ #general (3 messages):

Discord Scam Account, Account Impersonation


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Discord impersonation, Discord account security


Modular (Mojo 🔥) ▷ #mojo (1 messages):

soracc: Yea, we use it in the stdlib (e.g. in base64) as well.


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

Self-Evaluation, Self-Reflection, Self-Refinement, Oracle Feedback


AI21 Labs (Jamba) ▷ #general-chat (1 messages):

Vertex, AWS






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}