Frozen AI News archive

Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen

**Google DeepMind** launched the **Gemma 3** family of models featuring a **128k context window**, **multimodal input (image and video)**, and **multilingual support for 140+ languages**. The **Gemma 3-27B** model ranks among the top open models on LMArena benchmarks, outperforming several competitors and matching **Gemini-1.5-Pro** on benchmarks. Additionally, **Gemini 2** introduced **Flash Native Image Generation** with advanced image editing capabilities, a feature teased by OpenAI but not launched. The updates highlight significant advances in context length, multimodality, and model efficiency via quantization.

Canonical issue URL

AI News for 3/12/2025-3/13/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (224 channels, and 2511 messages) for you. Estimated reading time saved (at 200wpm): 275 minutes. You can now tag @smol_ai for AINews discussions!

Today's o1-preview (at this point the only model competitive with Flash Thinking at AINews tasks, and yes o1-preview is better than o1-full or o3-mini-high) Discord recap is spot on - Google took the occasion of their Gemma Developer Day in Paris to launch a slew of notable updates:

image.png

https://www.youtube.com/watch?v=UU13FN2Xpyw

Gemma 3. People are loving that it is 128k context. Other than of course strong LMArena scores for an open model:

image.png

it is also a new Pareto frontier for its weight class by a country mile:

image.png

It also looks to completely subsume PaliGemma in incorporating vision as a first class capability (ShieldGemma is still a thing).

Gemini Flash Native Image Generation.

as teased at the Gemini 2 launch (our coverage here), Gemini 2 actually launched image editing, which OpenAI teased and never launched, and the results are pretty spectacular (if you can figure out how to find it in the complicated UI). Image editing has never been this easy.

https://x.com/19kaushiks/status/1899856652666568732?s=46

https://x.com/m__dehghani/status/1899854209081868663?s=46

https://x.com/multimodalart/status/1899881757396099231

https://x.com/fofrAI/status/1899927094727000126


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Model Releases and Updates: Gemma 3 Family

Robotics and Embodied AI

AI Agents and Tooling

Performance and Optimization in AI

AI Research and Papers

Industry and Business

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Gemma 3 Multimodal Release: Vision, Text, and 128K Context

Theme 2. Unsloth's GRPO Modifications: Llama-8B's Self-Learning Improvements

Theme 3. DeepSeek R1 on M3 Ultra: Insights into SoC Capabilities

Theme 4. Gemma 3 Open-Source Efforts: Llama.cpp and Beyond

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. DeepSeek and ChatGPT Censorship: Observations and Backlash

Theme 2. Claude Sonnet 3.7: A Standout in Coding Conversion Tasks

Theme 3. Open-Source Text-to-Video Innovations: New Viral Demos

Theme 4. Spain's AI Content Labeling Mandate: Legal and Societal Implications

Theme 5. Symbolism of the ✨ Emoji: Emergence as an AI Icon


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1: Google's New Multimodal Marvels Take the AI Stage

Theme 2: New AI Models Challenge the Big Guys

Theme 3: AI Tools Can't Catch a Break

Theme 4: Innovation Sparks in AI Tool Integration

Theme 5: Debates Heat Up Over LLM Behaviors


PART 1: High level Discord summaries

Cursor IDE Discord


LM Studio Discord


Nous Research AI Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


aider (Paul Gauthier) Discord


OpenAI Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


GPU MODE Discord


Interconnects (Nathan Lambert) Discord


Nomic.ai (GPT4All) Discord


MCP (Glama) Discord


Codeium (Windsurf) Discord


Yannick Kilcher Discord


Latent Space Discord


Notebook LM Discord


Torchtune Discord


LlamaIndex Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


DSPy Discord


Modular (Mojo 🔥) Discord


Gorilla LLM (Berkeley Function Calling) Discord


AI21 Labs (Jamba) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (468 messages🔥🔥🔥):

Claude 3.7 high load issues, Cursor UI sluggishness, Manus AI and OpenManus, Cline vs Cursor, MCP for Blender

Links mentioned:


LM Studio ▷ #announcements (1 messages):

LM Studio 0.3.13, Google Gemma 3 support, Bug Fixes

Link mentioned: Download LM Studio - Mac, Linux, Windows: Discover, download, and run local LLMs


LM Studio ▷ #general (136 messages🔥🔥):

LM Runtime, Gemma 3 Support, Turn off RAG in LM Studio, Gemma 3 Model Problems, Image Support with Gemma 3

Links mentioned:


LM Studio ▷ #hardware-discussion (201 messages🔥🔥):

ROCm on RX 9000 series, 9070XT reliability, 7900XTX thermal issues, CMP-40HX as an inference card, Phase change thermal paste alternatives

Links mentioned:


Nous Research AI ▷ #announcements (1 messages):

Inference API, Hermes 3 Llama 70B, DeepHermes 3 8B Preview, Nous Portal, API keys

Link mentioned: Nous Portal: no description found


Nous Research AI ▷ #general (286 messages🔥🔥):

LLM Facial Memory System, Pre-loading Credits for Inference API, Graph Reasoning System, Forest-of-Thought, Graph Theory with LLMs

Links mentioned:


Nous Research AI ▷ #interesting-links (10 messages🔥):

audio-flamingo-2, music key detection, royals lorde

Link mentioned: Audio Flamingo 2 - a Hugging Face Space by nvidia: no description found


Unsloth AI (Daniel Han) ▷ #general (176 messages🔥🔥):

Gemma 3 GGUF release, Fine-tuning Gemma 3, Transformers bug, RLHF methods (PPO, DPO, GRPO, RLOO), London, Paris, Berlin multimodal creative AI HackXelerator

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (22 messages🔥):

ChatGPT 4.5 Trolling, Multi-TPU Settings in JAX, Reproducibility Issues in LLM Training, London Paris Berlin AI HackXelerator, Training LLM from Scratch

Link mentioned: LPB 25 - London, Paris, Berlin multi-modal AI Launch Event · Luma: Join Us for the London Paris Berlin 25 AI HackXelerator™ Launch!📍 Central London | 🗓️ Starts 5 April 2025LPB25 blends the energy of a hackathon with the…


Unsloth AI (Daniel Han) ▷ #help (56 messages🔥🔥):

Gemma 3 27b as thinking model, Training for news writing (DPO, ORPO, KTO, GRPO), Unsloth import errors, LoRA vs QLoRA in Unsloth, Finetuning LLava7b in Colab

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

GRPO, Finetuning for Exact Output, Data Preparation for Qwen2.5-VL-7B


Perplexity AI ▷ #general (242 messages🔥🔥):

AI agent called ANUS, Think, Wait, Act, Talk (TWAT) pipeline for AIs, Internal server error 500 with Apple login, Model selector gone in new web update, Perplexity code cleanup epic fail

Links mentioned:


Perplexity AI ▷ #sharing (18 messages🔥):

Bluesky CEO trolls Zuckerberg, Tesla Doubles US Production, Education Department's Massive Loss, Greenland Rejects Trump's offer, PSG Eliminates Liverpool


Perplexity AI ▷ #pplx-api (1 messages):

MCP Server, ModelContextProtocol, Perplexity API connector

Link mentioned: GitHub - ppl-ai/modelcontextprotocol: A Model Context Protocol Server connector for Perplexity API, to enable web search without leaving the MCP ecosystem.: A Model Context Protocol Server connector for Perplexity API, to enable web search without leaving the MCP ecosystem. - ppl-ai/modelcontextprotocol


aider (Paul Gauthier) ▷ #general (75 messages🔥🔥):

Gemma 3 release, OlympicCoder Model, Fast Apply for Code Edits, Aider recording feedback, Jetbrains Junie Access

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (71 messages🔥🔥):

Drop repo map, Web search, Claude 3.7 Thinking display, LM Studio error, Aider Usage

Links mentioned:


aider (Paul Gauthier) ▷ #links (7 messages):

LLMs for coding, Productivity boost from LLMs, LLMs helping learn new languages

Link mentioned: Here’s how I use LLMs to help me write code: Online discussions about using Large Language Models to help write code inevitably produce comments from developers who’s experiences have been disappointing. They often ask what they’re doing wrong—h...


OpenAI ▷ #ai-discussions (100 messages🔥🔥):

AI Research Tool Hierarchy, Python vs C# for AI Inference, LLMs and Hallucination Misinformation, Gemini's Native Image Capabilities, Marketing Content with AI

Link mentioned: Introducing Gemini Robotics and Gemini Robotics-ER, AI models designed for robots to understand, act and react to the physical world.: Introducing Gemini Robotics and Gemini Robotics-ER, AI models designed for robots to understand, act and react to the physical world.


OpenAI ▷ #gpt-4-discussions (5 messages):

Image generation, Ethical Reminders in ChatGPT, ChatGPT's intent clarification


OpenAI ▷ #prompt-engineering (21 messages🔥):

Emotional Prompting, Prompt Personalization, Hugging Face for prompt engineering papers, Chain of Thought paper, GPT Customization


OpenAI ▷ #api-discussions (21 messages🔥):

Emotional Prompting, Personalization with models, Hugging Face for prompt engineering papers, Chain of Thought Prompting, Markdown Prompting


HuggingFace ▷ #general (35 messages🔥):

Python vs C# for AI inference, Document Image Quality Assessment, LTX Video DiT Model, Vision Language Models (VLMs), Persistent Storage for Models

Links mentioned:

For Inference Providers who have built support for our…": no description foundModel does not exist, inference API don't work: Hi! We’re taking a closer look into this and I’ll update you soon. Thanks for reporting!


HuggingFace ▷ #today-im-learning (2 messages):

Unsloth for fine-tuning, QA legal dataset in Ukrainian, ZeRO paper


HuggingFace ▷ #cool-finds (2 messages):

Wan2.1 Image to Video, Modal Deployments

Link mentioned: Deploy Wan2.1 Image to Video model for free on Modal: Welcome to our in-depth tutorial on Wan2.1GP—your go-to resource for seamless modal installations and Python scripting! In this video, we cover everything yo...


HuggingFace ▷ #i-made-this (5 messages):

Wan2.1 Image to Video model, Modal deployment, narrative voice for videos, elevenlabs Thomas, AclevoGPT-Gemma-2b-CoT-reasoning-GGUF

Links mentioned:


HuggingFace ▷ #reading-group (3 messages):

Chip Huyen books, ML Systems books, AI engineering books, O'Reilly bookstore recommendations


HuggingFace ▷ #computer-vision (1 messages):

TensorFlow GPU Configuration, Logical and Physical Devices in TensorFlow, NVIDIA GeForce RTX 3050 Laptop GPU

Link mentioned: TensorFlow (experimental) GPU configuration: In this blog, I will discuss the techniques and methods for GPU configuration available from TensorFlow 2.16.1, which is the latest version…


HuggingFace ▷ #NLP (1 messages):

SentenceTransformer, PyTorch


HuggingFace ▷ #smol-course (1 messages):

Tokenizer Message Passing, Dataset Processing


HuggingFace ▷ #agents-course (44 messages🔥):

Agent name variable corruption, Unit 2.3 Availability (LangGraph), Quiz access issues, Local models with smolagents, HF Channel Access


HuggingFace ▷ #open-r1 (1 messages):

lunarflu: thanks for the feedback! excited for anything in particular in the future?


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Gemma 3, Reka Flash 3, Llama 3.1 Swallow 70B, Multimodality, Vision-language input

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (85 messages🔥🔥):

Gemini 2 Flash, Gemma Models, Chutes Provider, Provider Routing, Qwen finetune issues

Links mentioned:


Eleuther ▷ #general (1 messages):

Distill Meetup, Explainable AI Reading Group

Link mentioned: Exploring Explainables Reading Group: Welcome to the Exploring Explainables Reading Group! We use this document to keep track of readings, take notes during our sessions, and get more people excited about interactive scientific communica...


Eleuther ▷ #research (73 messages🔥🔥):

TTT acceleration, Decoder-only architecture expansion, Constant Entropy Expectation, AIME 24 evaluation

Links mentioned:


Eleuther ▷ #lm-thunderdome (5 messages):

AIME24 implementation in lm-eval-harness, math_verify utility, Multilingual perplexity evals

Link mentioned: GitHub - EleutherAI/lm-evaluation-harness at aime24: A framework for few-shot evaluation of language models. - GitHub - EleutherAI/lm-evaluation-harness at aime24


GPU MODE ▷ #general (1 messages):

cappuccinoislife: hi alll


GPU MODE ▷ #triton (5 messages):

VectorAdd issues, GPU programming mantra, Triton community meetup URL

Link mentioned: Triton community meetup March 2025: 🎙️ New to streaming or looking to level up? Check out StreamYard and get $10 discount! 😍 https://streamyard.com/pal/d/6451380426244096


GPU MODE ▷ #cuda (24 messages🔥):

funnel shift performance, variable rate compression, trellis scheme, tensor fragments, predicated funnel shift


GPU MODE ▷ #cool-links (2 messages):

UT Austin Deep Learning Lectures, TensorFlow OpenCL Flame War

Links mentioned:


GPU MODE ▷ #beginner (9 messages🔥):

GPU Architecture Books, PMPP Alternatives, CUDA mock interviews


GPU MODE ▷ #torchao (3 messages):

Float8 Conv, INT8 Conv, Static Quantization


GPU MODE ▷ #self-promotion (2 messages):

FlashAttention for Turing, Weight absorption for MLA

Links mentioned:


GPU MODE ▷ #thunderkittens (2 messages):

Memory Allocation Issues in H100, ThunderKittens Kernel Modifications, Tensor Concatenation Alternatives

Links mentioned:


GPU MODE ▷ #reasoning-gym (1 messages):

``


GPU MODE ▷ #submissions (2 messages):

Modal Runners success, Leaderboard submissions


Interconnects (Nathan Lambert) ▷ #news (44 messages🔥):

Gemma 3 Models, AlphaXiv, Gemini 2.0 Flash, Open Weight Models, DeepMind Robotics

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (2 messages):

Copyright law, Machine Learning, Privacy, Verbatim output

Link mentioned: What my privacy papers (don't) have to say about copyright and generative AI : no description found


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

Content Filters, Claude Code

Link mentioned: Tweet from mgostIH (@mgostIH): Content filters have been a disaster for AI


Interconnects (Nathan Lambert) ▷ #reads (1 messages):

Elicitation Theory, Deep Learning

Link mentioned: On Deep Learning and Farming: It's still 1915: What agriculture can teach us about AI development


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

SnailBot News: <@&1216534966205284433>


Nomic.ai (GPT4All) ▷ #general (48 messages🔥):

Model size and intelligence, GPT4All vs Ollama for server model management, Deepseek 14B or 7B vs Llama 8B, Large context window models, GPT4All limitations with Gemma 3


MCP (Glama) ▷ #general (30 messages🔥):

Glama AI, MCP Logging, Claude Image Rendering, NPM Package Storage, MCP Server Connection Status

Links mentioned:


MCP (Glama) ▷ #showcase (8 messages🔥):

MCP Agent, OpenAI Agent SDK, MCP Servers, unRAID MCP server, MCP Fathom Analytics

Links mentioned:


Codeium (Windsurf) ▷ #discussion (37 messages🔥):

Codeium Extension Issues, Protocol Errors, Neovim Support, VPN Workaround


Yannick Kilcher ▷ #general (14 messages🔥):

YC copycat startups, Maxwell's demon and slow AI, Adaptive Meta-Learning projects, Sakana AI scientist LLM scaling

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (6 messages):

Forward vs backward SDE, Reverse-diffusion SDE


Yannick Kilcher ▷ #agents (1 messages):

mico6424: Which cognitive architecture has a working implementation that's worth looking into?


Yannick Kilcher ▷ #ml-news (7 messages):

Gemma 3, Multimodal Models, Sakana AI paper

Links mentioned:


Latent Space ▷ #ai-general-chat (25 messages🔥):

Mastra AI Framework, Cursor SOTA Embedding Model, Typescript AI Apps, Gemini Native Image Generation, DeepSearch and DeepResearch

Links mentioned:


Notebook LM ▷ #announcements (1 messages):

User Research, Mobile Usage, NotebookLM on Mobile, Usability Study

Link mentioned: Participate in an upcoming NotebookLM user research study!: Hello,I’m contacting you with a short questionnaire to verify your eligibility for an upcoming usability study with Google. This study is an opportunity to provide feedback on something that's cur...


Notebook LM ▷ #use-cases (4 messages):

NoteBookLM Plus, Internal FAQ, API instructions


Notebook LM ▷ #general (12 messages🔥):

RAG vs Full Context Window, Saving Chat Responses, Thinking Model Updates, Language Support

Link mentioned: Hyperborea - Wikipedia: no description found


Torchtune ▷ #general (7 messages):

MPS Issues, Gemma 3, Torchvision MPS Errors

Links mentioned:


Torchtune ▷ #dev (2 messages):

Gemma3, vLLM, Pan & Scan

Link mentioned: [Model] Add support for Gemma 3 by WoosukKwon · Pull Request #14660 · vllm-project/vllm: This PR adds the support for Gemma 3, an open-source vision-language model from Google.NOTE:The PR doesn&#39;t implement the pan-and-scan pre-processing algorithm. It will be implemented by a fo.....


LlamaIndex ▷ #blog (1 messages):

LlamaIndex, Model Context Protocol (MCP), tool discovery


LlamaIndex ▷ #general (7 messages):

LlamaExtract on-premise, New Response API support


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Quiz Deadlines, Research Opportunities for MOOC Learners, Lab Opportunities for MOOC Learners


Cohere ▷ #「💬」general (2 messages):

Cohere multilingual embed model pricing, OpenAI Responses API, Cohere Compatibility

Link mentioned: Web Search and States with Responses API | OpenAI Cookbook: Open-source examples and guides for building with the OpenAI API. Browse a collection of snippets, advanced techniques and walkthroughs. Share your own examples and guides.


Cohere ▷ #「🔌」api-discussions (1 messages):

Chat API seed parameter issue, Inconsistent outputs with same seed


DSPy ▷ #general (2 messages):

DSPy Caching, Pluggable Cache Module

Link mentioned: Feature/caching by hmoazam · Pull Request #1922 · stanfordnlp/dspy: One single caching interface which has two levels of cache - in memory lru cache and fanout (on disk)


Modular (Mojo 🔥) ▷ #mojo (2 messages):

modular max, Linux exec issues, github PR

Link mentioned: [stdlib] Adds functionality to spawn and manage processes from exec. file by izo0x90 · Pull Request #3998 · modular/max: Foundation for this PR is set here, it adds the needed lowlevel utilities:Adds vfork, execvp, kill system call utils. to Mojos cLib bindsAdds read_bytes to file descriptorOnce that PR is merge...


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

Tracking evaluation tools, Evaluation dataset location


AI21 Labs (Jamba) ▷ #jamba (1 messages):

RAG, Pinecone, VPC deployment



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}