Frozen AI News archive

not much happened today

**DeepSeek R1** demonstrates significant efficiency using **FP8** precision and outperforms **Gemma 3 27B** with a **Chatbot Arena Elo Score** of **1363** vs. **1338**, but it requires substantial hardware: **32 H100 GPUs** and **2,560GB VRAM**. **OpenAI** labels **DeepSeek** as "state-controlled" and calls for bans on "PRC-produced" models, sparking community backlash that accuses **OpenAI** and **Sam Altman** of anti-competitive behavior. Discussions emphasize **DeepSeek's** openness and affordability compared to **OpenAI**, with users highlighting its local and Hugging Face deployment options. Meanwhile, **Gemma 3** receives mixed community feedback on creativity and worldbuilding.

AI News for 3/12/2025-3/13/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (222 channels, and 5887 messages) for you. Estimated reading time saved (at 200wpm): 616 minutes. You can now tag @smol_ai for AINews discussions!

This is the state of models after yesterday's Gemma 3 drop and today's Command A:

image.png

the Windsurf talk from AIE NYC is somehow doing even better than the MCP workshop.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

outage in our scraper today; sorry.


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek R1's FP8 training and efficiency prowess

Theme 2. Gemma 3's Technical Highlights and Community Impressions

Theme 3. Innovation in Large Language Models: Cohere's Command A

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Claude 3.7 Sonnet Creates Unbeatable AI in Arcade Games

Theme 2. Gemini 2.0 Flash: Native Image Generation Now Available

Theme 3. Dramatically Enhance Video AI Quality with Wan 2.1


AI Discord Recap

A summary of Summaries of Summaries by o1-mini-2024-09-12

Anthropic’s Claude Slashes API Costs with Clever Caching

Google and Cohere Battle it Out with Command A and Gemini Flash

LM Studio and OpenManus: Tool Integrations Fuel AI Innovations

AI Development Dilemmas: From Cursor Crashes to Fine-Tuning Fiascos

Policy Prowess: OpenAI’s Push to Ban PRC Models Raises Eyebrows

AI in Research, Education, and Function Calling


PART 1: High level Discord summaries

Cursor IDE Discord


Nous Research AI Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Perplexity AI Discord


OpenAI Discord


HuggingFace Discord


Interconnects (Nathan Lambert) Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


Cohere Discord


MCP (Glama) Discord


Notebook LM Discord


GPU MODE Discord


Yannick Kilcher Discord


Nomic.ai (GPT4All) Discord


Latent Space Discord


LlamaIndex Discord


LLM Agents (Berkeley MOOC) Discord


Modular (Mojo 🔥) Discord


Gorilla LLM (Berkeley Function Calling) Discord


DSPy Discord


tinygrad (George Hotz) Discord


AI21 Labs (Jamba) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (1134 messages🔥🔥🔥):

Claude 3.7 API updates, Cursor slowness/instability issues, Open source alternatives to Manus, MCP for Blender, Cursor Updates and Version Confusion


Nous Research AI ▷ #announcements (2 messages):

Inference API release, Hermes 3 Llama 70B, DeepHermes 3 8B Preview, Hybrid Reasoners, DeepHermes 24B


Nous Research AI ▷ #general (684 messages🔥🔥🔥):

LLM Facial Memory System, Inference API Credit Pre-loading, Graph Reasoning Systems with Open Source Code, Graph Theory, Gemma-3 and LM Studio Integration


Nous Research AI ▷ #ask-about-llms (1 message):

AI Compilers, Deep Learning Compilation


Nous Research AI ▷ #research-papers (2 messages):

Sakana AI, Model Memorization


Nous Research AI ▷ #interesting-links (11 messages🔥):

Audio-Flamingo-2, Agent Engineering


Nous Research AI ▷ #research-papers (2 messages):

Sakana AI, AI Training Data


Unsloth AI (Daniel Han) ▷ #general (503 messages🔥🔥🔥):

Gemma 3, GGUF, Transformers issue, RLHF, H100


Unsloth AI (Daniel Han) ▷ #off-topic (41 messages🔥):

GPT-4.5 Trolling, Multi-TPU implementation, Reproducibility issues in model training, London Paris Berlin AI HackXelerator, Training LLMs from scratch


Unsloth AI (Daniel Han) ▷ #help (127 messages🔥🔥):

Gemma 3 27b as a thinking model, GRPO training, Qwen2.5 model template, GGUF models, lora performance problems


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

Reflection Pattern, ReACT Pattern, Agentic Workflows, Unsloth PR


Unsloth AI (Daniel Han) ▷ #research (20 messages🔥):

GRPO and response quality, Finetuning for exact output, Structured outputs, Guided decoding accuracy, Qwen2.5-VL-7B finetuning data


LM Studio ▷ #announcements (2 messages):

LM Studio 0.3.13, Google Gemma 3 support, GGUF and MLX models, Image processing NVIDIA / AMD GPUs, llama.cpp runtime to 1.19.2

Link mentioned: Download LM Studio - Mac, Linux, Windows: Discover, download, and run local LLMs


LM Studio ▷ #general (267 messages🔥🔥):

LM Runtime Development, Gemma 3 support in LM Studio, RAG control in LM Studio, ROCm support for 9070 series, Gemma 3's image support


LM Studio ▷ #hardware-discussion (254 messages🔥🔥):

Vulkan vs ROCm speed, 9070 GPU breaking, 7900 XTX Hotspot issues, PTM 7950 thermal paste, Nvidia CMP-40HX for AI inference


aider (Paul Gauthier) ▷ #general (329 messages🔥🔥):

Gemma 3 Release, OlympicCoder Model, Zed's Edit Prediction, Aider MCP Server, Claude's text_editor Tool


aider (Paul Gauthier) ▷ #questions-and-tips (85 messages🔥🔥):

Drop Repo Map, Add Websearch, Claude 3.7, LM Studio error, aider vs ChatGPT


aider (Paul Gauthier) ▷ #links (7 messages):

LLMs for Code, Using LLMs, AI Assisted Programming, Productivity Boost from LLMs, LLMs to learn new languages

Link mentioned: Here’s how I use LLMs to help me write code: Online discussions about using Large Language Models to help write code inevitably produce comments from developers who’s experiences have been disappointing. They often ask what they’re doing wrong—h...


Perplexity AI ▷ #general (395 messages🔥🔥):

ANUS AI naming, Windows app Apple ID, Sonar LLM, Model selector issues, Comet Browser


Perplexity AI ▷ #sharing (24 messages🔥):

Bluesky trolls Zuckerberg, Tesla doubles production, Gmail AI calendar integration, Meta AI decodes thoughts


Perplexity AI ▷ #pplx-api (1 message):

MCP Server, ModelContextProtocol

Link mentioned: GitHub - ppl-ai/modelcontextprotocol: A Model Context Protocol Server connector for Perplexity API, to enable web search without leaving the MCP ecosystem.


OpenAI ▷ #ai-discussions (278 messages🔥🔥):

AI Research Tools Hierarchy, Python vs C# for AI Inference Speed, Gemini 2.0 Flash Native Image Generation, AI Safety and Ethical Concerns


OpenAI ▷ #gpt-4-discussions (7 messages):

ChatGPT Ethical Reminders, ChatGPT Intent Clarification, ChatGPT Reasoning Refinement


OpenAI ▷ #prompt-engineering (21 messages🔥):

Emotional Prompting, Prompt Engineering, Chain of Thought, Threatening AI Models, Personalized vs. Generalized Models


OpenAI ▷ #api-discussions (21 messages🔥):

Emotional Prompting, Prompt Engineering Papers, Chain of Thought Prompting, Minimal Threat Prompting


HuggingFace ▷ #general (204 messages🔥🔥):

Python for AI transformer models, vLLM vs Transformers performance, Document image quality assessment, LTX Video DiT model, Vision Language Models

Links mentioned:

For Inference Providers who have built support for our…"

Model does not exist, inference API don't work: Hi! We’re taking a closer look into this and I’ll update you soon. Thanks for reporting!

merve (Merve Noyan)

Qwen/Qwen2.5-14B-Instruct-1M · Hugging Face

mistralai/Mistral-Nemo-Instruct-2407 · Hugging Face

meta-llama/Llama-3.1-8B-Instruct · Hugging Face

open-r1/OlympicCoder-7B · Hugging Face


HuggingFace ▷ #today-im-learning (5 messages):

Unsloth Fine-Tuning, ZeRO Paper, Gemma 3 Knowledge Distillation, OpenCV bootcamp


HuggingFace ▷ #cool-finds (6 messages):

Wan2.1 Image to Video Model, Quantized LLMs for Coding, Extreme Quantizations Fine-Tuning, AI Agents Directory, Embedder Models Collection


HuggingFace ▷ #i-made-this (16 messages🔥):

Wan2.1 Image to Video model, Narrative voice for videos, Gemma 2b finetune, Reflection and ReACT patterns, Kyro-n1.1-3B reasoning


HuggingFace ▷ #reading-group (3 messages):

Chip Huyen books, ML Systems Book, AI Engineering Book


HuggingFace ▷ #computer-vision (1 message):

TensorFlow GPU Configuration, TensorFlow 2.16.1, NVIDIA GeForce RTX 3050

Link mentioned: TensorFlow (experimental) GPU configuration: In this blog, I will discuss the techniques and methods for GPU configuration available from TensorFlow 2.16.1, which is the latest version…


HuggingFace ▷ #NLP (3 messages):

SentenceTransformer training with PyTorch, Data augmentation for text translation, COLING paper on translation


HuggingFace ▷ #smol-course (4 messages):

Tokenizer Implementation, Agent Tool Use, Color Mixing Tool, Tool Definition Error


HuggingFace ▷ #agents-course (76 messages🔥🔥):

Agent Name Corruption, Unit 2.3, Local Models in SmolAgents, HF Channel Access, Text-to-Video API

Link mentioned: Tweet from nikmcfly.btc (@nikmcfly69): 🤯 BREAKING: Manus AI created its own open-source alternative. In 25 min, it built a complete AI agent system from scratch!ANUS (Autonomous Networked Utility System)—@eugeneshilow's brilliant ide...


HuggingFace ▷ #open-r1 (1 message):

lunarflu: thanks for the feedback! excited for anything in particular in the future?


Interconnects (Nathan Lambert) ▷ #news (177 messages🔥🔥):

Gemma 3 Creative Writing, alphaXiv vs HuggingFace papers, Gemini 2.0 Flash Native Image Out, Vertical RL-tuned models, Chinese weights


Interconnects (Nathan Lambert) ▷ #ml-drama (2 messages):

Copyright violation, Privacy and Security, Stable Diffusion

Link mentioned: What my privacy papers (don't) have to say about copyright and generative AI


Interconnects (Nathan Lambert) ▷ #random (20 messages🔥):

Gemma 3 Training Cost, Gemini GIF Animations, Tuning Character/Personality on Open Models, Gemini Flash 2.0 Experimental


Interconnects (Nathan Lambert) ▷ #memes (8 messages🔥):

Content filters disaster for AI, Commit changes only when asked, Meme served on a silver platter, Good response meme, CSO title search


Interconnects (Nathan Lambert) ▷ #cv (1 message):

Reasoning VLM, Autonomous Driving, AlphaDrive, MetaAD dataset

Link mentioned: Tweet from Jim Bohnslav (@jbohnslav): AlphaDrive: Trains a reasoning VLM to output multiple discrete action plans (accelerate, turn left) for autonomous driving. Much better than zero-shot or SFT on MetaAD, a new dataset of 110K 3s clips. ...


Interconnects (Nathan Lambert) ▷ #reads (1 message):

Elicitation Theory, Deep Learning as Farming

Link mentioned: On Deep Learning and Farming: It's still 1915: What agriculture can teach us about AI development


Interconnects (Nathan Lambert) ▷ #posts (3 messages):

SnailBot News


Interconnects (Nathan Lambert) ▷ #policy (32 messages🔥):

OpenAI policy proposals, US AI Action Plan, DeepSeek, Google AI policy, AI copyright


Eleuther ▷ #general (56 messages🔥🔥):

Distill Meetup, Career Advice for AI Engineer, VSCode Python Indexing

Link mentioned: Exploring Explainables Reading Group: Welcome to the Exploring Explainables Reading Group! We use this document to keep track of readings, take notes during our sessions, and get more people excited about interactive scientific communica...


Eleuther ▷ #research (134 messages🔥🔥):

TTT Acceleration, DeltaProduct Gradient, Dynamic Computation, Thinking Tokens, AIME 24 evaluation


Eleuther ▷ #interpretability-general (8 messages🔥):

Evaluating patching effect on Chain of Thought (CoT) answers, LatentCache construction for interpretability, Delphi library for activation collection


Eleuther ▷ #lm-thunderdome (5 messages):

MATH implementation, AIME24 implementation, math_verify utility, multilingual perplexity evals

Link mentioned: GitHub - EleutherAI/lm-evaluation-harness at aime24: A framework for few-shot evaluation of language models.


OpenRouter (Alex Atallah) ▷ #announcements (5 messages):

Gemma 3, Reka Flash 3, Llama 3.1 Swallow 70B, Anthropic downtime, OpenAI web search models


OpenRouter (Alex Atallah) ▷ #general (161 messages🔥🔥):

Flash model issues, OpenRouter API delay, Gemma model performance, Gemini 2 Flash native image output, Chutes free inference


Cohere ▷ #「💬」general (74 messages🔥🔥):

Cohere Multilingual Embed Model Pricing, OpenAI Responses API & Agents SDK compatibility with Cohere, Command-A-03-2025 model, Command A vs GPT-4o performance, Command A in sandbox


Cohere ▷ #【📣】announcements (1 message):

Command A release, enterprise model, Cohere API


Cohere ▷ #「🔌」api-discussions (45 messages🔥):

Chat API seed parameter issue, OpenAI Compatibility API errors, Tool parameters validation in Cohere API


Cohere ▷ #「🤝」introductions (3 messages):

RAG, unsupervised machine translation, CSAM detection, visual novel scene generation, Cohere models advantages


MCP (Glama) ▷ #general (94 messages🔥🔥):

Glama MCP server API, Python SDK Logging, Claude image object rendering, NPM packages, RAG vs MCP


MCP (Glama) ▷ #showcase (17 messages🔥):

Model Context Protocol, MCP Server Implementations, OpenAI Agents SDK, Ash Framework Integration


Notebook LM ▷ #announcements (1 message):

User Research, Mobile Usage, Usability Study, Google product enhancements

Link mentioned: Participate in an upcoming NotebookLM user research study!: Hello, I’m contacting you with a short questionnaire to verify your eligibility for an upcoming usability study with Google. This study is an opportunity to provide feedback on something that's cur...


Notebook LM ▷ #use-cases (9 messages🔥):

NotebookLM as internal FAQ, Chat History Access, Generating Scripts with API, Custom Chat Settings for Response Quality, Podcast Generation in Brazilian Portuguese

Link mentioned: NotebookLM: This FREE Google AI Tool Is Making People Rich, But...: 🐝 Join our FREE AI Business Trailblazers Hive Community at https://www.skool.com/ai-trailblazers-hive-7394/about?ref=ff40ab4ff9184e7ca2d1971501f578df Get co...


Notebook LM ▷ #general (98 messages🔥🔥):

RAG vs Full Context Window Gemini, NotebookLM Plus and Google One AI Premium, YouTube video integration, Saving chat responses as notes, Google sheets as a CSV


GPU MODE ▷ #general (1 message):

cappuccinoislife: hi alll


GPU MODE ▷ #triton (13 messages🔥):

VectorAdd zeros, GPU programming mantra, W4A8 linear kernel, SVDQuant


GPU MODE ▷ #cuda (30 messages🔥):

Funnel Shift vs. uint64_t, Trellis Scheme Quantization, CUDA 12.4.0 vs 12.4.1, GPU max value algorithm


GPU MODE ▷ #torch (1 message):

libtorch-gpu, onnxruntime, cuda-toolkit, cudnn, Docker image size optimization


GPU MODE ▷ #cool-links (13 messages🔥):

UT Austin Deep Learning Lectures, OpenCL vs CUDA flame war, Modular's take on CUDA alternatives, SYCL portability and Intel's involvement, Block Diffusion Language Models


GPU MODE ▷ #jobs (1 message):

PyTorch, Meta, Engineering Manager, Dev Infra Team, Equal Opportunity

Link mentioned: Software Engineering Manager, Infrastructure: Meta's mission is to build the future of human connection and the technology that makes it possible.


GPU MODE ▷ #beginner (16 messages🔥):

GPU architecture beginner book, Programming Massively Parallel Processors, CUDA books, Theoretical occupancy of a kernel, Nsight compute


GPU MODE ▷ #torchao (3 messages):

float8 conv, cuda kernels, torch inductor template, INT8 conv, static quant


GPU MODE ▷ #rocm (1 message):

AMD vLLM environment, Conda environment file, Reproducible builds


GPU MODE ▷ #self-promotion (3 messages):

FlashAttention Turing, MLA Weight Absorption, MLA CPU Kernel


GPU MODE ▷ #thunderkittens (2 messages):

Memory allocation issues in H100, ThunderKittens kernel modification, Memory access violation


GPU MODE ▷ #reasoning-gym (9 messages🔥):

Reasoning-Gym Curriculum, ETH + EPFL Collaboration, Auto-Curriculum RL, Evalchemy Integration, OpenAI Compatible Endpoint


GPU MODE ▷ #submissions (2 messages):

Modal Runners, Leaderboard Submissions


Yannick Kilcher ▷ #general (50 messages🔥):

YC Startup Strategy, Maxwell's Demon, Meta-Transform and Adaptive Meta-Learning, LLM Scaling Theory, AI scientist


Yannick Kilcher ▷ #paper-discussion (15 messages🔥):

Forward vs backward SDE, Universal State Machine (USM), Gemma 3


Yannick Kilcher ▷ #agents (4 messages):

Cognitive Architectures, Open-source Cognitive Architectures


Yannick Kilcher ▷ #ml-news (16 messages🔥):

Gemma 3, Sakana AI, Auto Science AI, MoE Fairness, RTX Riddick


Nomic.ai (GPT4All) ▷ #general (56 messages🔥🔥):

GPT-4 vs local LLMs, Ollama vs GPT4All, Deepseek 14B, Web crawling, LocalDocs


Latent Space ▷ #ai-general-chat (51 messages🔥):

Mastra AI framework, Gemini 2.0 Flash Experimental, Jina AI's DeepSearch/DeepResearch, Cohere's Command A, Gemini Deep Research


LlamaIndex ▷ #blog (3 messages):

Model Context Protocol, WeAreDevs WebDev & AI Day, LLM x Law Hackathon


LlamaIndex ▷ #general (46 messages🔥):

LlamaExtract on-premise, New Response API, LlamaParse and images, AzureMultiModal chat bugs, Deep research within RAG


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (7 messages):

Quiz Deadlines, Labs and Research Opportunities, Project Timelines, Certification for Non-Berkeley Students


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (4 messages):

LLM Roles, LLM Personas, Decision Making Research Group


Modular (Mojo 🔥) ▷ #general (2 messages):

Mojo and Max Bundling, Mojo on Windows

Link mentioned: Mojo and Max, why bundle them?: I’ve recently started a project with magic init life --format mojoproject but after looking at the dependencies I have: max 25.2.0.dev2025030905 release 9.7 KiB co...


Modular (Mojo 🔥) ▷ #mojo (4 messages):

Modular Max PR, Capturing Closures


Modular (Mojo 🔥) ▷ #max (1 message):

MutableInputTensor visibility, Mojo nightly docs, max.tensor API


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (5 messages):

AST Evaluation, Function Calling Leaderboard, LLM Integration, Parallel Function Calls

Link mentioned: Berkeley Function Calling Leaderboard


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (2 messages):

Evaluation tools, Datasets availability


DSPy ▷ #general (4 messages):

DSPy Caching, Pluggable Cache Module, Cache Invalidation Strategies, Selective Caching, Monitoring Cache Hit/Miss Rates

Link mentioned: Feature/caching by hmoazam · Pull Request #1922 · stanfordnlp/dspy: One single caching interface which has two levels of cache - in memory lru cache and fanout (on disk)


DSPy ▷ #colbert (1 message):

ColBERT endpoint, MultiHop program, Connection Refused


tinygrad (George Hotz) ▷ #general (1 message):

LSTM Model issues, NaN loss debugging, TinyJit integration


AI21 Labs (Jamba) ▷ #jamba (1 message):

Pinecone Limitations, RAG Changes, VPC Deployment



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}