Frozen AI News archive

not much happened today

On Day 1 of Nvidia GTC, several AI updates were highlighted. **Google's Gemini 2.0 Flash** introduces image input/output but is not recommended for text-to-image tasks; **Imagen 3** is preferred for those. **Mistral AI** released **Mistral Small 3.1** with a 128k-token context window and competitive pricing. **Allen AI** launched **OLMo-32B**, an open LLM that outperforms **GPT-4o mini** and **Qwen 2.5**. **ShieldGemma 2** was introduced for image safety classification. **LangChainAI** announced multiple updates, including **Julian**, powered by **LangGraph**, and integration with **AnthropicAI's MCP**. Jeremy Howard released **fasttransform**, a Python library for data transformations. **Perplexity AI** partnered with **Kalshi** for NCAA March Madness predictions.

AI News for 3/17/2025-3/18/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (223 channels, and 9014 messages) for you. Estimated reading time saved (at 200wpm): 990 minutes. You can now tag @smol_ai for AINews discussions!

It's Day 1 of Nvidia GTC, so there are a bunch of little announcements coming from San Jose, but nothing particularly market-moving:

https://www.youtube.com/watch?v=_waPvOwL9Z8


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Language Models and Releases

Frameworks and Tools

AI Applications and Use Cases

Infrastructure, Hardware, and Scaling

Concerns and Skepticism

Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Criticism of AI Benchmarks: Goodhart's Law in Action

Theme 2. Meta's Open-Source AI Hits a Billion Downloads

Theme 3. LG's EXAONE Deep Models Outperform on Reasoning Tasks

Theme 4. SmolDocling: New Tool for Document Understanding Released

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Augmented Reality with Stable Diffusion: Revolutionizing Real-Time Experiences

Theme 2. France launches Mistral Small 3.1: A New AI Contender Emerges

Theme 3. Hunyuan3D-DiT-v2-mv: New Horizons in 3D Model Generation

Theme 4. Claude and AI Models Recognizing Evaluation Environments: Ethics of 'Playing Dumb'


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Gemma 3 Models and Unsloth: Finetuning, Quantization, and Performance

Theme 2. Claude 3.5 Sonnet and Anthropic Ecosystem: Cost, Agentic Access, and Tooling

Theme 3. Nvidia's GTC Conference: Blackwell Ultra, New Hardware, and Market Moves

Theme 4. Open Source AI Models and Tools: DAPO, Instella, and Fudeno

Theme 5. Community Tooling and Debugging Deep Dives: Triton, Aider, and LM Studio


PART 1: High level Discord summaries

Cursor IDE Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Interconnects (Nathan Lambert) Discord


HuggingFace Discord


Perplexity AI Discord


Nous Research AI Discord


OpenAI Discord


MCP (Glama) Discord


GPU MODE Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


Yannick Kilcher Discord


Notebook LM Discord


Cohere Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


Eleuther Discord


LLM Agents (Berkeley MOOC) Discord


DSPy Discord


tinygrad (George Hotz) Discord


AI21 Labs (Jamba) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (909 messages🔥🔥🔥):

Cursor IDE, Claude Max, MCP Servers, vibe coders, Anthropic issues


Unsloth AI (Daniel Han) ▷ #general (392 messages🔥🔥):

Full Finetuning and 8-bit Finetuning in Unsloth, Gemma 3 Support in Unsloth, AGPL3 Licensing for Unsloth, GGUF Quantization Formats


Unsloth AI (Daniel Han) ▷ #off-topic (16 messages🔥):

bnbso alternatives, QLoRA NF4 dequantization, Unsloth open positions


Unsloth AI (Daniel Han) ▷ #help (178 messages🔥🔥):

Gemma 3, Ollama and Gemma, Phi-4-mini-instruct, Multi-GPU Support, AMD Support


Unsloth AI (Daniel Han) ▷ #showcase (20 messages🔥):

Gemma-3-27b vocabulary pruning, 4090 finetuning, GPU power consumption

Link mentioned: fimbulvntr/gemma-3-27b-pt-unsloth-bnb-4bit-pruned-vocab · Hugging Face


Unsloth AI (Daniel Han) ▷ #research (2 messages):

Gemma 3, VRAM calculation, Zeroth Order Optimization

Link mentioned: GitHub - liangyuwang/zo2: ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory


aider (Paul Gauthier) ▷ #general (480 messages🔥🔥🔥):

Claude Code vs Aider, Claude Code IP theft?, Grok-3 vs Aider, Junie, the JetBrains AI assistant, Using OpenRouter with Aider


aider (Paul Gauthier) ▷ #questions-and-tips (47 messages🔥):

Model selection for ideation and planning, Aider API scripting, Sonar integration with Aider, Stopping streaming responses in Aider, Aider's CONVENTIONS.md file inconsistencies


aider (Paul Gauthier) ▷ #links (20 messages🔥):

Refact.ai Agent + Claude 3.7 Sonnet, Aider's Polyglot Benchmark, Baidu models, Qwen models, Anthropic's Harmony feature


LM Studio ▷ #general (103 messages🔥🔥):

TTS Models in LM Studio, Multimodal models, Gemma 3, Context Compliance Attack (CCA), Open Voice and TTS


LM Studio ▷ #hardware-discussion (255 messages🔥🔥):

PCI-e over Firewire, Reference arc design, RGB case fans, Strix Halo, AI Model Speeds


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Endpoint Quality Measurement


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Cline Compatibility Board, Claude 3.5 Sonnet, Gemini 2.0 Pro Exp

Link mentioned: Cline Compatibility Board


OpenRouter (Alex Atallah) ▷ #general (274 messages🔥🔥):

Mistral 3.1 Small Launch, OpenRouter vs LLM provider's API, Function/tool calling on OpenRouter, Cost usage query in script, OpenAI Agents SDK with OpenRouter API


Interconnects (Nathan Lambert) ▷ #news (18 messages🔥):

Hotshot acquired by xAI, Instella 3B Language Model, Gemini 1.5 & Test-Time Compute, BoN vs Long CoT, Harvard Research on Open-Source


Interconnects (Nathan Lambert) ▷ #ml-questions (2 messages):

Coreweave, Vultr, Crusoe, Cloud pricing, Bare metal


Interconnects (Nathan Lambert) ▷ #ml-drama (18 messages🔥):

Conference Submission Capping, AI Reviewers, Liam Fedus Leaving OpenAI, AI for Materials Science


Interconnects (Nathan Lambert) ▷ #random (162 messages🔥🔥):

Claude Fandom, Nous AI RL Infra, Mistral Small 3.1, OLMo 2 vs Gemma, Llama 4 'Polus'


Interconnects (Nathan Lambert) ▷ #memes (3 messages):

Mistral Meow, Joke Identification, VTA Strike

Link mentioned: Tweet from Max Woolf (@minimaxir): Testing to see how well LLMs can identify subtle jokes within an image only, and Claude's answer here is very innocent.


Interconnects (Nathan Lambert) ▷ #rl (20 messages🔥):

GRPO paper, DAPO Algorithm, RLHF Book Notes


Interconnects (Nathan Lambert) ▷ #cv (1 messages):

InternVL2.5 vs Qwen2.5VL benchmarks, Autonomous driving paper analysis


Interconnects (Nathan Lambert) ▷ #reads (10 messages🔥):

Future of LLMs, xLSTM 7B, Mistral Small 3.1, VisTW-MCQ for VLMs


HuggingFace ▷ #general (140 messages🔥🔥):

Model Size Calculation, Video Llama for Prompt Creation, Image Generator Spaces, WAN 2.1 Not Working, Home Server GPUs for Local AI


HuggingFace ▷ #today-im-learning (3 messages):

SD VAEs, Stochastic Variational Inference

Link mentioned: Auto-Encoding Variational Bayes: How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We in...


HuggingFace ▷ #i-made-this (4 messages):

Fudeno Instruct 4M dataset, ManusMCP AI agent workflows, Gemma-3 multimodal models, Gemini image editing API


HuggingFace ▷ #NLP (2 messages):

SetFit, Sentence Transformers, PEFT, tomaarsen/bert-base-uncased-gooaq-peft

Link mentioned: Training with PEFT Adapters — Sentence Transformers documentation


HuggingFace ▷ #agents-course (59 messages🔥🔥):

LiteLLM and Ollama Integration, Smolagents ManagedAgent deprecation, Agents Course unit 2.3 langgraph materials availability, Troubleshooting Agent Template Errors, Gradio memory allocation issues


Perplexity AI ▷ #announcements (1 messages):

Perplexity Marketing


Perplexity AI ▷ #general (171 messages🔥🔥):

Disable Internet Search, Programming Queries Models, Claude vs Perplexity Privacy, GPT 4o Context, Gemini Advanced Limit


Perplexity AI ▷ #sharing (4 messages):

Meta Community Notes, AI Quit Button, Pineapple Pizza


Perplexity AI ▷ #pplx-api (3 messages):

Integrate French translator, Deep research via API


Nous Research AI ▷ #general (125 messages🔥🔥):

Mistral-Small-3.1-24B-Instruct-2503, llama.cpp support for multimodal models, DAPO algorithm, Phi 4 use cases, Tensor Parallelism


Nous Research AI ▷ #ask-about-llms (1 messages):

chilliwiddit: Hey guys what do you think about SWA combined with CoC? Just throwing that out there


Nous Research AI ▷ #research-papers (2 messages):

Differentiable Hebbian Consolidation, Gemini 1.5 Scaling Search


Nous Research AI ▷ #research-papers (2 messages):

Continual Learning, Differentiable Hebbian Consolidation, Gemini 1.5 Scaling Search


OpenAI ▷ #ai-discussions (110 messages🔥🔥):

AI in Finance beyond LLMs, Grok's Distraction, Gemini vs other models, DeepSeek ban in the U.S., AI image enhancers


OpenAI ▷ #gpt-4-discussions (1 messages):

krishna_83301: Yes


OpenAI ▷ #prompt-engineering (4 messages):

Unhelpful assistant challenge, ChatGPT personalizations, Evolving system messages


OpenAI ▷ #api-discussions (4 messages):

Unhelpful assistant experiment, ChatGPT personalizations, GPT unhelpful state, Darkness of ChatGPT


MCP (Glama) ▷ #general (75 messages🔥🔥):

Tool Calling Support, MCP Client Landscape, Free LLM Inference Services, Deploying MCP Servers Privately, Resources with Python SDK

Link mentioned: Claude Code MCP: An implementation of Claude Code as a Model Context Protocol server that enables using Claude's software engineering capabilities (code generation, editing, reviewing, and file operations) throug...


MCP (Glama) ▷ #showcase (3 messages):

ACE - Adaptive Code Evolution, Tesla MCP server


GPU MODE ▷ #general (1 messages):

perf counters


GPU MODE ▷ #triton (15 messages🔥):

Triton matrix multiplication issue, Debugging Triton code, Stride issues in Triton, Flash Attention 2 inner kernel

Link mentioned: TRITON - Strange error with matrix multiplication: I have 2 matrices P and V and when I take their dot product with triton I get results that are inconsistent with pytorch. The P and V matrices are as follows. P is basically the softmax which is w...


GPU MODE ▷ #cuda (3 messages):

nsys reports, Blackwell Ultra's attention instruction


GPU MODE ▷ #torch (17 messages🔥):

std::optional vs Either, torchrun hangs silently on OOM, Profiling Scripted Torch Model, FSDP State Dict Types, cuDNN Benchmarking


GPU MODE ▷ #algorithms (1 messages):

Nvidia's tanh.approx throughput, Performance of tanh.approx on Turing architecture


GPU MODE ▷ #beginner (6 messages):

setuptools upgrade, fp16 vector addition CUDA kernel debugging, CUDA_NO_HALF_OPERATORS flag


GPU MODE ▷ #off-topic (1 messages):

pauleonix: Also vim + tmux here (w/ extensions)


GPU MODE ▷ #irl-meetup (3 messages):

Nvidia GTC workshops, Vijay Thakkar slides


GPU MODE ▷ #rocm (1 messages):

iron_bound: https://github.com/mk1-project/quickreduce


GPU MODE ▷ #liger-kernel (2 messages):

Liger Kernel Optimizations, HF Transformer's Tensor Parallel Plans, Qwen Model Compatibility


GPU MODE ▷ #reasoning-gym (3 messages):

Community reception, Exams, Missed work


GPU MODE ▷ #submissions (9 messages🔥):

matmul, vectorsum, grayscale, H100, A100


GPU MODE ▷ #tpu (3 messages):

TPU crash course, New TPU channel


Modular (Mojo 🔥) ▷ #general (15 messages🔥):

Server Rules, LeetGPU challenges, GTC Talks, Nvidia Keynote, Blackwell Ultra

Link mentioned: LeetGPU


Modular (Mojo 🔥) ▷ #mojo (42 messages🔥):

Compact Dict Status, memcpy vs memset, List fill method, Span fill method Alignment Error, HashMap in stdlib

Link mentioned: GitHub - mzaks/compact-dict: A fast and compact Dict implementation in Mojo 🔥


Latent Space ▷ #ai-general-chat (44 messages🔥):

GRPO, DAPO algorithm, Vibe Coding Game Jam, Manus access, EXAONE Deep


Yannick Kilcher ▷ #general (30 messages🔥):

Forward-Forward Algorithm, Mirror Neurons, EXAONE vs DeepSeek, AI Voice Models, Practical AI Development Exercises


Yannick Kilcher ▷ #paper-discussion (3 messages):

Anthropic's research, Karatsuba Algorithm Extension

Link mentioned: Karatsuba Matrix Multiplication and its Efficient Custom Hardware Implementations: While the Karatsuba algorithm reduces the complexity of large integer multiplication, the extra additions required minimize its benefits for smaller integers of more commonly-used bitwidths. In this w...


Yannick Kilcher ▷ #ml-news (8 messages🔥):

Mistral Small 3.1, OpenAI post-training head departs, Copyrights for AI-generated art


Notebook LM ▷ #announcements (1 messages):

Gemini Flash, Inline Citations, Source Selection, Doc, Slide, or YouTube video linking, Scrolling Behavior


Notebook LM ▷ #use-cases (8 messages🔥):

Agentspace, NotebookLM API, PDF Uploads, vLEX Hallucinations

Link mentioned: Google Agentspace: Google Agentspace is the launch point for enterprise-ready AI agents, helping increase employee productivity for complex tasks with one single prompt.


Notebook LM ▷ #general (31 messages🔥):

NotebookLM in corporate training, Agentspace Integration, NotebookLM limitations on data sources, Deep Research limits, Long Context Upgrade


Cohere ▷ #「💬」general (16 messages🔥):

Command-A, Multimodal Cohere, Aya Vision, UC Berkeley Chatbot Arena

Link mentioned: imgur.com: Discover the magic of the internet at Imgur, a community powered entertainment destination. Lift your spirits with funny jokes, trending memes, entertaining gifs, inspiring stories, viral videos, and ...


Cohere ▷ #「🔌」api-discussions (5 messages):

Cohere API, Token Balance Error, Billing Setup, LibreChat Integration


Cohere ▷ #「🤖」bot-cmd (1 messages):

alialiali92: Where are the ruins of Babylon?


Cohere ▷ #「🤝」introductions (3 messages):

AI travel companion in Arabic, RAG knowledge base for SME


LlamaIndex ▷ #general (19 messages🔥):

LlamaExtract Access, AI Mentor Hackathon, Multi-Agent System Handoff Issues, Real-Time Data Plugin, LlamaParse Page Length Limit


LlamaIndex ▷ #ai-discussion (1 messages):

Vision-Language Models, VLMs Research Hub, Multimodal Learning

Link mentioned: GitHub - thubZ09/vision-language-model-hub: Hub for researchers exploring VLMs and Multimodal Learning:)


Nomic.ai (GPT4All) ▷ #general (20 messages🔥):

GPT-o3-mini hidden CoT, LLM Refusal to share CoT, Embeddings storage location

Link mentioned: Frequently Asked Questions: GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. - nomic-ai/gpt4all


Eleuther ▷ #general (9 messages🔥):

Catherine Arnett joins EleutherAI, Multilingual NLP, ARENA coursework collaboration, Website sidebar issues


Eleuther ▷ #research (3 messages):

Superword Tokenizer, Fine-tuning Gemini or OLMo

Link mentioned: SuperBPE: Space Travel for Language Models: The assumption across nearly all language model (LM) tokenization schemes is that tokens should be subwords, i.e., contained within word boundaries. While providing a seemingly reasonable inductive bi...


Eleuther ▷ #interpretability-general (1 messages):

Latent Activations, Sequence Processing


Eleuther ▷ #lm-thunderdome (6 messages):

lm_eval, BioMistral, Ollama support, API key for lm_eval

Link mentioned: GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models.


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

AgentX Competition, Entrepreneurship Track, Research Track, Team Sign-up


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (10 messages🔥):

MOOC certificate, Quiz answer keys, Prototype submission, Coursework deadlines


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (2 messages):

Oracle Feedback, Self-Reflection, Reward Modeling


DSPy ▷ #general (12 messages🔥):

Assertions and Suggestions in DSPy, QdrantRM in DSPy 2.6, DSPy Go implementation


tinygrad (George Hotz) ▷ #general (3 messages):

M1 Air Training Limitations, Hosting Inference Demos


AI21 Labs (Jamba) ▷ #general-chat (2 messages):

Welcoming new members, Feature requests, Community Polls


MLOps @Chipro ▷ #events (1 messages):

MLOps, AWS, Featureform

Link mentioned: MLOps Workshop: Building an MLOps Stack from Scratch on AWS: Join us for a 1-hour webinar on Tuesday, March 25th @ 8 A.M. PT for an in-depth discussion on building end-to-end MLOps platforms.


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Tab, Autocomplete, Supercomplete, Tab to Jump, Tab to Import



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}