Frozen AI News archive

The new OpenAI Agents Platform

**OpenAI** introduced a comprehensive suite of new tools for AI agents, including the **Responses API**, **Web Search Tool**, **Computer Use Tool**, **File Search Tool**, and an open-source **Agents SDK** with integrated observability tools, marking a significant step towards the "Year of Agents." Meanwhile, **Reka AI** open-sourced **Reka Flash 3**, a **21B parameter reasoning model** that outperforms **o1-mini** and powers their Nexus platform, with weights available on **Hugging Face**. The **OlympicCoder** series surpassed **Claude 3.7 Sonnet** and much larger models on competitive coding benchmarks. **DeepSeek** built a **32K GPU cluster** capable of training V3-level models in under a week and is exploring AI distillation. **Hugging Face** announced **Cerebras** inference support, achieving over **2,000 tokens/s** on **Llama 3.3 70B**, 70x faster than leading GPUs. **Reka's Sonic-2** voice AI model delivers **40ms latency** via the **Together API**. **Alibaba's Qwen Chat** enhanced its multimodal interface with video understanding up to **500MB**, voice-to-text, guest mode, and expanded file uploads. *Sama* praised OpenAI's new API as "one of the most well-designed and useful APIs ever."

Canonical issue URL

AI News for 3/11/2025-3/12/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (224 channels, and 2851 messages) for you. Estimated reading time saved (at 200wpm): 258 minutes. You can now tag @smol_ai for AINews discussions!

In a livestream today, OpenAI dropped a sweeping set of changes to prepare for the Year of Agents:

Atty Eletti told the full story of the design decisions, and sama called it "one of the most well-designed and useful APIs ever".

You can find more code samples and highlights on the exclusive Latent Space interview for today's launch:

https://www.youtube.com/watch?v=QU9QLi1-VvU


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

1. AI Models and Performance: Model releases, benchmarks, performance comparisons of specific models

2. AI Agents and Developer Tools: Focus on tools for building and using AI agents, SDKs, APIs, and agentic workflows.

3. AI Applications and Industry Impact: Real-world applications, industry use cases, and company news.

4. China and AI Competition: Focus on China's AI advancements and competition with the US.

5. AI Research & Techniques: Core AI research concepts and techniques being discussed.

6. Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Gemma 3 Anticipation and Potential Impact

Theme 2. M3 Ultra 512GB Review with Deepseek R1 671B Q4

Theme 3. NVLINK's Impact on RTX 3090 Performance

Theme 4. Alibaba's R1-Omni for Emotion Recognition

Theme 5. Reka Flash 3: New Open-Source 21B Model

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Claude 3.7: Enhancing Developer Skills through Debugging

Theme 2. Nvidia's Gen3C: Advancements in Image to 3D Conversion

Theme 3. Dario Amodei: AI Code Generation Predictions and Skepticism


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Exp

Theme 1: Open AI's Agent Development Ecosystem Evolves

Theme 2: Navigating the Frontier of AI Model Capabilities and Limitations

Theme 3: Community-Driven Tools and Techniques for AI Development

Theme 4: Hardware and Infrastructure Considerations for AI Workloads

Theme 5: Ethical Concerns and Usage Policies in AI Development


PART 1: High level Discord summaries

Cursor IDE Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Nous Research AI Discord


Eleuther Discord


Latent Space Discord


OpenRouter (Alex Atallah) Discord


OpenAI Discord


aider (Paul Gauthier) Discord


LM Studio Discord


HuggingFace Discord


GPU MODE Discord


Interconnects (Nathan Lambert) Discord


MCP (Glama) Discord


Notebook LM Discord


Codeium (Windsurf) Discord


Yannick Kilcher Discord


LlamaIndex Discord


DSPy Discord


Torchtune Discord


Cohere Discord


tinygrad (George Hotz) Discord


Modular (Mojo 🔥) Discord


AI21 Labs (Jamba) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (1048 messages🔥🔥🔥):

Cursor Nightly Bugs, Claude 3.7 Pricing, Potential of Manus AI, Criticism of Cursor's Stability, Local LLM vs Cloud GPU

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Perplexity Windows app, Voice dictation, Keyboard shortcuts


Perplexity AI ▷ #general (277 messages🔥🔥):

Revolut Perplexity Pro Promo Code Issues, Claude 3.7 Token Limits, Perplexity Enterprise Integration, Sider AI Features and Comparison, Perplexity Windows App

Links mentioned:


Perplexity AI ▷ #sharing (8 messages🔥):

Student cracks math problem, Air product teased, Trump and Canada Tariffs, Apple software overhaul, Man U stadium plans


Perplexity AI ▷ #pplx-api (6 messages):

API chat completions truncation, Tier-3 structured output

Link mentioned: no title found: no description found


Unsloth AI (Daniel Han) ▷ #general (232 messages🔥🔥):

GRPO trainer batch size and gradient accumulation, reward hacking, finetuning llama 8b 4bit vs normal, high context window and batch size impact, date and time extraction regex vs ai

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

AI Code Fusion Tool, Code Optimization for LLMs

Link mentioned: GitHub - codingworkflow/ai-code-fusion: Desktop app to process repository content into one file: Desktop app to process repository content into one file - codingworkflow/ai-code-fusion


Unsloth AI (Daniel Han) ▷ #help (33 messages🔥):

Qwen2.5, GRPO RL algo, Deepseek distilled models, multi-node multi-GPU, Llama-3.2 1B colab notebook

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

train_on_responses_only Functionality, HuggingFace Space for Unsloth, HF model sharing


Unsloth AI (Daniel Han) ▷ #research (4 messages):

GRPO RL algorithm, Unsloth GRPO, Deepseek distilled models


Nous Research AI ▷ #general (203 messages🔥🔥):

Deep Hermes, Browserless.io for web scraping, Universal Transformers, Nvidia Blackwell GPU Hackathon, Forward Gradients

Links mentioned:


Eleuther ▷ #general (32 messages🔥):

Evaluating small language models, Output formatting challenges, MATH benchmark and finetuning, lm-eval harness, Open source dLLMs


Eleuther ▷ #research (103 messages🔥🔥):

spectral autoregression, Gaussian noise, noise schedules, neural flow diffusion models, variational rectified flow

Links mentioned:


Eleuther ▷ #interpretability-general (1 messages):

Metrics for Patching Results, Tokenizer Issues, Evaluating Important Circuits for Math CoT


Latent Space ▷ #ai-general-chat (57 messages🔥🔥):

Avoma, Factorio Learning Environment, Contextual AI Reranker, OpenAI Agents API, Luma Labs IMM

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

Latent Space Podcast, OpenAI Agents Platform, Responses API, Agents SDK

Link mentioned: Tweet from Latent.Space (@latentspacepod): 🆕 The new OpenAI Agents Platformhttps://latent.space/p/openai-agents-platform@OpenAIDevs are refreshing the entire suite of APIs, Tools, and SDKs for the year of Agents. We grab an exclusive intervie...


Latent Space ▷ #llm-paper-club-west (76 messages🔥🔥):

OpenAI Agents SDK, Responses API, Assistants API sunset, Agent Ops, OpenTelemetry

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

FAQ Page, Quality of Life Updates


OpenRouter (Alex Atallah) ▷ #general (133 messages🔥🔥):

Gemini 2.0 Flash, OpenAI dev-facing reveal, AYA vision by cohere, Parameter calculation removal, DeepSeek-R1 API issues

Links mentioned:


OpenAI ▷ #annnouncements (3 messages):

Agent Tools, Developer Livestream, Developer AMA

Link mentioned: Tweet from OpenAI Developers (@OpenAIDevs): Join us here for an AMA after the livestream! From 10:30–11:30 AM PT, the team behind today’s ships will answer your questions.Reply below with your questions.Quoting OpenAI (@OpenAI) Agent Tools for ...


OpenAI ▷ #ai-discussions (91 messages🔥🔥):

AGI, Grok, GPT-4.5, Gemini, Sider AI

Links mentioned:


OpenAI ▷ #gpt-4-discussions (10 messages🔥):

GPT-4.5 Code Inconsistencies, GPT-4o Code Generation Issues, Trustworthiness of AI Models, New responses API vs chat completions


OpenAI ▷ #prompt-engineering (6 messages):

Jailbreaking, Terms of Service, Allowed Content, Prompting Techniques


OpenAI ▷ #api-discussions (6 messages):

Jailbreaking AI Models, Terms of Service violations, Allowed content guidelines, Prompting Techniques


aider (Paul Gauthier) ▷ #general (66 messages🔥🔥):

Aider --watch Flag, Reasoning Tags, Daily Budget for Aider, DMCA takedown of Claude Code, o1 pro / o3 mini pro API

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (27 messages🔥):

Litellm APIConnectionError, Fine-tuning Qwen Coder, Aider undo issues, aider pro-tip sqlite schema, Aider Leaderboard explanation

Link mentioned: Edit formats: Aider uses various “edit formats” to let LLMs edit source files.


aider (Paul Gauthier) ▷ #links (1 messages):

end4749: Maybe related, seems interesting https://github.com/xingyaoww/code-act


LM Studio ▷ #general (26 messages🔥):

Unity LM Studio Connection, Internal LLM Chat Setup, Python SDK Vision Models, LLM Fiction Interpretation, Copy4AI Extension

Links mentioned:


LM Studio ▷ #hardware-discussion (45 messages🔥):

Swap Usage, Speculative Decoding, AMD Vulkan/ROCm Performance, RTX 2000E inference tests, CXL memories

Links mentioned:


HuggingFace ▷ #general (18 messages🔥):

X DDoS attack, LanguageBind vs ImageBind, Using Chroma DB for chatbot memory

Links mentioned:


HuggingFace ▷ #cool-finds (2 messages):

Reka Flash 3, Cakeify LoRA, Wan2.1 14B I2V 480p

Links mentioned:


HuggingFace ▷ #i-made-this (2 messages):

RAGcoon, Startup assistance, Agentic RAG, Qdrant vector database, LlamaIndex

Link mentioned: GitHub - AstraBert/ragcoon: Agentic RAG to help you build a startup🚀: Agentic RAG to help you build a startup🚀. Contribute to AstraBert/ragcoon development by creating an account on GitHub.


HuggingFace ▷ #computer-vision (1 messages):

Real-time Disease Detection, Pretrained Models for Fine-Tuning


HuggingFace ▷ #NLP (2 messages):

SmolLM, Mobile LLMs, Fine-tuning LLMs

Link mentioned: GitHub - huggingface/smollm: Everything about the SmolLM2 and SmolVLM family of models: Everything about the SmolLM2 and SmolVLM family of models - GitHub - huggingface/smollm: Everything about the SmolLM2 and SmolVLM family of models


HuggingFace ▷ #smol-course (2 messages):

ChatML conversion woes, Llama discrepancies


HuggingFace ▷ #agents-course (39 messages🔥):

LlamaIndex error, smolagents DocstringParsingException, smolagents OpenAI/DeepSeek keys, Ollama integration with Smolagents, Brave Browser's Leo AI

Links mentioned:


GPU MODE ▷ #general (5 messages):

GPU mode, CUDA expertise, San Jose meeting


GPU MODE ▷ #triton (9 messages🔥):

Triton tl.cast vs tl.full, Pipeline softmax kernel in Triton, Tiled GEMM Implementation in Triton, TF32 precision in Triton

Link mentioned: mlcompilers_and_kernels/triton_kernels/SimpleOpKernels/triton_tiled_2d_matmul.py at main · gauravjain14/mlcompilers_and_kernels: Contribute to gauravjain14/mlcompilers_and_kernels development by creating an account on GitHub.


GPU MODE ▷ #cuda (19 messages🔥):

stmatrix padding, BFE/BFI instructions, CUDA binary tools

Link mentioned: 1. Overview — CUDA Binary Utilities 12.8 documentation: no description found


GPU MODE ▷ #beginner (9 messages🔥):

leetgpu.com, leaderboard, GPU mode


GPU MODE ▷ #off-topic (1 messages):

iron_bound: https://jackhopkins.github.io/factorio-learning-environment/


GPU MODE ▷ #rocm (5 messages):

ROCm Version, Ubuntu Versions, vLLM on AMD

Link mentioned: Compatibility matrix — ROCm Documentation: no description found


GPU MODE ▷ #metal (3 messages):

MLX custom kernels, Metal-cpp, GPU Programming, C++ linear algebra library

Link mentioned: Custom Extensions in MLX — MLX 0.23.2 documentation: no description found


GPU MODE ▷ #self-promotion (1 messages):

IPFS Accelerate JS, HuggingFace Port to TS/JS

Link mentioned: feat: Implement initial structure for IPFS Accelerate JS with placeho… · endomorphosis/ipfs_accelerate_py@2f99633: …lder modules and TypeScript conversion


GPU MODE ▷ #reasoning-gym (6 messages):

Private eval benchmark service, reasoning-gym, Curriculum benchmarks

Link mentioned: reasoning-gym/tools at main · open-thought/reasoning-gym: procedural reasoning datasets. Contribute to open-thought/reasoning-gym development by creating an account on GitHub.


GPU MODE ▷ #gpu模式 (4 messages):

cublas autotune, PTX optimization, Triton autotune, Cutlass autotune


Interconnects (Nathan Lambert) ▷ #events (1 messages):

Nvidia Blackwell, SemiAnalysis, GPU Hackathon, Open Source

Link mentioned: Hackathon 2025: SemiAnalysis is kicking things off ahead of NVIDIA GTC! Start your day with engaging morning keynotes, hack all day with low-level NVIDIA GPU programming (maybe even Blackwell), take a breather wit…


Interconnects (Nathan Lambert) ▷ #news (17 messages🔥):

Reka Flash 3, Anthropic ARR Growth, Manus AI Sensation, Nvidia Blackwell GPU Hackathon, Automation and Robotics in China

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (2 messages):

Claude Code Decompilation, GitHub Repo

Link mentioned: Tweet from Dazai (@odazai_): @dnak0v @cheatyyyy They took down the decompiled claude-code 😢


Interconnects (Nathan Lambert) ▷ #random (34 messages🔥):

Qwen Chat Enhanced, New tools for building agents with the API, Anthropic CEO, Dario Amodei predicts most code will be written by AI in 12 months, Sama hypeposts new OpenAI model good at creative writing

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

AI Distillation, Neural Sequence Chunkers, History Compression, DeepSeekR1, Reinforcement Learning Prompt Engineer

Link mentioned: Tweet from Jürgen Schmidhuber (@SchmidhuberAI): Thanks to #DeepSeek, even @CNBC [9] now talks about AI distillation, published in 1991 [1][2]. I called it "collapsing," back then, not “distilling." See also [9][10]. REFERENCES[1] J. Sch...


Interconnects (Nathan Lambert) ▷ #reads (6 messages):

Inductive Moment Matching (IMM), Claude 3.7 Sonnet reasoning

Links mentioned:


MCP (Glama) ▷ #general (40 messages🔥):

MCP servers in Cursor, GitHub org owned repos verification, Claude Desktop config schema, Smithery CLI with Gitlab MCP server, Openai's SDK MCP compatibility

Links mentioned:


MCP (Glama) ▷ #showcase (10 messages🔥):

MCP, Model Context Protocol, Elixir's Phoenix Framework, llama.cpp, vllm

Links mentioned:


Notebook LM ▷ #use-cases (7 messages):

NotebookLM for Exam Prep, NotebookLM for Medical Guidelines and Patient Info, Automating NotebookLM Uploads, NotebookLM Audio Overview


Notebook LM ▷ #general (17 messages🔥):

Gemini, Notebook LM Limits, Language support, Source management, Audio overview

Link mentioned: Upgrading to NotebookLM Plus - NotebookLM Help: no description found


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Referral Challenge, v1.4.6 Patch Fixes, Windsurf Previews, Auto-Linter, MCP Servers

Links mentioned:


Codeium (Windsurf) ▷ #discussion (23 messages🔥):

Codeium VS Code extension issues, Claude 3.7 Sonnet in VS Code, Codeium extension credits, VS Code extension version discrepancy, Codeium server errors


Yannick Kilcher ▷ #general (18 messages🔥):

operation order in deep learning frameworks, Adaptive Meta-Learning, RLHF issues, Matplotlib and Claude 3.7 graphs, hiding complexity

Link mentioned: Adaptive meta-learning | Semantic Scholar: An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease.


Yannick Kilcher ▷ #paper-discussion (1 messages):

``


Yannick Kilcher ▷ #ml-news (3 messages):

LLMs think in tokens, VR prison

Link mentioned: ‘An ideal tool’: prisons are using virtual reality to help people in solitary confinement: Participants view scenes of daily life as well as travel adventures – then process the emotions they trigger through art


LlamaIndex ▷ #general (11 messages🔥):

Llama Extract Access, Premium Plan Signup, Function Calling Models, MP3 Parsing Error


DSPy ▷ #show-and-tell (1 messages):

amanshrestha: https://github.com/openai/openai-agents-python


DSPy ▷ #general (8 messages🔥):

Judge LLM, ChainPoll, Best of N, dspy.Parallel


Torchtune ▷ #general (7 messages):

OpenPipe's deductive-reasoning, FP8 Fine-Tuning, Torchtune QAT Support, Weight Decay Strategies, Evaluation Dataset Logging

Links mentioned:


Torchtune ▷ #dev (1 messages):

Regression Tests, Model Size Finalization, Evaluation Metrics, Comprehensive Measurement Strategies


Cohere ▷ #「💬」general (1 messages):

ceifa: 😮


Cohere ▷ #【📣】announcements (2 messages):

Expedition Aya 2024, Multilingual AI, Multimodal AI, Efficient AI, Cohere API Credits

Links mentioned:


Cohere ▷ #「💡」projects (1 messages):

Nvidia Blackwell GPU Hackathon, SemiAnalysis, PTX Infrastructure

Link mentioned: Hackathon 2025: SemiAnalysis is kicking things off ahead of NVIDIA GTC! Start your day with engaging morning keynotes, hack all day with low-level NVIDIA GPU programming (maybe even Blackwell), take a breather wit…


Cohere ▷ #「🤝」introductions (2 messages):

Multilingual/multicultural communities, Introductions and Community Expectations


tinygrad (George Hotz) ▷ #general (3 messages):

Nvidia Blackwell GPU Hackathon, SemiAnalysis, Blackwell PTX, GB200, GTC

Link mentioned: Hackathon 2025: SemiAnalysis is kicking things off ahead of NVIDIA GTC! Start your day with engaging morning keynotes, hack all day with low-level NVIDIA GPU programming (maybe even Blackwell), take a breather wit…


Modular (Mojo 🔥) ▷ #general (1 messages):

clear3fram3: waiting for the last blog posts on CUDA to pop up 🙂


AI21 Labs (Jamba) ▷ #jamba (1 messages):

Qdrant, Vector DB, ConvRAG, VPC Deployment





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}