Frozen AI News archive

not much happened today

**OpenAI** teased a *Memory update in ChatGPT* with limited technical details. Evidence suggests upcoming releases of **o3** and **o4-mini** models, alongside a press leak about **GPT-4.1**. **X.ai** launched the **Grok 3** and **Grok 3 mini** APIs, confirmed as **o1** level models. Discussions compared **Google's TPUv7** with **Nvidia's GB200**, highlighting TPUv7's specs like **4,614 TFLOP/s FP8 performance**, **192 GB HBM**, and **1.2 Tbps ICI bandwidth**. TPUv7 may have pivoted from training to inference chip use. Key AI events include **Google Cloud Next 2025** and **Samsung's Gemini-powered Ballie robot**. The community is invited to participate in the **AI Engineer World's Fair 2025** and the 2025 State of AI Engineering survey.

Canonical issue URL

AI News for 4/9/2025-4/10/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (230 channels, and 6924 messages) for you. Estimated reading time saved (at 200wpm): 601 minutes. You can now tag @smol_ai for AINews discussions!

Sama drummed up some hype for today's Memory update in ChatGPT, but with very little technical detail, there's not much to go on yet.

There is certainly evidence that o3 and o4-mini are coming soon, as well as some credible press leaks of 4o's upgrade to GPT4.1.

X.ai released the Grok 3 and Grok 3 mini API and Epoch AI independenltly confirmed it as an o1 level model... in a now deleted tweet. We last covered Grok 3 in Feb.


Since it's quiet, do consider answering our call for the world’s best AI Engineer talks for AI Architects, /r/localLlama, Model Context Protocol (MCP), GraphRAG, AI in Action, Evals, Agent Reliability, Reasoning and RL, Retrieval/Search/RecSys , Security, Infrastructure, Generative Media, AI Design & Novel AI UX, AI Product Management, Autonomy, Robotics, and Embodied Agents, Computer-Using Agents (CUA), SWE Agents, Vibe Coding, Voice, Sales/Support Agents at AI Engineer World's Fair 2025! And fill out the 2025 State of AI Eng survey for $250 in Amazon cards and see you from Jun 3-5 in SF!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

TPUs and Hardware Accelerators

Models, Training, and Releases

Agent Development and Tooling

ChatGPT and Model Memory

Google's Gemini Models and Capabilities

Tariffs and Trade

Other

Humor and Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Fixing Token Issues in Bartowski Models

Theme 2. Qwen3 Release Delayed: Community Reacts to Update

Theme 3. "Celebrating Qwen's Iconic LLM Mascot Ahead of Qwen3"

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Exploring AI Developments: Models, Comparisons, and Support

Theme 2. "Navigating AI Innovations and User Challenges"

Theme 3. Excitement and Speculation Surrounding Launch Day


AI Discord Recap

A summary of Summaries of Summaries

Theme 1. Fresh Models Flood the Market: Grok, Optimus, Gemini, and More Emerge

Theme 2. Tooling Up: Frameworks and Platforms Evolve for AI Engineers

Theme 3. Hardware Heats Up: AMD MI300X and Apple M3 Challenge NVIDIA's Dominance

Theme 4. Data Dilemmas: Preparation, Memory, and Copyright Concerns Rise

Theme 5. Agentic Futures: From Trading Bots to Semantic Tool Calling


PART 1: High level Discord summaries

LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


Perplexity AI Discord


LM Studio Discord


Interconnects (Nathan Lambert) Discord


Cursor Community Discord


OpenAI Discord


MCP (Glama) Discord


GPU MODE Discord


Modular (Mojo 🔥) Discord


HuggingFace Discord


Notebook LM Discord


Nous Research AI Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


Eleuther Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Codeium (Windsurf) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LMArena ▷ #general (1194 messages🔥🔥🔥):

GPT-4o Updates, Model Naming Schemes, OpenAI vs. Google, New Model Releases, Coding and AI


Unsloth AI (Daniel Han) ▷ #general (511 messages🔥🔥🔥):

Healthcare AI Dangers, LoRA merging, NHS GenAI triage, Meta Llama 4 Bugfix, Unsloth for image-to-video


Unsloth AI (Daniel Han) ▷ #off-topic (112 messages🔥🔥):

Spiking Neural Networks (SNNs), SNN Training Methods, Neuromorphic Chips, GPT Memory Enhancement, Hot Layer Injection


Unsloth AI (Daniel Han) ▷ #help (132 messages🔥🔥):

GRPO reward function with LLM as judge, Orpheus fine-tuning errors, Llama3 Mongolian, Unsloth GGUF, Gemma-3 finetune


Unsloth AI (Daniel Han) ▷ #research (2 messages):

Evals, Finetunes, Arxiv papers


OpenRouter (Alex Atallah) ▷ #announcements (13 messages🔥):

Grok 3, Grok 3 Mini, Optimus Alpha, Quasar Alpha, Gemini 2.5 Pro


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

AlphaLog AI, Financial Journal


OpenRouter (Alex Atallah) ▷ #general (592 messages🔥🔥🔥):

Gemini 2.5 Pro Preview, Optimus Alpha vs Quasar, Grok 3 API, OpenAI's next model


aider (Paul Gauthier) ▷ #general (573 messages🔥🔥🔥):

Gemini 2.5 Pro vs Claude, Optimus Alpha and Quasar Alpha in Aider, Copilot and Aider Integration, OpenAI Max 6x Price, Sam Altman Excited about New ChatGPT Features


aider (Paul Gauthier) ▷ #questions-and-tips (31 messages🔥):

aider SEARCH/REPLACE syntax, Aider auto add files, Aider lints, Aider Repo-map tokens, Aider model RUST


aider (Paul Gauthier) ▷ #links (1 messages):

Claude 3.5 Sonnet, o3-mini context windows, Codebase token counts


Manus.im Discord ▷ #general (549 messages🔥🔥🔥):

Manus Free Credits, Google Firebase vs Manus, Manus Customer Service, Auto-Translation Tools, Remote Coding Jobs


Perplexity AI ▷ #general (413 messages🔥🔥🔥):

Perplexity Pro Search, Gemini 2.5 Pro Context Length, Broken Spaces, Image Generation Issues, Grok 3 Integration


Perplexity AI ▷ #sharing (1 messages):

lalactus: https://www.perplexity.ai/search/qs-ranking-2025-BNeZsV.XTZCb5op7jqBwQA#0


Perplexity AI ▷ #pplx-api (2 messages):

Playground searches vs API, Website Relevance in Searches


LM Studio ▷ #general (99 messages🔥🔥):

LM Studio and iPhone, Llama-4 on consumer hardware, Deepcogito 70b Template, Gemma 3 Issues, LM Studio Prompt Preprocessor


LM Studio ▷ #hardware-discussion (172 messages🔥🔥):

LM Studio on Cloud Server, llama.cpp vs Ollama, Multi-GPU performance in LM Studio, Mac vs NVIDIA for AI, DGX Spark


Interconnects (Nathan Lambert) ▷ #news (169 messages🔥🔥):

Qwen3 release speculation, MoE VLMs discussion, OpenAI Pioneers Program, GPT-4.1 and naming schemes, Memory feature improvements


Interconnects (Nathan Lambert) ▷ #ml-drama (1 messages):

xeophon.: https://x.com/openainewsroom/status/1910105151492575611?s=61


Interconnects (Nathan Lambert) ▷ #random (27 messages🔥):

OLMo 2 Furious Paper, Discord Message Saving, Prompt Injection, OpenAI Banning for Distillation, Phoebe Model


Interconnects (Nathan Lambert) ▷ #memes (1 messages):

Qwen, Reddit, LLM


Interconnects (Nathan Lambert) ▷ #reads (10 messages🔥):

Seed Thinking Model, Smol LM Branding, Reasoning Datasets Competition


Interconnects (Nathan Lambert) ▷ #policy (13 messages🔥):

Data Dominance, National Data Reserve, USA AI comeback, Dingboard mug


Cursor Community ▷ #general (198 messages🔥🔥):

Restore Checkpoint Functionality, Gemini 2.5 Pro Max API Error, Firebase Pricing, MCP Usage, Cursor Rules


OpenAI ▷ #annnouncements (2 messages):

ChatGPT Memory, BrowseComp


OpenAI ▷ #ai-discussions (139 messages🔥🔥):

Veo 2 release, Grok 3 API, GPT-4-turbo, Sora limitations, ChatGPT moderation layer


OpenAI ▷ #gpt-4-discussions (2 messages):

Memory rollout, Context window, Token limits, Memory storage, Free-tier availability


OpenAI ▷ #prompt-engineering (13 messages🔥):

Assistant API context handling, Prompt Engineering tips, Generating multiple choice questions, Eliciting deeper responses from chatbots


OpenAI ▷ #api-discussions (13 messages🔥):

Prompt engineering, Assistant API, Context Contamination, Generating Multiple Choice Questions, Chat History Analysis


MCP (Glama) ▷ #general (147 messages🔥🔥):

MCP server proxy, parallel tool calls, A2A vs MCP, Semantic tool calling, MCP registry optimizations


MCP (Glama) ▷ #showcase (6 messages):

GraphQL MCP Server, Mobile Application Security MCP Server, MCP Server Hosting Service, Open Source MCP Server for Thingsboard


GPU MODE ▷ #general (2 messages):

AMD stock prices, Trump tariffs


GPU MODE ▷ #triton (2 messages):

Triton vs cuBLAS, Optimizing Triton Kernels, AMD Challenges


GPU MODE ▷ #cuda (1 messages):

Cutlass 3.x, Pointcloud project, ScatteredGather+GEMM fusion


GPU MODE ▷ #announcements (1 messages):

AMD $100K Competition, Kernel Optimization, MI300, Reasoning Models, FP8 GEMM


GPU MODE ▷ #algorithms (1 messages):

LLAMA models, Scout Model, QK Norm, L2 Norm, Chunked Attention


GPU MODE ▷ #cool-links (2 messages):

AlexNet Source Code


GPU MODE ▷ #beginner (64 messages🔥🔥):

Producer-Consumer Model on GPUs, Tensor-based GPU Database, Element-wise Kernel Performance, GPU Parallelism and Thread Execution, Nsight Compute (NCU) Usage


GPU MODE ▷ #rocm (24 messages🔥):

tilelang performance, AMD developer challenge, MI300X benchmarks, ROCm profilers


GPU MODE ▷ #reasoning-gym (3 messages):

SFT+CoT, arc agi 2, arc-agi 1 CoT


GPU MODE ▷ #general (6 messages):

AMD MI300 submissions, OpenAI API alternatives


GPU MODE ▷ #submissions (9 messages🔥):

matmul, A100, T4, H100, L4


GPU MODE ▷ #amd-competition (32 messages🔥):

AMD Competition Details, AMD Contractor Participation, Team Formation, AMD Kernel Experience, HIP vs CUDA


Modular (Mojo 🔥) ▷ #general (39 messages🔥):

Modular Meetup, GPU programming with MAX and Mojo, Compiler status, Blind programmer using Mojo, GPU Support on Mojo


Modular (Mojo 🔥) ▷ #mojo (47 messages🔥):

Mojo OS kernel, __mlir_fold, MimIR paper, Mojo package installation, Contribution graph


Modular (Mojo 🔥) ▷ #max (15 messages🔥):

MAX disk space, Max cache size config, magic clean cache


HuggingFace ▷ #general (30 messages🔥):

Free Jupyter Notebooks, ZeroGPU Quota, Sidebar Closing Courses, Saving HTML Code on Phone, LLM Training Libs


HuggingFace ▷ #i-made-this (16 messages🔥):

Google Firebase Studio, NextJs, LAION initiative, instant interview bot


HuggingFace ▷ #core-announcements (1 messages):

Diffusers v0.33.0, Image & Video Gen Models, Memory Optimizations, torch.compile() support for LoRAs


HuggingFace ▷ #computer-vision (1 messages):

Street Fighter 6, OpenCV, Discord Chat Bot, Gemme3, Langchain


HuggingFace ▷ #NLP (1 messages):

DocLing Project, Mapping Headings, Information Column, Paper ID Download


HuggingFace ▷ #smol-course (3 messages):

smolagents CodeAgent, Ollama, Llama3.2, qwen2:7b, Error in code parsing


HuggingFace ▷ #agents-course (27 messages🔥):

Google ADK, Torch Installation Issues, Gemini Flash API, Ollama Models for AI Agents, Langchain vs LlamaIndex


Notebook LM ▷ #announcements (1 messages):

NotebookLM Plus, Audio Overviews, Source Limits, User Research


Notebook LM ▷ #use-cases (18 messages🔥):

Discord Guidelines, NotebookLM as Notetaking App, Discover Feature Use Case, NotebookLM Plus, Time References in Source Document Titles


Notebook LM ▷ #general (50 messages🔥):

Gemini and TPU, Mobile app release, Chat Style Modifications, PDF image recognition, Source Discovery


Nous Research AI ▷ #general (32 messages🔥):

Psyche Explainer, Logits going negative, Grok 3, Quasar and Optimus Alpha, Live Modulation


Nous Research AI ▷ #ask-about-llms (20 messages🔥):

Nous API system prompts, VLMs for OCR on device, OpenAI API changes, Nemotron model behavior


Nous Research AI ▷ #interesting-links (7 messages):

mlss2025.mlinpl.org, self encrypted cloud backup/sync, use your own openrouter API key, local chats


Torchtune ▷ #general (23 messages🔥):

MCP Explained, MCP vs Regular Tool Calls, MCP Ethereum Analogy, Google A2A Announcement


Torchtune ▷ #dev (21 messages🔥):

Sharding Strategies, Llama4 Support, Scout model, Maverick model, iRoPE implementation


Nomic.ai (GPT4All) ▷ #general (35 messages🔥):

Code Formatting Preferences, DeepSeek vs. Qwen Model Distillation, GPT4ALL logging, Small LLMs for LocalDocs, Chocolatine-3B-Instruct-DPO-v1.2-GGUF


Eleuther ▷ #general (8 messages🔥):

Kuramoto oscillatory networks, Google's Ironwood TPU, Agent Development Kit


Eleuther ▷ #research (21 messages🔥):

Mollifiers in ML Research, Label Smoothing, Reasoning on/off Models, Blurring Issues, Transformer Architectures


Eleuther ▷ #interpretability-general (5 messages):

Influence Functions, Marketing vs Technical teams


LlamaIndex ▷ #general (21 messages🔥):

ChatGPT web app speed vs. RAG search speed, AgentWorkflow linearity, Agents as tools, Job availability


tinygrad (George Hotz) ▷ #general (4 messages):

Nvidia nvdec, Mesa branch, Video decode


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

MultiLazyBuffer error, Llama unexpected output, _transfer function, BufferCopy fallback


DSPy ▷ #general (5 messages):

Codebase Context, Caching Subsystem


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Course Deadlines, Course Website


Cohere ▷ #「💬」general (1 messages):

kaithwas.abhijeet: Has anyone fine-tuned Aya vision 8B parameter model using LoRA or QLoRA?


Codeium (Windsurf) ▷ #announcements (1 messages):

Grok-3, Windsurf, Pricing, Credits




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}