Frozen AI News archive

MiniMax M2 230BA10B — 8% of Claude Sonnet's price, ~2x faster, new SOTA open model

**MiniMax M2**, an open-weight sparse MoE model by **Hailuo AI**, launches with **≈200–230B parameters** and **10B active parameters**, offering strong performance near frontier closed models and ranking #5 overall on the Artificial Analysis Intelligence Index v3.0. It supports coding and agent tasks, is licensed under **MIT**, and is available via API at competitive pricing. The architecture uses **full attention**, **QK-Norm**, **GQA**, partial RoPE, and sigmoid routing, with day-0 support in **vLLM** and deployment on platforms like Hugging Face and Baseten. Despite verbosity and no tech report, it marks a significant win for open models.

Canonical issue URL

A nice win for open models.

AI News for 10/24/2025-10/27/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (198 channels, and 14738 messages) for you. Estimated reading time saved (at 200wpm): 1120 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

4 months after MiniMax M1, Hailuo AI is back with MiniMax M2 (free chatbot, weights, github, docs) with some impressive, but measured claims: a very high 23x sparsity (Qwen-Next still beats it) and SOTA-for-Open-Source performance:

Bar graph showing the Artificial Analysis Intelligence Index v3.0 with various AI models and their performance scores, with MiniMax M2

There are some hairs - it is a very verbose model and there was no tech report this time, but overall this is a very impressive model launch that comes clsoe to the frontier closed models under a very comprehensive set of benchmarks.

Bar graph showing performance benchmarks of various AI models across different tasks, with MiniMax M2 highlighted in red and compared against other models like


AI Twitter Recap

MiniMax M2 open-weights release: sparse MoE for coding/agents, strong evals, and architecture clarifications

Post-training and reasoning: on-policy distillation momentum, long-horizon stress-tests, and agent frameworks

Architectures and attention design: shifting away from linear attention, MoE insights, and context compression

Infra and performance: collectives at 100k+ GPUs, FP8 that actually wins end-to-end, and real-world hardware notes

Frameworks, libraries, and courses

Safety, enterprise, and benchmarking

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Open-Source Model Adoption in Silicon Valley

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. AI Model and Workflow Innovations

2. AI Citation Milestones

3. Claude Code Usage and Fixes


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. New Models & Frameworks Shake Up the Scene

Theme 2. The Model Performance & Behavior Report

Theme 3. Developer Experience Plagued by Bugs, Costs, and Security Flaws

Theme 4. Low-Level Optimization and GPU Wizardry

Theme 5. The Evolving AI Ecosystem & Industry Standards


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenRouter Discord


HuggingFace Discord


Yannick Kilcher Discord


GPU MODE Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


Nous Research AI Discord


Moonshot AI (Kimi K-2) Discord


Eleuther Discord


aider (Paul Gauthier) Discord


MCP Contributors (Official) Discord


DSPy Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


Windsurf Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1101 messages🔥🔥🔥):

Referral Reward System Changes, Comet Browser Issues, GPT-5-mini is underrated, Google Cooking, AI Models


Perplexity AI ▷ #sharing (4 messages):

Code generation for YouTube, Predicting Outcomes, Image generation, Pitch workspace


Perplexity AI ▷ #pplx-api (5 messages):

Comet API, Sora AI code


LMArena ▷ #general (1239 messages🔥🔥🔥):

AI image generation, AI ethics, AI video generation, Gemini 3, Sora vs Veo


LMArena ▷ #announcements (1 messages):

LMArena, minimax-m2-preview, X.com


Cursor Community ▷ #general (1046 messages🔥🔥🔥):

Token Consumption, New Pricing, Claude Code Limits, Cursor Unstable Build, Cheetah Model


Cursor Community ▷ #background-agents (3 messages):

Background Agents, Tracking Background Agent Progress, Background Agent Creation Errors


OpenAI ▷ #annnouncements (2 messages):

GPT-5, Mental Health Experts, ChatGPT, Sensitive Moments


OpenAI ▷ #ai-discussions (737 messages🔥🔥🔥):

AGI Safety, AI Usage, AI Ethical Implications, Sora 2, Atlas Browser Privacy


OpenAI ▷ #gpt-4-discussions (66 messages🔥🔥):

Microsoft Copilot Breakdown, Builder Profile Verification, Custom GPT Avatar Issues, ChatGPT Quality Drop, Adult-Mode Announcement


OpenAI ▷ #prompt-engineering (76 messages🔥🔥):

Animating PNGs with AI, Prompt Injection, OpenAI Model Spec, Temporal Optimal Video Generation, Prompt Engineering for Code Generation


OpenAI ▷ #api-discussions (76 messages🔥🔥):

Animating PNGs with AI, Prompt Engineering Lessons, Temporal Optimal Video Generation, Exploiting Model Chain of Thought


Unsloth AI (Daniel Han) ▷ #general (376 messages🔥🔥):

Ollama CVE-2024-37032, Qwen3-Next model, Dynamic 2.0 Quantization, Vector artists looking for work, Qwen 2 VL 2B inference on MLX


Unsloth AI (Daniel Han) ▷ #introduce-yourself (5 messages):

AI agent expertise offered, AI trust and safety PhD student


Unsloth AI (Daniel Han) ▷ #off-topic (290 messages🔥🔥):

Andor is best SW content, NN to a biological brain, AI creativity, GPT answer, deepfabric


Unsloth AI (Daniel Han) ▷ #help (92 messages🔥🔥):

Llama obsession, Jais model, Hugging Face model usage, GGUF conversion issues, SageMaker Unsloth installation


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

NVIDIA Blackwell support, Unsloth Optimization Techniques


Unsloth AI (Daniel Han) ▷ #research (17 messages🔥):

GPT-5, Thinking Machines, LoRA, eNTK, La-LoRA


LM Studio ▷ #general (226 messages🔥🔥):

LM Studio crash, User Nicknames, Stellaris finetuning, Published Plugins, Chat logs RAG


LM Studio ▷ #hardware-discussion (380 messages🔥🔥):

vram, Flash attention, intel b60, 4090


OpenRouter ▷ #announcements (1 messages):

tool calling, audio inputs, API key limits, MiniMax M2


OpenRouter ▷ #app-showcase (6 messages):

Next.js Chat Demo with OAuth 2.0, or3.chat Document Editor Project, Shadcn UI Discussion, OpenRouter TypeScript SDK, localStorage plaintext API key security


OpenRouter ▷ #general (459 messages🔥🔥🔥):

Response API System Message, deepinfra/turbo for Meta-llama, OpenRouter Benchmarks, Claude Sonnet 4.5 API usage, Vertex AI API misrouting


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (42 messages🔥):

Minimax M2 Pricing, GPT-5.1 Mini Speculation, Model Naming Conventions, Meta's Llama 4 Reasoning, Discord Channel Degradation


HuggingFace ▷ #general (223 messages🔥🔥):

GPT Pro video, AI glyphs, Model encryption for clients, Licensing models, Infinite storage solution


HuggingFace ▷ #i-made-this (4 messages):

GAN+VAE+Diffusion hybrid modular architecture, Live PyTorch Memory Profiler, Intilium AI Compliance Layer


HuggingFace ▷ #computer-vision (3 messages):

projecting 1D feature vectors to 2D segmentation map, diffusion, VAEs and GANs


HuggingFace ▷ #NLP (1 messages):

syllable separation model, multiple languages


HuggingFace ▷ #gradio-announcements (1 messages):

Free Modal Credits, AI Agents and MCP, Online Hackathon


HuggingFace ▷ #smol-course (10 messages🔥):

Submitting models to leaderboard, Dataset issues in hf jobs, Lighteval and emoji errors


HuggingFace ▷ #agents-course (5 messages):

API Outage, Rate Limiting, 404 Errors


Yannick Kilcher ▷ #general (175 messages🔥🔥):

Elastic Weight Consolidation, Self-Hosted GPU Setups, Catastrophic Forgetting Solutions, ArXiv Paper Discovery Engines, Linear Projections


Yannick Kilcher ▷ #paper-discussion (40 messages🔥):

Neuronpedia Line Break Attribution Graphs, DeepMimic Porting for LAION, Strudel Music Programming for Audio Models, Undergrad Publication Project Ideas, DOI System Failover


Yannick Kilcher ▷ #agents (1 messages):

rogerngmd: Novel idea. Are u using McP


Yannick Kilcher ▷ #ml-news (6 messages):

Elon's Twitter data effects, Schmidhüber arxiv, odyssey.ml experience


GPU MODE ▷ #general (9 messages🔥):

Access to GPU nodes, Torchcomms/ncclx session, Speaker/lecture request, Learning CUDA, Cute's layout algebra


GPU MODE ▷ #triton (18 messages🔥):

Triton Matrix Multiplication Performance on T4 vs A100, Triton Input Pointer Casting in Kernels, Split-K GEMM Kernel in Triton


GPU MODE ▷ #cuda (43 messages🔥):

CUDA fork behavior, GPU bandwidth modeling, Vectorized data types performance, NCU Profiler for memory throughput, Signed vs. unsigned loop indices in CUDA


GPU MODE ▷ #torch (1 messages):

High Dimensional Tensors, Matrix of Matrices


GPU MODE ▷ #cool-links (1 messages):

KernelBench, GPU Kernel Generation, LLM Kernel Generation


GPU MODE ▷ #jobs (5 messages):

Small inference optimized models for code gen, Morph Internship, ML Project Deep Dives


GPU MODE ▷ #beginner (4 messages):

Budget Friendly Cloud GPU Providers, Vast.ai, RunPod.io, Lightning.ai, Compiling Applications to Run on a GPU


GPU MODE ▷ #pmpp-book (1 messages):

Cutlass documentation


GPU MODE ▷ #off-topic (2 messages):

GEMM, Meme Creation


GPU MODE ▷ #irl-meetup (2 messages):

LLVM dev meeting, SuperComputing in St Louis


GPU MODE ▷ #self-promotion (2 messages):

Penny beats NCCL, vLLMs custom allreduce, CuTeDSL reduction, Quack library, RMSNorm CUDA


GPU MODE ▷ #🍿 (5 messages):

GPU Mode Kernel Leaderboard, GitHub Kernels Dataset, Heterogeneous Computing Code on GitHub, Triton/CUDA Repos


GPU MODE ▷ #thunderkittens (1 messages):

Thundermla, sm120, async tma, async mma, tcgen05


GPU MODE ▷ #submissions (7 messages):

prefixsum_v2 leaderboard, vectorsum_v2 leaderboard, A100 results


GPU MODE ▷ #hardware (1 messages):

id_ab_ling: how to download fieldiag


GPU MODE ▷ #cutlass (14 messages🔥):

Availability of Presentation Slides, Representable Layouts in CuTe, Swizzles in CuTe


GPU MODE ▷ #mojo (11 messages🔥):

Pixi vs UV, CUDA version and non-Nvidia, Toolchain installation


GPU MODE ▷ #singularity-systems (8 messages🔥):

JAX vs PyTorch2 for pedagogy, Graph acquisition mechanisms, Dual language problem with Python/C++, Mojo and LLVM intrinsics


GPU MODE ▷ #general (1 messages):

achal: How do you get the benchmark results from the website?


GPU MODE ▷ #multi-gpu (3 messages):

NCCL Debugging, Megatron Optimizer, Distributed Optimizer


GPU MODE ▷ #irl-accel-hackathon (38 messages🔥):

Mini-PyTorch Project, Oulipo Flavor in Coding, GPU Memory Allocation, PyTorch Distributed Hacking, Monarch/Torchforge


GPU MODE ▷ #llmq (1 messages):

CPU offloading, NPU Framework


Modular (Mojo 🔥) ▷ #general (23 messages🔥):

Mojo Setup, MAX Support Contract, AMD Consumer vs Datacenter Cards, Apple Silicon Support, Windows Compatibility


Modular (Mojo 🔥) ▷ #mojo (110 messages🔥🔥):

GPU random module location, Property testing framework, LayoutTensor limitations, MLIR vs LLVM, Mojo's metaprogramming


Modular (Mojo 🔥) ▷ #max (2 messages):

MAX, Huggingface, Torchvision, torch_max_backend


Latent Space ▷ #ai-general-chat (99 messages🔥🔥):

Tahoe AI, ImpossibleBench, MiniMax M2, OpenAI Ads, OpenAI Sora Rate


Latent Space ▷ #genmedia-creative-ai (18 messages🔥):

OpenAI Real-Time Bidirectional Speech Translation, MiniMax M2, fal Generative Media Conference, Odyssey-2 Launch


Nous Research AI ▷ #general (71 messages🔥🔥):

API parameter removal, Reasoning models, Pretraining on 3090, AI and web dev jobs, ML/AI streamers


Nous Research AI ▷ #ask-about-llms (3 messages):

GPT Ideology, Model Meta-Awareness, Claude's Persona


Nous Research AI ▷ #research-papers (8 messages🔥):

KBLaM vs RAGs, AI training data quantity, Business RAG getting common, Microsoft Service Provider


Nous Research AI ▷ #interesting-links (6 messages):

Translation using Data, Temporal Optimal Video Generation, Grandma Optimality, Prompt engineering via rhyme


Nous Research AI ▷ #research-papers (8 messages🔥):

KBLaM vs RAGs, AI training data limitations, Business RAG adoption, Refusal instruction tuning


Moonshot AI (Kimi K-2) ▷ #general-chat (93 messages🔥🔥):

Kimi CLI on PyPI, GLM vs Kimi, Moonshot Coin, Kimi Coding Plan, Ultra Think feature


Eleuther ▷ #general (34 messages🔥):

Open Source AI, GPU Resources Contributions, AI Accelerator Chips, Petals Project, AI Evaluation and Ethics


Eleuther ▷ #research (35 messages🔥):

Searching input spaces for models, Feature Engineering, CSM-1B usage, Theoretical Computer Science papers, Product Key Search


Eleuther ▷ #interpretability-general (2 messages):

Anthropic's Research, Polysemanticity in Neural Networks


aider (Paul Gauthier) ▷ #general (40 messages🔥):

aider-ce Navigator Mode, MCPI PR adding RAG, GitHub Copilot Subscription Benefits, LoRA/QLoRA with Claude, Aider's working directory bug


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

Aider's Future, Aider-ce, Paul Gauthier, Next AI coding tool


aider (Paul Gauthier) ▷ #links (1 messages):

Aider-CE, Chrome-Devtools


MCP Contributors (Official) ▷ #general (7 messages):

MCP Registry, GitHub MCP Registry, Tool's Title Annotation


MCP Contributors (Official) ▷ #general-wg (36 messages🔥):

Global Notifications, Multiple SSE Streams, TypeScript SDK Bug, Server vs Session Confusion


DSPy ▷ #papers (1 messages):

lidar36: They just added the code


DSPy ▷ #general (31 messages🔥):

DSPy vs Langchain, GPT-4o upgrades, Claude code web feature, GEPA love, Streaming with REACT


tinygrad (George Hotz) ▷ #general (12 messages🔥):

Tiny Box hardware specs, FSDP implementation in tinygrad, TinyJIT optimization


tinygrad (George Hotz) ▷ #learn-tinygrad (12 messages🔥):

tinygrad PRs, tinygrad Bounties, TinyJit performance, Kernel Fusion bug


MLOps @Chipro ▷ #events (1 messages):

Data 3.0, AI-Ready Data, Nextdata OS, Autonomous Data Products, Multimodal Data Management