Frozen AI News archive

Chinese Models Launch - MiniMax-M1, Hailuo 2 "Kangaroo", Moonshot Kimi-Dev-72B

**MiniMax AI** launched **MiniMax-M1**, a 456 billion parameter open weights LLM with a 1 million token input and 80k token output using efficient "lightning attention" and a GRPO variant called CISPO. **MiniMax AI** also announced **Hailuo 02 (0616)**, a video model similar to **ByteDance's Seedance**. **Moonshot AI** released **Kimi-Dev-72B**, a coding model outperforming **DeepSeek R1** on SWEBench Verified. Discussions on multi-agent system design from **Anthropic** and **LangChain** highlighted improvements in task completion and challenges like prompt injection attacks, as demonstrated by **Karpathy** and **Columbia University** research. **Sakana AI** introduced **ALE-Agent**, a coding agent that ranked 21st in the AtCoder Heuristic Competition solving NP-hard optimization problems. There is unverified news about an acquisition involving **OpenAI**, **Microsoft**, and **Windsurf**.

Canonical issue URL

We're not sure if open models are all you need but hey they're still shipping

AI News for 6/13/2025-6/16/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 13085 messages) for you. Estimated reading time saved (at 200wpm): 1106 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Behind DeepSeek and Qwen there's a second tier of Chinese Labs that are doing respectable model training, and for reasons unknown both Minimax and Moonshot AI chose today/this weekend to launch their new models:

Yay for Open Models enjoyers :)

There is VERY late breaking news re: OpenAI vs Microsoft vs Windsurf acquisition, but it's too unverified/not technical so we did not make it title story but if confirmed it probably would be.


AI Twitter Recap

Agent & System Development, Architecture & Security

Model Releases, Performance & Capabilities

Developer Tools, Infrastructure & Frameworks

AI Research, Techniques & Evaluation

Industry News, Startups & Global Context

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

1. Recent Open-Source LLM Releases and Quantizations (Qwen3 & MiniMax-M1)

2. Educational Content: DeepSeek Architecture and Tutorials

3. AI Wrapper Startup Viability & New LLM Name Drop (Kimi-Dev-72B)

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Video Model Releases and Benchmarks

2. ChatGPT Social and Personalization Experiences

3. AI Adoption, Policy, and Cheating Scandals


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The AI Model Arms Race: New Releases and Comparative Prowess

Theme 2: Agentic AI Ascendant: Swarms, Protocols, and Complex Task Solving

Theme 3: Under the Hood: Fine-Tuning, Optimization, and Hardware Hurdles

Theme 4: Open Source vs. Closed Gardens: Models, Data, and Decentralization Debates

Theme 5: Developer Experience & Platform Pitfalls: Bugs, Billing, and Usability Battles


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


GPU MODE Discord


Manus.im Discord Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


Latent Space Discord


Eleuther Discord


Notebook LM Discord


Torchtune Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Cohere Discord


LlamaIndex Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Perplexity Research Improvements, Finance Pages Key Issues, Tasks Automated Search, Discover Page Update, Finance Futures Graphs


Perplexity AI ▷ #general (1113 messages🔥🔥🔥):

Gemini 2.5 Pro, Claude Opus 4, o3 Pro, MiniMax M1, Genspark


Perplexity AI ▷ #sharing (8 messages🔥):

Shareable Threads, Nvidia GB200, Driver Stability, Emergence, Android App Security


Perplexity AI ▷ #pplx-api (7 messages):

API credit charges, Perplexity Linux CLI client, AI startup resources


LMArena ▷ #general (963 messages🔥🔥🔥):

Kingfall vs Blacktooth, Grok 3.5 release, Gemini 2.5 Pro, Minimax M1 open source, LLM privacy


LMArena ▷ #announcements (1 messages):

Models Erroring Out, Models not responding, Model API Issues


OpenAI ▷ #annnouncements (1 messages):

ChatGPT Image Generation, WhatsApp Integration


OpenAI ▷ #ai-discussions (957 messages🔥🔥🔥):

AI vs Human, GPT Plus Worth It, Sora video generation, Veo 3 video generation, GPT Model Performance


OpenAI ▷ #gpt-4-discussions (86 messages🔥🔥):

GPT-4o's Memory, Fine-tuning GPT Models, Custom GPT Model Selection, DALL-E 3 Removal, Canvas auto updating


OpenAI ▷ #prompt-engineering (135 messages🔥🔥):

Pandoc, HTML parsing, Sora AI prompting, O3 model prompting, GPT coherence


OpenAI ▷ #api-discussions (135 messages🔥🔥):

Pandoc vs awk for parsing, HTML noisy tokens, Long form responses from O3, Sora AI prompts for image generation, GPT coherence loss


Unsloth AI (Daniel Han) ▷ #general (575 messages🔥🔥🔥):

GPU detection issues with Unsloth, Unsloth-DeepSeek-R1-0528-UD-IQ2_M benchmark results, Hugging Face model downloading issue, Unsloth fine-tuning notebooks, AMD compatibility with Unsloth


Unsloth AI (Daniel Han) ▷ #off-topic (13 messages🔥):

KL Divergence Spikes, Google Colab GPU Pricing, TempleOS, Hugging Face Outage


Unsloth AI (Daniel Han) ▷ #help (266 messages🔥🔥):

Qwen2.5 vs Qwen3, GGUF conversion, DPO vs SFT, Gemma3, Llama 3.2


Unsloth AI (Daniel Han) ▷ #research (2 messages):

arxiv link


Cursor Community ▷ #general (750 messages🔥🔥🔥):

Claude 4 Sonnet Performance, Cursor UI Issues, MCP Usage, Code Privacy, Bug Reporting


Cursor Community ▷ #background-agents (40 messages🔥):

GitHub integration, Background Agents Permissions, Background agents and Slack, Background Agents and Privacy Mode, Toggle bug with background agents


HuggingFace ▷ #general (553 messages🔥🔥🔥):

AI-Generated Feedback, Open Sourcing AGI, Bigram Testing, Qwen 2.5, HF Pro Disk Space


HuggingFace ▷ #today-im-learning (2 messages):

HF audio course, Agents course, MCP course


HuggingFace ▷ #cool-finds (1 messages):

cakiki: <@844851718512443423> No referrals please


HuggingFace ▷ #i-made-this (9 messages🔥):

peft-bench, InfiniGPT French Q&A dataset, Shisa AI Japanese model, Swiftide Rust library for agentic RAG applications, QuantIntelli Football Betting Analysis


HuggingFace ▷ #reading-group (2 messages):

Portfolio Theory, Dr. Peter Cotton, Schur Portfolios


HuggingFace ▷ #smol-course (3 messages):

Smolagents, Ollama, Code Agents, Local Model Selection


HuggingFace ▷ #agents-course (21 messages🔥):

HF Inference Costs, Local LLMs with Ollama, Unit 3 Assignments, Agentic RAG Locally, Unauthorized Imports


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

Chess Leaderboard, Book Testing


OpenRouter (Alex Atallah) ▷ #general (533 messages🔥🔥🔥):

OpenRouter Discord Tag Request, Claude Prompt Debugging, GPT-4.1 Mini Offering, Free Model Credit Usage, Multilingual Model Recommendations


LM Studio ▷ #general (247 messages🔥🔥):

TokenBreak Attack, AMD Ryzen AI Mini PC, Increasing RAG size in LM Studio, MiMo VL 7B RL UD support, LLM Image Organizer


LM Studio ▷ #hardware-discussion (95 messages🔥🔥):

GMKtec Windows install issues, RTX 6000 Pro wattage configuration, Graphics cards and coil whine, NVLINK performance experiments, GPU Recommendations for LLMs


GPU MODE ▷ #general (6 messages):

PD disaggregation, Transformer moment for agents, Groq speed, Groq Huggingface


GPU MODE ▷ #triton (9 messages🔥):

tl.constexpr behavior with expressions, Thread-level control flow in Triton, Triton kernel warmup time vs torch.compile, Single-row softmax kernel implementation


GPU MODE ▷ #cuda (55 messages🔥🔥):

CUDA cache policies, TF32 vs FP16 precision, L40 vs 4090 performance, nvcc generating LDS instruction, GCC and NVCC version compatibility


GPU MODE ▷ #torch (2 messages):

CUDA kernel blocksize args, TorchTitan training graph capture


GPU MODE ▷ #announcements (2 messages):

d-Matrix team, Dr. Lisa Su, GPU MODE, kernel data, Project Popcorn


GPU MODE ▷ #beginner (1 messages):

Instruction latencies, arxiv.org


GPU MODE ▷ #torchao (5 messages):

MX-FP4 Matmul, MX-FP8 Matmul, CUTLASS, CuBLAS, FP4 Weight Quality


GPU MODE ▷ #off-topic (1 messages):

HQQ Rebrand, Quantum Quantization


GPU MODE ▷ #rocm (19 messages🔥):

Nvidia to AMD transpilation, AMD stable inference deployment, MI300A architecture, IODs and infinity cache, memory distribution


GPU MODE ▷ #self-promotion (4 messages):

Thrust library, CUDA kernels, segmented sum algorithm, iterators, high_resolution_clock vs steady_clock


GPU MODE ▷ #🍿 (6 messages):

Tensor Core Algorithm Reformulation, RL for Tensor Core Usage, Kernel Code Verification, GPU Thinking Interpretability


GPU MODE ▷ #thunderkittens (4 messages):

ThunderKitten on Older GPUs, TK to AMD port, Attention Kernels with Variable Length


GPU MODE ▷ #reasoning-gym (1 messages):

Chain of Thought, CoT, Symbolic Reasoning, Math Reasoning


GPU MODE ▷ #submissions (16 messages🔥):

MI300 AMD-FP8-MM, Conv2D on H100, VectorAdd Leaderboard Updates


GPU MODE ▷ #factorio-learning-env (162 messages🔥🔥):

Factorio RL, Factorio Agents, Hierarchical RL and LLMs, FLE API


GPU MODE ▷ #amd-competition (1 messages):

AMD GPU, Image Analysis


GPU MODE ▷ #cutlass (1 messages):

BLISRetreat2023, UTexas presentation


Manus.im Discord ▷ #general (196 messages🔥🔥):

Manus credits, Manus speed, Minimax copy of Manus, Manus AI updates, Manus Agent mode


Nous Research AI ▷ #general (112 messages🔥🔥):

Decentralized Pre-training, Hermes 4 Training, Bandwidth Differentials, AI Evals company, Multilingual reasoning in AI


Nous Research AI ▷ #ask-about-llms (8 messages🔥):

Gemini 2.5 Pro, Chain of Thought (CoT) prompting, Reasoning Techniques, API key setup, Hyperbolic integration


Nous Research AI ▷ #research-papers (19 messages🔥):

Bitter Lesson, Generalist vs SME, Grounding in reality, Gene edits to cure cancer


Nous Research AI ▷ #interesting-links (8 messages🔥):

WebSummit talk on closed internet/AI, Robotic Skin, Deep Residual Learning


Nous Research AI ▷ #research-papers (19 messages🔥):

Bitter Lesson, Generalist vs SMEs, Nature Article, Arxiv Paper, Observable Reality


aider (Paul Gauthier) ▷ #general (135 messages🔥🔥):

VS Code forks, TUI, RA-Aid, Context Window Management, LLM Personas


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

LLM-OpenAPI-minifier integration with aider, Setting API keys within Aider, Aider's agentic capabilities, Loading active parameters in VRAM for MoE models like Qwen3


Latent Space ▷ #ai-general-chat (136 messages🔥🔥):

Claude Swarm for team management, Proactive AI agents definition, Anthropic's multi-agent system, LLM as judge for evaluations, Cursor for writers AI tool alternatives


Eleuther ▷ #general (75 messages🔥🔥):

Landmark Papers in new field, LLMs as narrative simulators, AI-generated papers with mathematical errors, pytorch dataloader workers, EleutherAI community vs research focus


Eleuther ▷ #research (50 messages🔥):

DMCA and Copyright Law, Emergent behavior in independent tasks, Llama-3.2-1B-Instruct ARC-AGI, Qwen3 tokenizer and image understanding


Eleuther ▷ #interpretability-general (1 messages):

LLM Fairness, Interpretability Interventions, Unfaithful Chain of Thought


Eleuther ▷ #lm-thunderdome (5 messages):

Benchmark Evaluation Algorithm, Inspect Standard Format, Eval Coalition Effort


Eleuther ▷ #gpt-neox-dev (1 messages):

Vitabyte Founder, GroK-scale Training, Multi-node LLM fine-tuning, ROCm + CUDA, Full stack Ops


Notebook LM ▷ #use-cases (19 messages🔥):

Notebook LM Plus Access, PM Interview Conversational AI Platform, Exam Prep with NotebookLM, Chrome Extension for NotebookLM, Podcast Personality Shaping


Notebook LM ▷ #general (86 messages🔥🔥):

LaTeX in NLM, Image Uploading, Android App of NotebookLM, Podcast issues, Mindmaps on iPad


Torchtune ▷ #dev (69 messages🔥🔥):

DTensor cross-mesh operation, Llama4 maverick finetuning, Iterable packing, Fused optimizer, Flex attention


Torchtune ▷ #papers (15 messages🔥):

Mistral Small, Magistral model, ZO optimizer, Flex integration


Modular (Mojo 🔥) ▷ #general (46 messages🔥):

RDNA4 support, AVX512_BF16, Zen 4, Mojo testing structure, 1-bit model support


Modular (Mojo 🔥) ▷ #mojo (32 messages🔥):

CUDA Stream Synchronization, Mojo C ABI, Mojo Zed Extension, Mojo 'let' deprecation, Mojo AOT compilation


MCP (Glama) ▷ #general (40 messages🔥):

MCP in Agentic Frameworks, A2A Agent Discovery, FastMCP and Server Composition, GitHub APIs in MCP Server, Orchestrator Agent Recommendations


MCP (Glama) ▷ #showcase (7 messages):

SchemaPin for Rug Pulls, Glama MCP servers support streamable HTTP, excel-mcp-server, User analytics and live debugging for MCPs


Cohere ▷ #🧵-general-thread (20 messages🔥):

Cohere documentation typo, Team collaboration with LLMs, AI/backend developer introduction, Cohere's work with the government, Secure ML and privacy preservation


Cohere ▷ #🔌-api-discussions (5 messages):

direct-injected-document tool, Cohere a032025 memory usage


Cohere ▷ #👋-introduce-yourself (6 messages):

AI developers introductions, Custom bots, Automations, Scalable systems, Secure machine learning


LlamaIndex ▷ #blog (3 messages):

Data + AI Summit 2025, Agentic Document Workflows, Multi-Agent System, AI Travel Agents, AI Agents in Production


LlamaIndex ▷ #general (26 messages🔥):

LandingAI vision agent vs LlamaIndex, Synk hiring, Faiss Index, LlamaCloud contact sales page, LlamaExtract parsing errors


DSPy ▷ #show-and-tell (1 messages):

DSPy Optimization Patterns


DSPy ▷ #general (19 messages🔥):

DSPy runners, TextGrad Optimizer, Custom LM Concurrency, DAIS Session Write-Up, BootstrapFewShot Optimizer


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (8 messages🔥):

Certificates, Assignment Selection, MOOC Quiz Archive


MLOps @Chipro ▷ #events (1 messages):

ControlThrive, Outerbounds, ML Consulting


Codeium (Windsurf) ▷ #announcements (1 messages):

Claude Sonnet 4 API Access, Anthropic models, API Pricing