Frozen AI News archive

Reasoning Price War 2: Mistral Magistral + o3''s 80% price cut + o3-pro

**OpenAI** announced an **80% price cut** for its **o3** model, making it competitively priced with **GPT-4.1** and rivaling **Anthropic's Claude 4 Sonnet** and **Google's Gemini 2.5 Pro**. Alongside, **o3-pro** was released as a more powerful and reliable variant, though early benchmarks showed mixed performance relative to cost. **Mistral AI** launched its **Magistral** reasoning models, including an open-source **24B parameter** version optimized for efficient deployment on consumer GPUs. The price reduction and new model releases signal intensified competition in reasoning-focused large language models, with notable improvements in token efficiency and cost-effectiveness.

Canonical issue URL

Reasoning too cheap to meter.

AI News for 6/9/2025-6/10/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 9374 messages) for you. Estimated reading time saved (at 200wpm): 715 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Every 3-4 months we get a big leg down in the cost of the frontier LLM (March 2024, Aug 2024, Jan 2025), and today we got confirmation of the 80% price cut for o3, making it nominally the same cost as GPT 4.1, the non-reasoning model. (you can be forgiven for suspecting the price cut is due to distillation, but this is categorically denied). Of course, the real cost come in the reasoning token efficiency, and fortunately o3 is notably better than Gemini and Deepseek in that department:

Alongside of this o3 price cut, o3 pro was released, which if the o1/o1-pro relationship holds is more or less 10 o3's in a trenchcoat (and is priced that way).

This news is released conveniently on the same day as Mistral's Magistral reasoning model - a 24B open source version and a Medium closed version - that would've otherwise taken today's headline. We're REALLY glad though that Mistral is continuing to release good open source models but unfortunately the o3 price cut is more likely to be the relevant story for the majority of AI engineers today.


AI Twitter Recap

Large Language Models (LLMs) & AI Model Releases

AI Infrastructure & Tools

AI Applications & Use Cases

AI Industry & Market Dynamics

AI Research & Philosophy

Humor, Memes & General Observations


AI Reddit Recap

/r/LocalLlama Recap

1. Mistral Magistral Reasoning Model Releases and Discussion

2. Qwen3 0.6B Embedding Model Semantic Search Demos

3. Cutting-Edge AI Architectures: Apple Parallel-Track MoE and Meta Superintelligence Initiatives

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI o3 and o3-pro: Price Cuts, Model Release & Community Reactions

2. ChatGPT Outage: User Experiences, Memes & Sub Reactions

3. Breakthroughs in Video Generation: Self-Forcing Model Discussions


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The AI Model Arms Race: New Releases and Fierce Competition

Theme 2: Powering AI: Innovations in Tooling, Frameworks, and Platforms

Theme 3: Engineering AI: Deep Dives into Model Mechanics and Optimization

Theme 4: Navigating the AI Frontier: User Experiences, Bugs, and Workarounds

Theme 5: AI in Action: Showcases, Use Cases, and Community Collaborations


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


Eleuther Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Yannick Kilcher Discord


Notebook LM Discord


GPU MODE Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Manus.im Discord Discord


LlamaIndex Discord


Cohere Discord


Torchtune Discord


DSPy Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


LLM Agents (Berkeley MOOC) Discord


Codeium (Windsurf) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Unauthorized Promo Codes, Fair Pricing, Legitimate Promotional Deals


Perplexity AI ▷ #general (1170 messages🔥🔥🔥):

Family Guy character sexuality, O3 pricing and performance, Gemini vs Other models, Perplexity AI New Features & Issues


Perplexity AI ▷ #sharing (2 messages):

``


Perplexity AI ▷ #pplx-api (7 messages):

PPLX API Config Request, Social Media API integration, PPLX Finance Search Mode


LMArena ▷ #general (1130 messages🔥🔥🔥):

User preference vs other metrics, o3 price and performance, Kingfall: a better model


OpenAI ▷ #annnouncements (2 messages):

OpenAI o3-pro, ChatGPT Pro, API access


OpenAI ▷ #ai-discussions (539 messages🔥🔥🔥):

GPT-4 as co-author, ethical and truth alignment in advanced LLM systems, OpenAI Bugs, Claude Pro vs OpenAI, Gemini 2.5


OpenAI ▷ #gpt-4-discussions (29 messages🔥):

Reasoning Models Looping, Mom-GPT Anger Issues, Custom GPT Diversity, Opening Custom GPT Files, Chat File Upload Limits


OpenAI ▷ #prompt-engineering (16 messages🔥):

Model Iteration, API Image Prompting, Hallucinated Translation, AI Server Issues, Image generation difficulties


OpenAI ▷ #api-discussions (16 messages🔥):

Iterative model usage, Image prompting in o3, ChatGPT hallucination issue, AI server slowness


OpenRouter (Alex Atallah) ▷ #announcements (4 messages):

Magistral, Mistral's Reasoning Model, OpenRouter New Models, Model Pages


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Jamflow, Discord testers


OpenRouter (Alex Atallah) ▷ #general (523 messages🔥🔥🔥):

Crypto Payment Options, OpenAI o3 Price Cut, Model Degradation Concerns, OpenRouter and BYOK for o3, LLM choice for research purposes


Cursor Community ▷ #general (436 messages🔥🔥🔥):

Local Models with Cursor, Student Pro Access, Cursor Rules, Agent Mode Hangs, Eslint Issues


Cursor Community ▷ #background-agents (40 messages🔥):

Docker errors with background agents, MCP calls with background agents, Privacy mode on Cursor, Git errors in background agents, Background agent quotas


Eleuther ▷ #general (402 messages🔥🔥):

Userbots, GPTs agents, OpenAI's sidebars, Slop-posting, O3 pro


Eleuther ▷ #research (40 messages🔥):

Google/DM's GaTO paper follow up, Mixed LM head/regression in transformers, SOTA SVG transformer, binary representation of the coordinates as target, fully deduping internet scraped data


Eleuther ▷ #interpretability-general (2 messages):

Coaching Layer, Reasoning Training


Unsloth AI (Daniel Han) ▷ #general (174 messages🔥🔥):

Gemma 3 fine-tuning issues, Unsloth and multi-GPU support, Mistral's new Magistral models, GRPO vs DAPO, DeepSeek Qwen3 Tool Calling Accuracy Increased


Unsloth AI (Daniel Han) ▷ #off-topic (61 messages🔥🔥):

Triton Resources, GRPO runs and reward functions, Orpheus TTS model, Hyperbolic for finetuning, NoisySpeechDetection audio classifier


Unsloth AI (Daniel Han) ▷ #help (145 messages🔥🔥):

Unsloth 2.0 Release, Training AI on Discord Messages, QLoRA Finetuning with Unsloth, Whisper Lora Implementation and Issues, GGUF Model Size Differences


Unsloth AI (Daniel Han) ▷ #research (19 messages🔥):

Vision Language Models Datasets, Reasoning Models Reliability, KV-Cache Pruning, Disaggregated Prefilling and NTP, AIME 2025


HuggingFace ▷ #general (129 messages🔥🔥):

LLMs for HTML/CSS, Entity Recognition for IDs, Lightweight LLMs, ÆNTHESISAI cognitive architecture, Deepseek censored?


HuggingFace ▷ #cool-finds (2 messages):

Reasoning Models, LLM Reliability, Prompt Engineering


HuggingFace ▷ #i-made-this (142 messages🔥🔥):

Truth Engine, Quantum-Resistant Truth Persistence, KVMM: Timm for Keras 3, LLM Agent Framework


HuggingFace ▷ #computer-vision (3 messages):

Bias Datasets, Invoice Extractor, KVMM library, Keras 3


HuggingFace ▷ #NLP (2 messages):

Invoice Extractor, Build your own, Guidance needed, OCR, LLMs


HuggingFace ▷ #agents-course (55 messages🔥🔥):

Langgraph vs Smolagents, E2B in Unit 2.1, Azure OpenAI Model, Dynamic Python Code Generation, Course Completion Deadline


LM Studio ▷ #general (53 messages🔥):

LM Studio Developer Mode on Linux, LM Studio and TTS, LM Studio Image Generation, LM Studio Settings not Saving, LM Studio API Swagger


LM Studio ▷ #hardware-discussion (127 messages🔥🔥):

DGX Spark limitations, Memory bandwidth bottlenecks, Distributed computing for models in homelab, ROCm/HIP PyTorch on Windows, Speculative decoding on different GPUs


aider (Paul Gauthier) ▷ #general (148 messages🔥🔥):

Gemini 2.5 Pro vs Claude Opus, DeepSeek R1 speed, Aider uninstall, OpenAI's O3 Pricing, Kingfall


aider (Paul Gauthier) ▷ #questions-and-tips (16 messages🔥):

aider MCP server, Cloning a large repo, Gemini-2.5-03-25 and Rust, Ollama model unloading, fireworks' deepseek-r1-0528


aider (Paul Gauthier) ▷ #links (1 messages):

agentic embedded coding workflow, PlatformIO, Cline, FREE DeepSeek OpenRouter API, microcontrollers


Nous Research AI ▷ #general (133 messages🔥🔥):

Magistral Benchmarking, GRPO Modifications, Claude's Dynamic Token Limit, Control Tokens, ProRL Effects on Larger Models


Nous Research AI ▷ #research-papers (4 messages):

KV Compression, GRPO for TTS LLMs


Nous Research AI ▷ #interesting-links (8 messages🔥):

AI Heart Monitoring, Frutiger Aero, Biological Computers


Nous Research AI ▷ #research-papers (4 messages):

KV Compression, GRPO for TTS LLMs


Yannick Kilcher ▷ #general (53 messages🔥):

Diffusion models, Hardware failure prediction, Reservoir Computing, Tolman Eichenbaum Machine


Yannick Kilcher ▷ #paper-discussion (13 messages🔥):

Variational Bayesian approach, World modeling and decision making, Introduction to complex subject, BioML people in berlin


Yannick Kilcher ▷ #ml-news (29 messages🔥):

Mistral AI, Magistral, Open Source, GPT-4


Notebook LM ▷ #use-cases (16 messages🔥):

NotebookLM podcast intro, Google Chat integration, Drive file access errors, Video feature release date, Control over Google Workspace document access


Notebook LM ▷ #general (57 messages🔥🔥):

Time tracking apps, Iceland workshop feedback, Geographic access issues, Audio overview issues, Sharing notebooks issues


GPU MODE ▷ #general (4 messages):

deepwiki, GLSL, Vulkano, GPU grouping, clustering algorithm


GPU MODE ▷ #triton (6 messages):

FP16 support, Triton.Config num_warps control, Triton shared memory limits, LeetGPU challenges with Triton precision issues, Triton ROCm libdevice.round error


GPU MODE ▷ #cuda (3 messages):

CUPTI, Performance Counters, nvbench


GPU MODE ▷ #torch (11 messages🔥):

functorch, FSDP2, torch.compile with custom operators


GPU MODE ▷ #jobs (1 messages):

NeoSpace, GB200, CUDA, Brazil


GPU MODE ▷ #irl-meetup (1 messages):

ossmar: Does someone here is attending to the ACM PODC 2025?


GPU MODE ▷ #rocm (9 messages🔥):

SQTT traces, Radeon GPU Analyzer (RGA), rocprofv2, CUDA graphs, Memory access fault


GPU MODE ▷ #liger-kernel (1 messages):

Liger Collective Library, ByteDance Triton-distributed


GPU MODE ▷ #self-promotion (3 messages):

Mojo Programmers on BlueSky, Modular raises funding


GPU MODE ▷ #🍿 (2 messages):

Dataset Generation, Diverse Datasets, Augmented Datasets


GPU MODE ▷ #reasoning-gym (1 messages):

RL, Reasoning Training, Magistral Paper


GPU MODE ▷ #general (10 messages🔥):

Hackathons, Benchmarking, CUDA events


GPU MODE ▷ #submissions (2 messages):

Chinese problem-solving approach, New Bilibili article


GPU MODE ▷ #factorio-learning-env (1 messages):

Roadmap


GPU MODE ▷ #cutlass (2 messages):

CuTE docs, Cutlass, Triton


GPU MODE ▷ #mojo (1 messages):

Modular + AMD, Python Interop


Latent Space ▷ #ai-general-chat (56 messages🔥🔥):

Fireworks AI RFT Beta, OpenAI o3 Pricing, Mistral's Magistral Model, Meta's Potential Scale AI Stake, DeepSeek Model Narrative


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Modular Livestream, Compute Portability


Modular (Mojo 🔥) ▷ #mojo (52 messages🔥):

Mojo parameterization limits, Zig vs Mojo, Python Syntax similar to Go, Mojo-MAX Platform relationship, Double copy explaination


MCP (Glama) ▷ #general (41 messages🔥):

MCP Server Selection, Building n8n for MCP, FastMcp Dependencies, Mature MCP SDK, MCP file downloads


MCP (Glama) ▷ #showcase (10 messages🔥):

Glama build system details, MCP OpenMemory demo, OAuth 2.1 module for MCP servers, MCP servers for *arrs, mcp-openverse npm package


Manus.im Discord ▷ #general (50 messages🔥):

Manus pricing, Veo 3, EDU email accounts, Growth person Mixedbread, AI Search infra


LlamaIndex ▷ #blog (5 messages):

Custom Multi-Turn Memory, Real-time Website Summaries, LlamaIndex Agent as MCP Server, Databricks Data + AI Summit, Knowledge Agents to Automate Workflows


LlamaIndex ▷ #general (14 messages🔥):

Agent Workflow, Handoff Issues, DirectOutputAgent, Multi-Agent Systems, OpenAI Agents SDK


LlamaIndex ▷ #ai-discussion (6 messages):

Open Source Deep Research, Long Context Generation, Local Machine Research, spy-search Github repo


Cohere ▷ #🧵-general-thread (15 messages🔥):

Cohere support channels, Cohere Open Science Community


Cohere ▷ #📣-announcements (2 messages):

Cohere North, GameWarden Integration, EnsembleHP Partnership


Cohere ▷ #🔌-api-discussions (4 messages):

Open Source Repo for Contributions, API Tier Discussion, Reranking API Latency


Cohere ▷ #👋-introduce-yourself (3 messages):

Vitalops, Datatune, Open Source Tools, Data Transformations, Natural Language Data Transformation


Cohere ▷ #🔔-ping-settings (1 messages):

competent: Moved to id:customize


Torchtune ▷ #dev (15 messages🔥):

HuggingFaceModelTokenizer Usage, Muon Performance in torchtune, Tokenizer truncation bugs, Kimi Moonlight paper, Qwen2


DSPy ▷ #general (8 messages🔥):

Transfer learning, DSPy documentation, DSPy 3 announcement, Context optimization in DSPy, Dataset building and export tools


tinygrad (George Hotz) ▷ #general (8 messages🔥):

Failing Tests, Bounty Locked Meaning, NCHWCPUGraph / LLVMGraph Refactor


Nomic.ai (GPT4All) ▷ #general (5 messages):

Nomic Embed Text v1.5, Nomic GPT4All future versions, Python SDK update, GPT4All support for Mistral's Magistral Small


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (2 messages):

Leaderboard Updates, GPU Resources


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

Agent Marketplace Status


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

Agentic AI Summit, Early Bird Tickets, UC Berkeley, Speaker announcements


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 messages):

SP25 Course, Quiz Questions


Codeium (Windsurf) ▷ #announcements (1 messages):

Planning Mode, Windsurf Wave 10, o3 model pricing