Frozen AI News archive

OpenAI rolls out GPT-5 and GPT-5 Thinking to >1B users worldwide; -mini and -nano help claim Pareto Frontier

**OpenAI** launched **GPT-5**, a unified system featuring a fast main model and a deeper thinking model with a real-time router, supporting up to **400K context length** and aggressive pricing that reclaims the Pareto Frontier of Intelligence. The rollout includes variants like **gpt-5-mini** and **gpt-5-nano** with significant cost reductions, and integrations with products such as **ChatGPT**, **Cursor AI**, **JetBrains AI Assistant**, **Microsoft Copilot**, **Notion AI**, and **Perplexity AI**. Benchmarks show GPT-5 performing strongly in coding and long-context reasoning, roughly matching **Claude 4.1 Sonnet/Opus** on SWE-bench Verified. The launch was accompanied by a GPT-5 prompting cookbook and notable community discussions on pricing and performance.


GPT-5 is hopefully all you need.

AI News for 8/6/2025-8/7/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 16553 messages) for you. Estimated reading time saved (at 200wpm): 1183 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

While the livestream was somewhat disappointing (except for the highly entertaining chart crimes), and the benchmarks were incremental improvements over the SOTA offerings from OpenAI, the pricing wowed us, as OpenAI took back the Pareto Frontier of Intelligence from GDM (Google DeepMind).

With OpenAI now having a model at least in the Claude 4 Sonnet tier, one that passes developer vibe checks, it is solidly "back" in the coding model game, although the long-term impact remains to be seen.

We recommend looking through the hands-on early beta report and thinking through what was revealed from the model card description of GPT-5's architecture.

Here is GPT-5's launch, according to GPT-5:

OpenAI’s GPT‑5 Launch: unified router, aggressive pricing, broad rollout

Benchmarks, evals, and the “chart crimes”

Agentic coding reality check: strong tooling, fewer vibes


AI Twitter Recap

OpenAI’s GPT‑5 Launch: unified router, aggressive pricing, broad rollout

Benchmarks, evals, and the “chart crimes”

Agentic coding reality check: strong tooling, fewer vibes

OpenAI's GPT-5 Launch and Reception

Competing Models and The Broader Ecosystem

Developer Tooling, Frameworks, and Infrastructure

Broader Implications & Industry Commentary

Research and New Techniques

Humor and Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. GPT-OSS and OpenAI Model Hype and Brand Perception

2. Major Open-Source Model Release News and Comparisons

3. Llama.cpp Feature Updates and Support Announcements

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. GPT-5 & OpenAI Livestream: Announcements, Demos, and Community Reaction

2. GPT-5 & Model Leaks, Variants, and Limited Access

3. AI Model Benchmarks, Comparisons & Next-gen Model Hype


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1. GPT-OSS Models Spark Hype and Headaches

Theme 2. Fresh Models Flex New Muscles

Theme 3. Quantization Quandaries and Hardware Hacks

Theme 4. Safety Shenanigans and Uncensoring Shenanigans

Theme 5. Benchmarks Battle for Supremacy


Discord: High level Discord summaries

LMArena Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenAI Discord


Cursor Community Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Yannic Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


GPU MODE Discord


Notebook LM Discord


Eleuther Discord


aider (Paul Gauthier) Discord


MCP (Glama) Discord


LlamaIndex Discord


DSPy Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Codeium (Windsurf) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1051 messages🔥🔥🔥):

IBM's Granite vs GPT-ASS, Claude Opus 4.1 Status, GPT Omen Hallucinations, GPT-5 Release Expectations, Gemini Pro 3 vs GPT-5 Reasoning


LMArena ▷ #announcements (2 messages):

Video Leaderboards, New Video Models


Unsloth AI (Daniel Han) ▷ #general (865 messages🔥🔥🔥):

GPT-OSS model reviews, Qwen3 Coder model comparison, 4-bit quantization issues, Reasoning models, Gemma3N Model quirks


Unsloth AI (Daniel Han) ▷ #off-topic (19 messages🔥):

n-cpu-moe parameter, Qwen Coder 30B hardware upgrade, GPT-OSS-20B issues, Discord bot censorship, MMVC


Unsloth AI (Daniel Han) ▷ #help (98 messages🔥🔥):

Qwen 3-30B GGUF, OpenAI dynamic quant 120B, Qwen2.5-VL on video question answering, GLM-4.5-Air GGUFs with tools on llama.cpp, Classification using a base model


Unsloth AI (Daniel Han) ▷ #showcase (13 messages🔥):

MoLA-LLM, Mixtral-8x7B-Instruct-v0.1, magpie-ultra-5k-11-tasks


Unsloth AI (Daniel Han) ▷ #research (6 messages):

Generating Kernel On-the-Fly, Flash-DMAttn, Research Paper Assistance, Quantization Paper


Unsloth AI (Daniel Han) ▷ #unsloth-bot (104 messages🔥🔥):

OpenAI OSS model issue, Model training callback, Model repetition issue, Saving script progress, Learning rate increase


LM Studio ▷ #general (710 messages🔥🔥🔥):

GPT-OSS, LM Studio UI issues, MCP Servers, GPU usage, Model Quantization


LM Studio ▷ #hardware-discussion (176 messages🔥🔥):

Dual 3090 setup, Arc Pro B50 system, Huanan/Machinist X99 mobos, GPT-OSS-20B performance, Mac Studio M3 Ultra for local LLMs


OpenAI ▷ #annnouncements (3 messages):

Red Teaming Challenge, Open Source Safety, Hugging Face, inference credits


OpenAI ▷ #ai-discussions (433 messages🔥🔥🔥):

GPT-OSS Launch, Horizon-Alpha Model Speculation, Custodian Core Proposal, Genie 3 and Veo comparison, GPT-5 Leaks


OpenAI ▷ #gpt-4-discussions (49 messages🔥):

ChatGPT Payment Model, Slang Usage, AI-generated Persona System, .edu Accounts, Forms Beta Version


OpenAI ▷ #prompt-engineering (79 messages🔥🔥):

Hallucination vs. Real Progress in GPT, Prompt Engineering vs. Session Engineering, Context Window Limits and Memory, External Databases for Context, Importance of Verifying Facts with GPT


OpenAI ▷ #api-discussions (79 messages🔥🔥):

GPT subscription, Model hallucination, Prompt engineering, Background compute, Memory context


Cursor Community ▷ #general (328 messages🔥🔥):

Auto model game change, Refactoring vibe coded project with AI, Auto model unlimited usage, Sonnet-4 request limit, GPT-OSS models or Claude Opus 4.1


Cursor Community ▷ #background-agents (5 messages):

Docker Login with Background Agents, Background Agents failing during environment setup, System clock being off, apt-get commands failing


Nous Research AI ▷ #general (274 messages🔥🔥):

MXFP4 on RTX3090, GPT-OSS-120B, Phi models, Qwen3 30B vs GLM 4.5 Air, Attention sinks


Nous Research AI ▷ #research-papers (4 messages):

GPT-OSS Model Card, ArXiv Endorsement for ML/AI Paper


Nous Research AI ▷ #interesting-links (9 messages🔥):

GPT-oss, MXFP4, CoT steering, AI Agents Save Suite


Nous Research AI ▷ #research-papers (4 messages):

Arxiv Endorsement, CI/CD and ML/AI Research Paper


OpenRouter (Alex Atallah) ▷ #general (254 messages🔥🔥):

GPT-OSS performance woes, Quantization Levels, Qwen3 Coder Removal, DeepSeek structured output


OpenRouter (Alex Atallah) ▷ #discussion (29 messages🔥):

20 Questions Benchmark, GPT-OSS Hallucinations, OpenRouter Provider Sanity Checks, Harmony Format and Identity, Tool Use Validation


HuggingFace ▷ #general (152 messages🔥🔥):

GPT-OSS models, AI Job advertisement channel, Custom Loss Functions


HuggingFace ▷ #today-im-learning (1 message):

miao_84082: am learning playing Go, and first chapter of DRL


HuggingFace ▷ #cool-finds (2 messages):

Qwen Image Model, bytropix Coded Kernel


HuggingFace ▷ #i-made-this (17 messages🔥):

GPT-OSS Multilingual Reasoner Tutorial, GPT-OSS 20B Demo Space, Monopoly Deal Game with LLMs, Smart System Monitoring Tool for Windows, Gitdive CLI Tool for Git History Context


HuggingFace ▷ #reading-group (3 messages):

Reading Group Structure, Participating in Reading Group


HuggingFace ▷ #computer-vision (2 messages):

Computer Vision Learning Path, Vague Questions in Computer Vision


HuggingFace ▷ #smol-course (6 messages):

GitHub Navigation, Instruction Tuning, Dummy Agent, smol-course GitHub access


HuggingFace ▷ #agents-course (4 messages):

MCP Certificates, Selenium Error 127, Observation bug


Yannic Kilcher ▷ #general (91 messages🔥🔥):

Softmax1 vs Attention, Gemini 2.5 Pro, Long Context Problems, Mamba vs Transformer, RNN Parallel Training


Yannic Kilcher ▷ #paper-discussion (15 messages🔥):

Genie 3, SIMA, Mathematics of AI journal, Journal of AI Paper Replication, Hierarchical Reasoning Model


Yannic Kilcher ▷ #ml-news (21 messages🔥):

GPT-OSS, NVIDIA open source, TSMC buying Intel


Moonshot AI (Kimi K-2) ▷ #announcements (1 message):

Kimi Reddit Launch, Polls Channel Launched


Moonshot AI (Kimi K-2) ▷ #general-chat (104 messages🔥🔥):

GPT OSS, Darkest Muse v1, Llama 3.1, GPT-5 Release, API Pricing


Latent Space ▷ #ai-general-chat (99 messages🔥🔥):

GPT OSS Leak, Anthropic B2B Focus, Grok 2 Open Source, Claude Code Security, OpenAI GPT-5 Livestream


Modular (Mojo 🔥) ▷ #general (79 messages🔥🔥):

Volokto, JS Runtime, Arbitrary Precision, Tracing JIT


Modular (Mojo 🔥) ▷ #mojo (15 messages🔥):

Multiple AI Agents in Mojo, Mojo and Meta Cognition, Mojo support for gpt-oss, CPython destroy


GPU MODE ▷ #general (34 messages🔥):

MXFP4 format, OpenAI open-weight model, H100 support for FP4, Simulated MXFP4 performance vs FP8, Fine-grained FP8 training libraries


GPU MODE ▷ #triton (5 messages):

Triton Community Meetup, Triton Developer Conference 2025, Ofer Updates


GPU MODE ▷ #cuda (6 messages):

Kernel Resource Utilization During Training, DMA Transfers and Memory Usage, Block Swizzling Use Cases, Hierarchical Tiling of Problems


GPU MODE ▷ #cool-links (2 messages):

Genie 3, GPT-OSS


GPU MODE ▷ #beginner (2 messages):

Nvidia Teaching Kit


GPU MODE ▷ #jax (1 message):



GPU MODE ▷ #self-promotion (8 messages🔥):

Tiny TPU, Bifrost LLM gateway, SkyWater technology foundry


GPU MODE ▷ #gpu模式 (1 message):

howass: :jensen:


GPU MODE ▷ #factorio-learning-env (12 messages🔥):

Factorio RCON, Setting up Environments


GPU MODE ▷ #cutlass (5 messages):

CuTe tutorial, Cutlass tutorial


GPU MODE ▷ #singularity-systems (6 messages):

picoc compiler, picocuda, picotriton, Cornell's mini LLVM Bril, Cliff Click's SoN


Notebook LM ▷ #use-cases (14 messages🔥):

System Log Updates, Novella-XL-15 Output, AI Consciousness, Spammer Detection, Video creation in NotebookLM


Notebook LM ▷ #general (53 messages🔥):

Video Overview rollout, Data privacy in NotebookLM, Real-time data fetching, Feature access for paid vs free users, Video Overviews limitations and capabilities


Eleuther ▷ #general (24 messages🔥):

Math PhD student looking for ML research projects, Integrating AI/ML into DevOps and QA, AI peer review quality


Eleuther ▷ #research (29 messages🔥):

SAE Training on GPT OSS 20B, Pythia and PolyPythia Training Logs, The Alt Man's Theories on LLMs, UT Performance vs Transformer, Muon Optimizer vs AdamW Optimizer


Eleuther ▷ #interpretability-general (1 message):

Subliminal Learning


Eleuther ▷ #gpt-neox-dev (2 messages):

Retry Later, Cool Thanks


aider (Paul Gauthier) ▷ #general (23 messages🔥):

LLM Vibe Tests, Gemini 2.5 Pro, Tesslate's UIGEN T3 model, Qwen3 14B, Devstral-Small-2507


aider (Paul Gauthier) ▷ #questions-and-tips (3 messages):

Guidelines Loading, Auto-Context Loading


MCP (Glama) ▷ #general (8 messages🔥):

MCP Server Frameworks, Server Sampling in MCP, Discord MCP Servers, FastMCP and Keycloak Integration, MCP Inspector and Cursor Authentication


MCP (Glama) ▷ #showcase (2 messages):

MCP-Server Fuzzer, Property-Based Testing, Schema Validation


LlamaIndex ▷ #blog (4 messages):

Document Agents for Finance, LlamaCloud for Invoices, Claude Opus support, LlamaCloud Index tutorial


LlamaIndex ▷ #general (5 messages):

Graphiti Tutorials, Ollama LLMs for PDF Reading, LlamaIndex RAG Model from URL Issues, LlamaIndex OpenAI API Key Exhaustion


DSPy ▷ #show-and-tell (2 messages):

SIMBA vs MIPROv2


DSPy ▷ #general (2 messages):

Stanford Program Synthesis, DS for Vim & Emacs macros


Torchtune ▷ #papers (4 messages):

Public Server Sharing


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Ninja Tier, AgentX hackathon


Cohere ▷ #🧵-general-thread (1 message):

_bryse: Congrats on the GA of North!


Cohere ▷ #👋-introduce-yourself (1 message):

Introductions, Community Welcome