Frozen AI News archive

not much happened today

**OpenAI** launched **GPT-5** with a unified user experience removing manual model selection, causing initial routing and access issues for Plus users that are being addressed with fixes including restored model options and increased usage limits. **GPT-5** introduces "Priority Processing" for lower latency at higher price tiers, achieving ~750ms median time-to-first-token in some cases. Microsoft reports full Copilot adoption of **GPT-5**, and API traffic doubled within 24 hours, peaking at 2 billion tokens per minute. Early benchmarks show **GPT-5** leading in reasoning tasks like FrontierMath and LiveBench, with improvements in hallucination control and creative writing, though some models like Grok-4 and Claude-4 Sonnet Thinking outperform it in specific RL-heavy reasoning benchmarks. OpenAI also released extensive migration and feature guides but faced some rollout issues including a broken code sample and a problematic Voice Mode launch. *"Unified GPT-5" ends model pickers, pushing developers away from manual model selection.*

Canonical issue URL

a quiet day.

AI News for 8/7/2025-8/8/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 16496 messages) for you. Estimated reading time saved (at 200wpm): 1217 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of debates over the quality, style and rollout of GPT5, including the surprise decision to immediately deprecate GPT 4o, which has since been rolled back.


AI Twitter Recap

OpenAI’s GPT‑5 launch: unified UX, routing backlash, and rollout fixes

Early GPT‑5 performance: strong across reasoning, with caveats on routing, cost, and effort

Agents and developer tooling: Cursor CLI access, Claude Code background tasks, LangChain/LlamaIndex integrations

Open models, long‑context, and training/serving infra

Google, Anthropic, and “what matters beyond LLMs”

Meta: models vs. routing vs. agents; “benchmarketing” and evals discourse

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3 Ultra-Long Context Model Upgrades

2. Open Source vs. Proprietary AI Model Benchmarks and Debate

3. Running Large Models Efficiently on Consumer Hardware (Llama.cpp & GPT-OSS)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI GPT-5 Release Backlash and Model Removal Controversy

2. GPT-5 Benchmarks, Math, and Comparative Performance Reviews

3. Wan 2.2 Video AI Model Workflows, Guides, and Releases


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. OpenAI GPT-5 Rollout, Routing, and Reality Checks

2. New Agent & Dev Tooling

3. Open‑Source Training and Finetuning Updates

4. Multimodal, Video, and Long‑Context Advancements

5. GPU/Systems Insights and Compilers


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Moonshot AI (Kimi K-2) Discord


HuggingFace Discord


Latent Space Discord


Eleuther Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


Notebook LM Discord


GPU MODE Discord


LlamaIndex Discord


aider (Paul Gauthier) Discord


DSPy Discord


Manus.im Discord Discord


Cohere Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


MCP (Glama) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

kesku: https://fixvx.com/perplexity_ai/status/1953537170964459632 <@&1105626802732404746>


Perplexity AI ▷ #general (873 messages🔥🔥🔥):

Gemini AI Video Generation, GPT-5 performance on Perplexity, Comet Browser AI tasks, Accessing Perplexity Pro


Perplexity AI ▷ #sharing (4 messages):

GPT-5 Release, Solar Powered High-Altitude Platform, Gemini Coding


Perplexity AI ▷ #pplx-api (1 messages):

Front-end improvements


LMArena ▷ #general (1436 messages🔥🔥🔥):

GPT-5 Performance, Gemini 2.5 Pro vs GPT-5, Yupp.ai Legitimacy, LM Arena Outage, Claude 4.1 Opus


LMArena ▷ #announcements (3 messages):

Staff AMA, Video Arena, New models, gpt-5-mini-2025-08-07, gpt-5-nano-2025-08-07


OpenAI ▷ #annnouncements (2 messages):

GPT-5, Sam Altman AMA


OpenAI ▷ #ai-discussions (973 messages🔥🔥🔥):

GPT-5, Gemini Flash, Model Routers, Data scrubbing, Local AI


OpenAI ▷ #gpt-4-discussions (75 messages🔥🔥):

GPT-5 rollout and availability, GPT-5 performance and limitations, Firefox data persistence issue, Hosting custom GPTs, AI tools for LinkedIn management


OpenAI ▷ #prompt-engineering (14 messages🔥):

ChatGPT-5, Prompt Engineering, AI Prompt Management Tool, Model Behavior Exploration, LinkedIn Management Service


OpenAI ▷ #api-discussions (14 messages🔥):

ChatGPT-5 Prompt Box Limitations, Prompt Engineering Techniques, AI Prompt Management Tools, Model Behavior Exploration, Alternative tools for large inputs


Cursor Community ▷ #general (841 messages🔥🔥🔥):

GPT-5 Launch, Free GPT-5, GPT-5 Limitations, Cursor CLI, Model Performance Comparison


Cursor Community ▷ #background-agents (8 messages🔥):

PR creation flow issues, Background workers and PR creation, "@cursor fix this issue" magic


Cursor Community ▷ #announcements (1 messages):

Cursor in Terminal


Unsloth AI (Daniel Han) ▷ #general (1016 messages🔥🔥🔥):

GPT-5, Unsloth support for MXFP4, RVC (voice conversion) language specifics, Dataset preparation, GPT-OSS and GGUF


Unsloth AI (Daniel Han) ▷ #introduce-yourself (14 messages🔥):

Model Fine Tuning Costs, Unsloth AI Documentation, Developer Introductions


Unsloth AI (Daniel Han) ▷ #announcements (1 messages):

GPT-OSS, Qwen3-Coder + 2507, Unsloth updates


Unsloth AI (Daniel Han) ▷ #off-topic (15 messages🔥):

LLMs playing board games, GPT-5 performance, Coding with LLMs


Unsloth AI (Daniel Han) ▷ #help (166 messages🔥🔥):

VLLM update fixes, WSL instructions Don't work, GPT-OSS on Tesla T4 is slow, Fine tuning models to write in certain style


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

loayxz: https://huggingface.co/loay/ArabicOCR-Qwen2.5-VL-7B-Vision


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

41M HRM-based Model, Chain-of-Thought Reasoning Mirage, Importance of Datasets, Small Specialized Fine-Tuned Models, Tiny Stories Dataset


OpenRouter (Alex Atallah) ▷ #general (800 messages🔥🔥🔥):

GPT-5 vs GPT-5 Chat, Gemini 3.0 vs GPT-5, Deepseek Switching to Ascend, Horizon Beta Replacement


OpenRouter (Alex Atallah) ▷ #new-models (2 messages):

``


OpenRouter (Alex Atallah) ▷ #discussion (23 messages🔥):

GPT-5 BYOK, o3, OpenRouter Trusted Partner, generation_time, moderation_latency


LM Studio ▷ #general (281 messages🔥🔥):

YouTube downloader alternatives, Custom AI bot, LM Studio vs. VLLM for parallel requests, GLM-4.5 offloading, Qwen model improvements


LM Studio ▷ #hardware-discussion (74 messages🔥🔥):

Apple M4, HX 370, 5080 FE Availability, PSU for 5080 FE and 3090, RTX 3090 for 120b GPT OSS Model


Moonshot AI (Kimi K-2) ▷ #general-chat (214 messages🔥🔥):

GPT-5, Kimi K2, OpenRouter, Qwen, Model Quantization


HuggingFace ▷ #general (182 messages🔥🔥):

GPT-5 release, GPT-OSS finetuning, Eleven Music, Voice companion pipeline, Automatic video cutter


HuggingFace ▷ #i-made-this (8 messages🔥):

AERIS V4 launch, Modular framework for managing persistent memory, Devlancr - Tinder for Developers, AERIS is schizo


Latent Space ▷ #ai-general-chat (145 messages🔥🔥):

GPT-5, Claude Code, Cursor CLI, Model Deprecation, Nitter Maintenance


Latent Space ▷ #ai-announcements (13 messages🔥):

GPT-5, OpenAI Dominance, Transformer Models, GPT-5 Vision, AI General Intelligence (AGI)


Eleuther ▷ #general (115 messages🔥🔥):

NSP vs Attention, Lower compute requirements for training language models, Memory layer for LLMs, GPT-5 drawing incorrect information in images, AR models combined with diffusion models


Eleuther ▷ #research (13 messages🔥):

FineWeb dataset cleanliness, Pythia's Hidden Activation Dynamics, LM Evaluation Harness Exact Match Issues, Learning Rate Schedule Impact


Nous Research AI ▷ #general (83 messages🔥🔥):

GPT-5 Logic Puzzles and Overfitting, Free GPT-5 API Access, Cheap Colab Alternatives, GLM 4.5 Air Performance and Offloading, Multi-GPU setups for MoE models


Nous Research AI ▷ #ask-about-llms (1 messages):

Claude jailbreak


Nous Research AI ▷ #interesting-links (2 messages):

Mechanistic faithfulness, StreamingLLM


Modular (Mojo 🔥) ▷ #general (49 messages🔥):

Mojo TUI library, Textual Python apps, Mojo's inability to create classes, Rust libraries


Modular (Mojo 🔥) ▷ #mojo (12 messages🔥):

Mojo Compiler Register Warnings, VSCode Mojo Extension Instability, Modular Forum, Minecraft Server Rewrite, Minecraft Protocol in Mojo


Modular (Mojo 🔥) ▷ #max (14 messages🔥):

MaxCompiler, LLMs, kernel fusion, torch.compile(), Transformers


Yannick Kilcher ▷ #general (39 messages🔥):

Twitch Streaming, LinkedIn Blogging, Attention Span, Ocean Sound or Fireplace Sound, Gaussian Distribution


Yannick Kilcher ▷ #paper-discussion (3 messages):

AI Avatar, SDXL, Fast Layers vs Slow Layers, Autodifferentiable Architectures, Gradient Estimation


Yannick Kilcher ▷ #ml-news (31 messages🔥):

LLMs for diagnosis, congress.gov bill, Over the counter cold medicine ineffective, Pharmacists prescribing, Tesla special


Notebook LM ▷ #use-cases (6 messages):

NotebookLM Voice, AI Web Builder Tool, Scratchpad Framework, NotebookLM for Binge Watching


Notebook LM ▷ #general (46 messages🔥):

Notebook thumbnails, Audio Overview Issues, Custom Notebooks, Sensitive Content Research, Audio Issues


GPU MODE ▷ #general (10 messages🔥):

Parameter Scaling, Speculative Decoding, Parallel Programming, ROCm Channel Spam


GPU MODE ▷ #triton (1 messages):

Privacy Team Approval for Registration, Registration Process Update


GPU MODE ▷ #cuda (4 messages):

Machine Level Element Type Distinctions, S8/S16 vs U8/U16 Variants


GPU MODE ▷ #beginner (1 messages):

CUDA kernel debugging, Grid-stride loops


GPU MODE ▷ #metal (2 messages):

Naive Matmul Kernels, Memory Access Patterns, Hardware Coalescing


GPU MODE ▷ #self-promotion (4 messages):

Open Source Voxel Renderer, Rust, WebGPU, Data Streaming, Raytracing


GPU MODE ▷ #hardware (1 messages):

paolovic: thank you!


GPU MODE ▷ #factorio-learning-env (12 messages🔥):

Game Engine Speed, Meeting Reschedule, Player Inventory Transfers, Factorio Native Saves


GPU MODE ▷ #cutlass (7 messages):

CuTe Layouts, Jay Shah's Notes on CuTe Layouts, Layout Algebra Counterexamples


GPU MODE ▷ #singularity-systems (2 messages):

Liveness Analysis, Scalar Compilation Performance, Vector Compilation with Autovectorization and SIMTification


GPU MODE ▷ #multi-gpu (2 messages):

Axolotl, N-D Parallelism, HuggingFace Blog


LlamaIndex ▷ #blog (6 messages):

GPT-5, Agent Maze, Zoom RTMS, ZeroEntropy AI rerankers, Claude citations


LlamaIndex ▷ #general (39 messages🔥):

llama-index upgrade for gpt-5, workflow tools not working, OpenAI SDK issue and workaround, AgentWorkflow error, llama_deploy compatibility


aider (Paul Gauthier) ▷ #general (41 messages🔥):

Horizon vs GPT5 for agentic coding, Aider GPT-5 on Azure, Aider version updates, Dad meme thumbs up, Python 3.13 support


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

Cursor alternative design, OpenRouter's GPT5 errors, aider config parsing failures


DSPy ▷ #general (41 messages🔥):

Context7 MCP Server, Claude Code Tooling, DSPy Tool Calling, CrewAI Prompts Optimization with DSPy


Manus.im Discord ▷ #general (14 messages🔥):

Annual Membership Billing Error, Inherit Feature Problems, Login Error, Missing Credits, Manus vs GPT5


Cohere ▷ #🧵-general-thread (4 messages):

command-a-vision-07-2025 timing out, Embed v4 vs v3 for vector search, AI Knowledge Domains


Cohere ▷ #📣-announcements (1 messages):

AI Agent capabilities, Generative AI, Workflow automation, Data security, Compliance


Cohere ▷ #👋-introduce-yourself (6 messages):

New member introductions, Trading systems with RL and AI agents, Transformers and GNNs


Cohere ▷ #🧭-status-feed (1 messages):

Command-a-vision-07-2025, degraded performance, Cohere Status Page


Cohere ▷ #🔬-research (1 messages):

masaru.yamada: Great


tinygrad (George Hotz) ▷ #general (6 messages):

tensor to mathtraits, unit tests failures, github actions


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

ShapeTracker Visualization Tool


Nomic.ai (GPT4All) ▷ #general (6 messages):

GPT-5 Rumors, GPT-OSS-20B-GUFF Installation Issues, GPT4All Update Status, GPT-ASS Critique


MCP (Glama) ▷ #showcase (2 messages):

MCPOmni Connect, OmniAgent, AI agent builder