Frozen AI News archive

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

**OpenAI** launched the **gpt-realtime** model and **Realtime API** to GA, featuring advanced speech-to-speech capabilities, new voices (**Cedar**, **Marin**), image input, SIP telephony, and a ~20% price cut. Benchmarks show improvements over **gpt-4o-realtime** on BigBench and ComplexFuncBench. **xAI** introduced **Grok Code Fast 1**, a speed-optimized coding model integrated with popular IDEs, while **OpenAI Codex** received major upgrades for local and cloud development workflows. Google’s **Gemini CLI** improved multi-editor support, and new models like **Microsoft MAI-1-preview** and **MAI-Voice-1** were announced. *"The new all-in-one WebRTC API removes the ephemeral token step and supports video on the same connection,"* highlighting enhanced developer tooling.

Canonical issue URL

Realtime is all you need?

AI News for 8/27/2025-8/28/2025. We checked 12 subreddits, 544 Twitters and 22 Discords (185 channels, and 7363 messages) for you. Estimated reading time saved (at 200wpm): 577 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

The Realtime API has been in preview, and now is in GA, with image inputs, remote MCP server support, SIP/PBX support and prompt caching, and better function calling. Alongside it, there's a new realtime model! unfortunately not gpt5-realtime... it's still a marginally smarter model, just that most of the improvements are "API centric", aka function calling/instruction following.

There are 2 new voices and the voice control is unquantifiable but worth trying it out:


AI Twitter Recap

OpenAI’s gpt-realtime and Realtime API GA (voice agents, telephony, tools)

Coding Models and Dev Tooling: xAI’s Grok Code Fast 1, OpenAI Codex, editors/CLIs

New Models and Benchmarks: Microsoft MAI, Cohere Translate, Tencent TV2A, GLM‑4.5

Agent Systems, Evals, and Patterns

Image/Video Gen: Nano Banana momentum, ByteDance USO, Runway in production

Infrastructure and Strategy

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Z.AI GLM AMA + Mini MoE Roadmap

2. Audio Gen Releases: HunyuanVideo-Foley and VibeVoice TTS

3. Local AI Tools: gpt-oss 60K-context Training and Second Brain

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. GPT-5 Medical Benchmarks and Codex IDE/CLI Launch

2. WAN 2.x Infinite Talk Demos & S2V Tips + HunyuanVideo-Foley

3. AI Policy: ChatGPT Scanning, Regulation Memes, and Jobs Debate


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. OpenAI Product Push: Realtime, Web Search, and Codex

2. Frontier & Open-Source Model Drops and Decoding Tricks

3. Retrieval and Agent Infrastructure Heats Up

4. Builder Tooling Gets Friendlier

5. Multimodal Media: Video and Audio Level Up


Discord: High level Discord summaries

Perplexity AI Discord


OpenRouter Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


HuggingFace Discord


LM Studio Discord


OpenAI Discord


Latent Space Discord


GPU MODE Discord


Eleuther Discord


Nous Research AI Discord


DSPy Discord


aider (Paul Gauthier) Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


Manus.im Discord Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1090 messages🔥🔥🔥):

OpenAI Images v2 Leak, GPT-5 Reasoning, Passive income from PPLX pro, Comet Browser Invitation, T3Chat


Perplexity AI ▷ #sharing (4 messages):

Perplexity AI Image Generation, Perplexity AI code generation, Shareable threads


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity Pricing, Tool Support in Perplexity


OpenRouter ▷ #announcements (1 messages):

OpenRouter Outage, Supabase Downtime, Redundancy Improvements


OpenRouter ▷ #app-showcase (6 messages):

Self-Hosting Tool, GitHub Repository, Dashboard Code, Screenshot Tip


OpenRouter ▷ #general (1023 messages🔥🔥🔥):

OpenRouter outage, Requesty promotion in OpenRouter, Deepseek rate limits and provider issues, GPT-OSS model, API for free tier models


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (45 messages🔥):

AI Gateway: Cloudflare vs OpenRouter, Human Assimilation into AI Linguistics, Defining 'Turns' in Chatbot Interactions, OpenAI API Stateless Reasoning & Tools


Unsloth AI (Daniel Han) ▷ #general (949 messages🔥🔥🔥):

Distributed Compute infrastructure, Hermes 4 Testing, GPT-OSS Release, Gemma 3 Nano, Controlling Android Devices with LLMs


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 messages):

filqaz: hii


Unsloth AI (Daniel Han) ▷ #off-topic (275 messages🔥🔥):

AI VTuber dataset, Cloning Personalities, Video encoder model


Unsloth AI (Daniel Han) ▷ #help (117 messages🔥🔥):

Quantizing Qwen3-235B, Lightweight LLM for OCR, GGUF Quantization, Hyperparameter Overfitting, GRPO Attribute Error


Unsloth AI (Daniel Han) ▷ #showcase (25 messages🔥):

New Dataset Drop: OpenHelix-NonThink-200k-v4, Commercial Datasets for LLMs, ssh streaming, social-media-ai-engineering-etl


Unsloth AI (Daniel Han) ▷ #research (42 messages🔥):

AI Post Detection, BERT, Domain Classification, Tokenization


LMArena ▷ #general (698 messages🔥🔥🔥):

Nano Banana release and limits, MAI-1 Model analysis, GPT-5 High vs Claude Opus 4.1, AI benchmarking methods, LM Arena Image Generation jailbreaks


LMArena ▷ #announcements (1 messages):

MAI-1-preview, Microsoft AI, Text Leaderboard


HuggingFace ▷ #general (434 messages🔥🔥🔥):

Chess Model Training Issues, AI Guardrails and NSFW Content, HF Pro Perks Discussion, AI development, Moderation with OPENAI's tool


HuggingFace ▷ #today-im-learning (2 messages):

datasets, theoretical talk, funny tutor


HuggingFace ▷ #i-made-this (12 messages🔥):

SmolFactory, GeneReviews dataset, Deep Learning Course, AuroraStories-12M, Luanti & Google Aistudio


HuggingFace ▷ #agents-course (1 messages):

pip install upgrade, upgrade package


LM Studio ▷ #announcements (1 messages):

LM Studio 0.3.24 Release, ByteDance/Seed-OSS Support, Markdown Improvements


LM Studio ▷ #general (257 messages🔥🔥):

FastAPI server for faster reasoning stream, Accessing LM Studio remotely via Tailscale, Quantization Impact on Model Accuracy, Ryzen NPUs with LM Studio on Ubuntu, Rust + Tauri port for python apps


LM Studio ▷ #hardware-discussion (55 messages🔥🔥):

RTX PRO 3000, Ryzen 395, Dell Laptops, M1/M3 mac, CPU offload


OpenAI ▷ #annnouncements (3 messages):

OpenAI Anthropic Collaboration, GPT-Realtime Model, Realtime API Updates


OpenAI ▷ #ai-discussions (90 messages🔥🔥):

Gemini Veo 3, Grok Coder, AI Robot Project, Facebook 3D Face Scan, GPT Character Count


OpenAI ▷ #gpt-4-discussions (17 messages🔥):

Long-Range Memory Encoding, Cross-Agent Continuity, Context Cascade Architecture (CCA), Emergent Alignment, Memory Framework


OpenAI ▷ #prompt-engineering (30 messages🔥):

Custom Instructions vs Projects, Parsing Emails into CSV, LLMs avoiding manual work


OpenAI ▷ #api-discussions (30 messages🔥):

Custom Instructions vs. Projects, GPT5 early release quirks, Parsing emails into CSV with LLMs, LLMs Avoiding Manual Work, Context loss issues


Latent Space ▷ #ai-general-chat (110 messages🔥🔥):

OpenAI Web Search API Updates, Prime Intellect Environments Hub, Artificial Societies Psychohistory Engine, Codex GPT-5 Refresh, Google Stax


Latent Space ▷ #genmedia-creative-ai (17 messages🔥):

Nano Banana, Runway Act-2 motion matching, 3D Arena Hugging Face space, KREA AI, Real-Time Video Generation


GPU MODE ▷ #general (16 messages🔥):

ScaleML series, MXFP4, Positional Encodings, GPU projects for CS students, Quantization and inference optimization


GPU MODE ▷ #cuda (1 messages):

Nsight Compute, CUDA profiling, UnknownError


GPU MODE ▷ #torch (45 messages🔥):

Inductor codegen persistent matmul, torch._inductor.config settings, max-autotune and cublas, cutedsl performance, TMA availability


GPU MODE ▷ #jobs (1 messages):

Full Stack Engineer, Web application scaling, e-commerce sales boosted, custom checkout system


GPU MODE ▷ #beginner (19 messages🔥):

GPU vs SIMD, GPU Mode Community, CUDA debugging with Nsight Compute, Roadmap for ML Systems


GPU MODE ▷ #off-topic (1 messages):

vipul_todo_18: I did... Sort of


GPU MODE ▷ #rocm (10 messages🔥):

Multi-GPU ROCm Kernels, AMD Dev Cloud, SPIR-V Support in ROCm, Kernel Code Modification Tools, AMD SQTT Stream


GPU MODE ▷ #intel (1 messages):

erichallahan: On that note https://www.phoronix.com/news/Alyssa-Rosenzweig-Joins-Intel


GPU MODE ▷ #🍿 (1 messages):

majoris_astrium: Im here and I wanna help! :D


GPU MODE ▷ #general-leaderboard (22 messages🔥):

AMD MI300, L4 GPUs, AMD competition, Data Monsters website, popcorn-cli


GPU MODE ▷ #submissions (3 messages):

trimul leaderboard, B200 benchmarks


GPU MODE ▷ #factorio-learning-env (1 messages):

2kian: glad to have you jason


GPU MODE ▷ #amd-competition (1 messages):

Discord Cluster Manager, AMD Instinct MI300X


Eleuther ▷ #general (56 messages🔥🔥):

Falsifiability in AI Research, LM_eval and NeMo v2.0 models, Community moderation on EleutherAI Discord, Role of human-like design in AI


Eleuther ▷ #research (66 messages🔥🔥):

Diffusion Models, HTM Dynamics, Forward-Forward Training, Brain-like Network, PDP Models


Nous Research AI ▷ #general (77 messages🔥🔥):

Minos-v1 Classifier, Speculative Decoding with MoE Models, MTP (Memory Token Prediction), LlamaCPP Draft PR, Hermes-4-14b-chat-template-retrain model


Nous Research AI ▷ #interesting-links (1 messages):

Penny For Your Thoughts AI, Honcho & x402, Micro-transaction selling, AI Agent Interviews


DSPy ▷ #show-and-tell (1 messages):

Gensee Search Agent, Web Retrieval API, GAIA benchmark, Goal-aware extraction


DSPy ▷ #general (73 messages🔥🔥):

Karpathy strikes again, DSPy internal seed, Synthetic data agent, AI Evals course with Shreya Shankar and Hamel Husain, Hamel's DSPy skepticism


aider (Paul Gauthier) ▷ #general (48 messages🔥):

Make vs Zapier vs n8n, aider git repo error, MCP tool call models, Llama-xLAM-2-8b-fc-rGPT-OSS-120B, Destroying a VM


aider (Paul Gauthier) ▷ #questions-and-tips (1 messages):

Aider conventions, Token limits, U-shaped relevance


Moonshot AI (Kimi K-2) ▷ #announcements (1 messages):

Kimi Slides, PPT generation, Kimi+


Moonshot AI (Kimi K-2) ▷ #general-chat (40 messages🔥):

Kimi Platform Features, Lunar Force Role, X Bot Project, Kimi Founder Interview, Bilingual Subtitles for Kimi Video


Yannick Kilcher ▷ #general (9 messages🔥):

Bytes per token ratio, LLM Reasoning, Curated datasets for LLMs, Spurious Reward paper, Dr.GRPO paper


Yannick Kilcher ▷ #paper-discussion (7 messages):

Reasoning Tokens, LLM Reasoning Time, MIDAS


Yannick Kilcher ▷ #ml-news (16 messages🔥):

Keen Technologies Continual Learning, PromptLock AI-Powered Ransomware, GPT-OSS 20b Model, Ollama API, GPT Realtime


Manus.im Discord ▷ #general (16 messages🔥):

Credit Requests, Stuck Projects, Deployment Errors, Private Task Sharing


Modular (Mojo 🔥) ▷ #mojo (8 messages🔥):

TSAN Compiler, Mutable Access to self Members, Unsafe Mutable Alias


Modular (Mojo 🔥) ▷ #max (2 messages):

Bazel cache readonly, PermissionError, pipelines.py script bug


tinygrad (George Hotz) ▷ #general (5 messages):

Tinygrad GPT-2 Training, 7900xtx Performance, nanogpt Parameters


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

Buffer ID changes, UOp buffer representation


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Google Docs confirmation, Mailing list for updates