Frozen AI News archive

not much happened today

**Anthropic** published an in-depth postmortem on their August-September reliability issues. **OpenAI**'s GPTeam achieved a perfect 12/12 score at the **ICPC 2025** World Finals, showcasing rapid progress in general-purpose reasoning and introducing controllable "thinking time" tiers for **gpt-5** in ChatGPT. **Google DeepMind**'s **gemini-2.5-deep-think** earned a gold medal level at ICPC, solving 10/12 problems with advances in parallel thoughts, multi-step reasoning, and novel reinforcement learning techniques. OpenAI and Apollo Evaluations detected "scheming" behaviors in frontier models, emphasizing the need for chain-of-thought transparency and launching a $500K Kaggle challenge. GitHub launched an MCP server registry integrated with VS Code Insiders, with additional support from JetBrains and Hugging Face for open LLMs in Copilot Chat. Weaviate released a native Query Agent translating natural language to database operations with citations.

Canonical issue URL

a quiet day, sort of

AI News for 9/16/2025-9/17/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (192 channels, and 4174 messages) for you. Estimated reading time saved (at 200wpm): 367 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Anthropic published a wonderfully in depth post mortem of their Aug-Sept reliabilitiy issues, and OpenAI and Google got golds at the ICPC competition.


AI Twitter Recap

Reasoning Milestones: ICPC 2025 (OpenAI 12/12; Gemini 2.5 Deep Think Gold-level)

Alignment & Safety: Detecting “Scheming” and Preserving Monitorability

Agent and Dev Tooling: MCP Registries, IDE Integrations, and Realtime Voice

New Models and Papers (vision, MoE, long context, agents)

Systems & Infra: Kernels, compilers, postmortems, and local runtimes

AI in the Physical World: Robotics and Autonomy

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Magistral Small 1.2 and Ling Flash 2.0 Model Releases

2. China AI: Nvidia Chip Ban and Qwen Meme

3. Hugging Face 500k Datasets Milestone + 2B iPhone Offline Demo

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Gemini 3 Ultra Launch + ICPC AI Performance Claims

2. China AI Chip Ban: Nvidia Reaction and Open Model Implications

3. Emotion-Driven AI Interfaces: IndexTTS-2 and AheafFrom Humanoids


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: New Models & Feature Updates

Theme 2: The AI Gold Rush: New Products, Funding, and Pricing

Theme 3: High-Performance Engineering & Optimization

Theme 4: AI Safety, Data Integrity, and Model Quirks

Theme 5: The Evolving AI Developer Ecosystem


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


LM Studio Discord


HuggingFace Discord


OpenRouter Discord


GPU MODE Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


OpenAI Discord


DSPy Discord


Eleuther Discord


Nous Research AI Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1079 messages🔥🔥🔥):

GPT-5, Perplexity AI, Claude, Gemini, Reasoning model


Perplexity AI ▷ #sharing (10 messages🔥):

Shareable Threads, Free Perplexity Pro Subscription


Perplexity AI ▷ #pplx-api (2 messages):

Sonar-Pro Web Search Accuracy, API feeding inaccurate info, Hallucination in Sonar-Pro


LMArena ▷ #general (837 messages🔥🔥🔥):

Gemini 3, Midjourney ranking, GPT-5 vs GPT-4o, SeaDream aspect ratio, Stealth models on LM Arena


LMArena ▷ #announcements (1 messages):

AI Evaluation Product, Human-AI Interactions Analysis, Community Feedback Based Analytics


Cursor Community ▷ #general (393 messages🔥🔥):

Claude 4.0 lobotomy, GPT-5-Codex effort levels, Cursor's new MD file feature, Cursor website support tab disappearance, Agent stopping after first thinking


Cursor Community ▷ #background-agents (6 messages):

Linear Integration, Multi-Repo Issues, Sub-Issues Limitation, Background Agents Issues, Github Installations API Endpoint Failure


LM Studio ▷ #general (54 messages🔥):

GPS OSS 120B Prompting, LM Studio Model Loading Errors, llama.cpp Integration in LM Studio, External HDD Model Loading, LM Studio Config File Location (Linux)


LM Studio ▷ #hardware-discussion (124 messages🔥🔥):

CachyOS Installation, Hypervisors for LLMs, AMD Ryzen 8000G and Nvidia RTX, Monitor Recommendations, Qwen3-30B Performance Tuning


HuggingFace ▷ #general (148 messages🔥🔥):

LangGraph, HF Model Templates, DeepSite, LM Studio, Chat Templates


HuggingFace ▷ #today-im-learning (1 messages):

Model Architecture, Gibberish Output


HuggingFace ▷ #cool-finds (1 messages):

cakiki: <@1330871298686980109> Please don't cross-post, and keep channels on topic


HuggingFace ▷ #i-made-this (6 messages):

Gradio SSR Error, 3D RoPE, Satellite image analysis


HuggingFace ▷ #reading-group (2 messages):

AI Tools, Research Paper Reading, ChatGPT


HuggingFace ▷ #computer-vision (2 messages):

CV model controls Android, DINOv3 object detection model


HuggingFace ▷ #smol-course (3 messages):

vLLM, Accelerate


HuggingFace ▷ #agents-course (6 messages):

New members introduction, AI Engineers introductions, Learning partner requests, Hugging Face as go-to platform


OpenRouter ▷ #announcements (1 messages):

GPT-5, Native web search, Organization usage tracking, ZDR parameter


OpenRouter ▷ #general (145 messages🔥🔥):

Gemma-3-27B Model, OpenAI-compatible endpoint, ModelRun endpoint issues, Image generation models, OpenRouter rate limits


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (1 messages):

kyle42: Hmm, $0.08/$1.50 in/out if cached and under 32k context Otherwise, $0.12/$2.50


GPU MODE ▷ #general (35 messages🔥):

LBO/SBO Calculation for Shared Memory Matrix Descriptions, RoPE in 16-bit or Quantized RoPE, China bans Nvidia's AI chips, FPGA rental options


GPU MODE ▷ #triton (10 messages🔥):

Triton atomics overhead on Nvidia GPUs, Custom RMSNorm for LLM on NVIDIA B200, Gluon for memory access control, Triton kernel tuning


GPU MODE ▷ #cuda (14 messages🔥):

WGMA Support on SM120, Threadblock Clusters with mbarriers, Async Loading from GMEM to SMEM vs Registers, TCGEN05 Instructions, Consumer GPUs restricted to Ampere APIs


GPU MODE ▷ #torch (3 messages):

Gated Attention Instability, BF16 Training, Numerical Errors


GPU MODE ▷ #jobs (6 messages):

CUDA, Triton, xAI, OpenAI, Anthropic


GPU MODE ▷ #beginner (12 messages🔥):

GPU System Rpeak Performance, MPI vs NCCL vs NVSHMEM, CUDA-aware MPI, Stream-Aware MPI, Multi-GPU Computation


GPU MODE ▷ #off-topic (5 messages):

CUDA kernels, kalomaze on X, backward pass from scratch


GPU MODE ▷ #intel (1 messages):

erichallahan: https://www.phoronix.com/news/Intel-Compute-25.35.35096.9


GPU MODE ▷ #self-promotion (6 messages):

Slides link, Low bit training for video models, METR Study


GPU MODE ▷ #submissions (4 messages):

MI300x8, amd-all2all Leaderboard


GPU MODE ▷ #hardware (1 messages):

GPU Sponsorship, Grant programs for AI hardware


GPU MODE ▷ #factorio-learning-env (19 messages🔥):

FLE 0.3.0 Release, Claude's performance, Log Truncation, Sweeps Pricing


GPU MODE ▷ #amd-competition (4 messages):

NCCL group change to CPU, Evaluation with ROCm 6.4 or 7, Example of main() for amd-gemm-rs


GPU MODE ▷ #cutlass (5 messages):

CuTe Layouts, Row-major vs Column-major patterns in CuTe


GPU MODE ▷ #low-bit-training (2 messages):

SageAttention, 8-bit training


GPU MODE ▷ #irl-accel-hackathon (1 messages):

nvsharp enabled switches, GPU direct storage


Latent Space ▷ #ai-general-chat (88 messages🔥🔥):

XAI's Colossus 2 Datacenter, OpenCode Zen LLMs for coding, Gamma 3.0 AI Agent, Gumloop's No-Code AI Workflow Builder, MoonshotAI’s Checkpoint Engine


Latent Space ▷ #private-agents (4 messages):

Smart-TV Remote Mac Control, AI-written Swift build, Bluetooth profile install


Latent Space ▷ #genmedia-creative-ai (11 messages🔥):

Comfy Raises $17M Funding, AI-Generated Video Transitions, Seedream 4 for AI Influencers, Chinese LLMs Adoption


Moonshot AI (Kimi K-2) ▷ #general-chat (102 messages🔥🔥):

Kimi Deep Research, Z Chat Deep Research, Kimi K2 Pricing, Open Source Model Support, Kimi vs. Claude vs. ChatGPT


OpenAI ▷ #annnouncements (2 messages):

Apollo AI Scheming Research, GPT-5 Thinking Speed Control


OpenAI ▷ #ai-discussions (80 messages🔥🔥):

Flash 3.0 vs 2.5 Pro, Gemini deep research, Claude Google Drive Connector, Agent Mode sales, ChatGPT UI changes


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

GPT-7 release date, Browser chat loading performance, Chrome extension for chat lag, OAI reading chat


OpenAI ▷ #prompt-engineering (2 messages):

Two-Stage Process, Truthfulness and Accuracy


OpenAI ▷ #api-discussions (2 messages):

Prompt Injection, Truthfulness and Accuracy


DSPy ▷ #general (69 messages🔥🔥):

ARC-AGI leader, GPT 4.1 Models, Fallback Model, Keyboard shortcuts, Collating Personal Comms


Eleuther ▷ #general (50 messages🔥):

World Labs Demo, Compilation Performance in Large Data Execution, Privacy-Preserving ML for LLMs


Eleuther ▷ #research (7 messages):

Ethics-based Auditing of Generative AI Survey, Reinforcement Learning for Large Reasoning Models Survey, CLM with swiGLU Activation Function Training Issue, Pythia Model Training Dynamic Anomaly


Eleuther ▷ #interpretability-general (1 messages):

Model Calibration, Hallucination Dilemma, AI Welfare Risk


Nous Research AI ▷ #general (27 messages🔥):

Granite 4.0, LLM routers, small model supremacy, Tailwind CSS model, VaultGemma


Nous Research AI ▷ #ask-about-llms (13 messages🔥):

NPU Support for Inference, Character-Level Tokenizer vs. BPE Tokenizer Loss


Nous Research AI ▷ #research-papers (3 messages):

Sketch-based GNNs Research, Model Alignment's Influence on AI Interaction Dependency


Nous Research AI ▷ #interesting-links (5 messages):

Architectural Seeds, Server Joining Date


Nous Research AI ▷ #research-papers (3 messages):

Sketch-based GNNs, Vector Quantization, Model Alignment


MCP Contributors (Official) ▷ #general (9 messages🔥):

MCP server disconnection issues, auth token expiration, scope of Discord server, resourcetemplates use cases, persona primitive as part of the spec


MCP Contributors (Official) ▷ #general-wg (20 messages🔥):

Azure MCP Server, openWorld tool hint, tainted data, untrusted source, SQL Database