Frozen AI News archive

not much happened today

**GPT-5 Codex** rollout shows strong agentic coding capabilities with some token bloat issues. IDEs like **VS Code Insiders** and **Cursor 1.6** enhance context windows and model integration. **vLLM 0.10.2** supports aarch64 and NVIDIA GB200 with performance improvements. **AMD ROCm** updates add modern attention, sparse MoE, and distributed inference. **TRL** introduces Context Parallelism for long-context training. Robotics and RL data pipelines improve with **Unsloth** and **LeRobotDataset v3**. **Qwen3-Next-80B** runs efficiently on Mac M4 Max with MLX. **Tencent's HunyuanImage 2.1** is a 17B bilingual text-to-image model with 2048×2048 resolution and restricted open weights.

Canonical issue URL

a quiet day

AI News for 9/15/2025-9/16/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (192 channels, and 3874 messages) for you. Estimated reading time saved (at 200wpm): 367 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A major resolution for Tiktok's US business, which is somewhat AI impacting but mostly business news.


AI Twitter Recap

Agentic coding and IDEs: GPT‑5 Codex rollout, IDE context, MCP everywhere

Inference and training infra: vLLM on aarch64/GB200, ROCm update, CP in TRL, Mac MLX speed

New models, agents, and spatial intelligence

Autonomy and robotics

Benchmarks, evals, and retrieval tooling

Policy and safety moves

Top tweets (by engagement)

Notes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Local AI Compute: Modded 4090 and Qwen3-Next-80B MLX Benchmarks

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI ChatGPT Usage Study and Use-Case Breakdown (700M users)

2. OpenAI Agentic Coding: Codex/GPT‑5 Breakthrough Claims and Insider Reports

3. AI Tool Updates: Qwen Pose Transfer V2 LoRA and Claude Code ‘Think Mode’ UI


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

1. New Models & Tools Hit the Streets

2. Performance & Optimization Debates

3. AI Development & Agentic Workflows

4. AI Benchmarking & Evaluation Under Fire

5. NSFW AI & Peculiar Projects Capture Attention


Discord: High level Discord summaries

Perplexity AI Discord


HuggingFace Discord


LMArena Discord


OpenRouter Discord


Eleuther Discord


OpenAI Discord


Cursor Community Discord


GPU MODE Discord


Latent Space Discord


LM Studio Discord


Nous Research AI Discord


DSPy Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Manus.im Discord Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Perplexity Pro Connectors, Email integration, Calendar integration, Notion integration, Github integration


Perplexity AI ▷ #general (896 messages🔥🔥🔥):

Vape Server, Multi-Model AI Orchestration, Perplexity Finance on iOS, Comet Browser for Android, Jobs and Auto Apply


Perplexity AI ▷ #sharing (3 messages):

Shareable Threads


Perplexity AI ▷ #pplx-api (2 messages):

API vs Web UI Citation Discrepancies, Sonar-Pro Web Search Accuracy


HuggingFace ▷ #general (368 messages🔥🔥):

FinePDFs Dataset, RAG accuracy, Random Token Masking, Clanker Detector LLM, AI Research Startup


HuggingFace ▷ #today-im-learning (2 messages):

Transformers architecture, Agent Course Access


HuggingFace ▷ #i-made-this (3 messages):

Android OS control model, Swiftide 0.31, Reddit Content Bot


HuggingFace ▷ #reading-group (1 messages):

Code Visualization, AI-assisted Blog Writing, Dynamic Graph Neural Networks


HuggingFace ▷ #smol-course (16 messages🔥):

smol fine tuning course, lighteval on Colab T4, integrating the translations from v1, older versions of vllm and triton


HuggingFace ▷ #agents-course (1 messages):

kong9646: hello.... working through unit one here....:)


LMArena ▷ #general (373 messages🔥🔥):

LM Arena Web/Apps Issues, Image Size on LM Arena, Monetization Concerns for LMArena, Side-by-Side Image Editing, Gemma Vault Speculation


LMArena ▷ #announcements (4 messages):

Battle, Side by side, Direct - Why?, August Contest Update, Text-to-Image & Image Edit Leaderboards Updated, AI Eval Product Update


OpenRouter ▷ #announcements (1 messages):

grok-2 deprecation, grok-3 release, grok-4 release


OpenRouter ▷ #general (287 messages🔥🔥):

Co-op Gooning with AI, NSFW bot development, OpenRouter Presets for Pre-Prompts, Gemma-3-27B Model API, AI Sex Dolls Implications


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (21 messages🔥):

Gemini 3 Pro vs 2.5 Flash, 2.5 Pro checkpoins changing, Google Expectations


Eleuther ▷ #general (267 messages🔥🔥):

RoPE intuition, Limitations of ML, Jianlin Su's blog posts, LLMs help avoid work, Hardware for LLM Experiments


Eleuther ▷ #research (23 messages🔥):

LM Eval Discrepancies, Good CLM Training Examples, Hallucination Prediction, ARC AGI 2


Eleuther ▷ #lm-thunderdome (8 messages🔥):

CLM Training Frameworks, MosaicML Composer, Dataset inference comparison


OpenAI ▷ #annnouncements (2 messages):

Codex CLI, GPT-5-Codex, agentic coding, IDE Extension, Github code reviews


OpenAI ▷ #ai-discussions (125 messages🔥🔥):

Codex web git commits and linters, Bachelors degree for AI Masters, AI and data science job market saturation, LLMs for Burmese language, GPT-5-Codex release


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

Swagger Schemas with Fastify, Custom GPT Stacking Bug, GPT-7 Release, GPT Weekly Limits


OpenAI ▷ #prompt-engineering (60 messages🔥🔥):

Chatbox character limit, LLM limitations, Prompt engineering techniques, Generating human-like speech


OpenAI ▷ #api-discussions (60 messages🔥🔥):

Chatbox Character Limit, LLM Limitations, Discordianism and AI, Positive Framing Prompts, Generating Humanisms in AI Speech


Cursor Community ▷ #general (219 messages🔥🔥):

Model Switching in Queued Messages, Auto Model Selection, Cursor's token usage and cost, Codex vs Claude Code, Cursor's Rules


Cursor Community ▷ #background-agents (3 messages):

Custom Branch and PR Naming, Linear Integration Challenges, Multi-Repo Issues, Sub-Issue Limitations, Agent Detachment Workaround


GPU MODE ▷ #general (11 messages🔥):

GB200/GB300 Availability on Coreweave, PruneAI Talk, LBO/SBO Calculation for Shared Memory Matrix


GPU MODE ▷ #triton (2 messages):

Triton Block Size Calibration, Nvidia GPU Atomics Overhead


GPU MODE ▷ #cuda (10 messages🔥):

P2P Memory Access, Symmetric Memory, wgmma on sm120 (consumer blackwell), mbarriers in threadblock clusters


GPU MODE ▷ #torch (12 messages🔥):

torch.compile schema, mutation annotations, return tuples, float vs double, tensor types


GPU MODE ▷ #beginner (6 messages):

H100 Performance, TFLOPS variance, Matrix Multiplication, Architectural Rpeak


GPU MODE ▷ #torchao (3 messages):

autoquant_v2, batch size 1


GPU MODE ▷ #off-topic (15 messages🔥):

CUDA debugging, Darksynthwave, PrimeIntellect, BackendBench, Performant CUDA kernels


GPU MODE ▷ #rocm (12 messages🔥):

Iris Memory Management, ROCm 7.0, tl.load vs iris.load, Kernel timeout errors


GPU MODE ▷ #intel (4 messages):

IPEX Deprecation, PyTorch Upstreaming, Intel Optimization Strategy


GPU MODE ▷ #metal (1 messages):

Metal command buffer timeout


GPU MODE ▷ #self-promotion (2 messages):

Attention Variants, MLA Explained, Quantization Survey


GPU MODE ▷ #submissions (20 messages🔥):

A100 performance, MI300x8 performance, Profiling errors, HIP/ASM perf


GPU MODE ▷ #factorio-learning-env (18 messages🔥):

Lua changes, Frontier model sweeps, Claude sicko mode, Error in GetEntities, Stray log line fixed


GPU MODE ▷ #amd-competition (19 messages🔥):

A2A kernel rules, Dispatch and Combine Kernels, Intra-node communication, GEMM + RS Kernel Rules, Simulated MOE and Combine Kernels


GPU MODE ▷ #cutlass (1 messages):

drazi1983: Welcome. Thanks for asking. And really nice diagrams!


GPU MODE ▷ #singularity-systems (27 messages🔥):

picograd progress, sitp updates, jupyter notebook in rust mdbook, heterogenous programming, CUDA vs HIP


GPU MODE ▷ #general (1 messages):

BioML trimul kernel competition, GPUMODE swag


GPU MODE ▷ #low-bit-training (1 messages):

Mobicham's LLM work, DiT, LLM Training, Quartet


GPU MODE ▷ #irl-accel-hackathon (1 messages):

Low-Bit-Training for Video Models, GitHub Project for Video Model Training


Latent Space ▷ #ai-general-chat (97 messages🔥🔥):

Mercor, SWEBench, Cursor's Bugbot, OpenCode Zen, Gamma 3.0


Latent Space ▷ #genmedia-creative-ai (9 messages🔥):

Nano Banana prompt, Bytedance video model, HeyGen rebrand, Video Agent Public Beta, Alisa Acquisition


LM Studio ▷ #general (72 messages🔥🔥):

Abliterated Models & Censorship, LM Studio Version Confusion, Model Generation Speed, Qwen3-Next-80B on LM Studio, VRAM Rule of Thumb Clarification


LM Studio ▷ #hardware-discussion (24 messages🔥):

Personal Cloud with Nextcloud, VPN Meshnet for Cloud Gaming, Setting up Qdrant Vector Database, Ryzen AI MAX 395 performance with qwen3-coder-30b, MacOS Sequoia memory usage for 70B models


Nous Research AI ▷ #general (83 messages🔥🔥):

XML for agentic coding, MI50 GPUs, AMD RDNA5, Codex vs Claude for coding, LLM Routers and small model supremacy


Nous Research AI ▷ #research-papers (2 messages):

AI Boyfriend Relationships, Sketch-Based GNNs


Nous Research AI ▷ #research-papers (2 messages):

AI Boyfriend research, Sketch-based GNNs


DSPy ▷ #show-and-tell (4 messages):

Tau Bench results with fastWorkflow and DSPy, VoxCron tool launch, GEPA for workflow optimization


DSPy ▷ #general (65 messages🔥🔥):

DSPy use cases, LM inference based on client, Refining topic matching in DSPy, Feeding classifier a list of topics, Arc-AGI leader prompt-optimization


Modular (Mojo 🔥) ▷ #general (15 messages🔥):

Mojo/Max and Python 3.13 Compatibility, Apple Metal Support Early Stages, MI355X Support in Nightly Version, Pixi Package Manager Benefits


Modular (Mojo 🔥) ▷ #mojo (17 messages🔥):

Allocator API, Parametric traits and requires, Mojo LSP rework, Networking update blockers, Compiler bug on Mac with mojo test


Yannick Kilcher ▷ #general (7 messages):

LLMs are Bayesian, Catastrophic forgetting, Online vs Batch Learning


Yannick Kilcher ▷ #paper-discussion (8 messages🔥):

Yellow Line Smoothness, VaultGemma Released


Yannick Kilcher ▷ #agents (1 messages):

Anthropic MCT Tools API, LLMs vs ARC Tool Use


Yannick Kilcher ▷ #ml-news (8 messages🔥):

AI-powered PDF Editor, Agents to Payments Protocol, Arc Prize results


aider (Paul Gauthier) ▷ #general (9 messages🔥):

GPT-5 Codex, aider's chat-mode, aider's architect mode, code mode


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

Gemini issues, Ollama endless loop, Architect Mode


tinygrad (George Hotz) ▷ #general (9 messages🔥):

Simplicity vs Elegance, MI350 Kernel Benchmarks, MLIR-based compiler


Manus.im Discord ▷ #general (6 messages):

Knowledge Limit, Credit Rollover, AI Loop & Refund


MCP Contributors (Official) ▷ #general (1 messages):

golang streaming http MCP server, Scalability of MCP server, Auth in MCP server, Sessions and resumability in MCP server, Dynamic capabilities in MCP server