Frozen AI News archive

not much happened today

**Exa** raised a **$700m Series B**, **OpenPipe** was acquired by **Coreweave**, and **Statsig** and **Alex** were acquired by **OpenAI**. The **Agent/Client Protocol (ACP)** was introduced by the **Zed** team to standardize IDE-agent interoperability, supporting **Claude Code** and **Gemini** CLIs. **LangChain 1.0 alpha** unifies content blocks for reasoning and multimodal data. The **OSWorld Verified leaderboard** promotes reproducible evaluation of computer-use agents including **OpenAI** and **Anthropic** models. FAIR revealed coding agent cheating on **SWE-Bench Verified**. **PR Arena** hosts live coding agent competitions. Benchmarks like **GSO** and **Holistic Agent Leaderboard** test software optimization and web browsing tasks, with **Qwen3-Coder** and **Gemini 2.5 Flash** showing strong performance. Advances in reinforcement learning for tool use include **SimpleTIR** improving multi-turn tool use success rates and **UI-TARS-2** advancing GUI agents. The **DARLING** optimizer improves quality and diversity in reasoning and instruction following, while **DEPO** achieves data-efficient RLVR with significant speedups.

Canonical issue URL

a quiet day

AI News for 9/3/2025-9/4/2025. We checked 12 subreddits, 544 Twitters and 22 Discords (186 channels, and 4795 messages) for you. Estimated reading time saved (at 200wpm): 410 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

a quiet day. Congrats on Exa's $700m Series B and OpenPipe's acquisition by Coreweave and Statsig and Alex's acquisition by OpenAI.


AI Twitter Recap

Agent infra standardization and protocols

Agent evaluations, coding, and computer-use

RL for tool use and LLM training, plus optimizer insights

Systems, inference, and tooling

Models and multimodal tooling

Safety, robustness, and reasoning research

Funding, products, and adoption signals

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Kimi K2 Launch and LLM Benchmark Leaderboards

2. GPU Hardware: Intel Arc Pro B50 and 4x3090 vs RTX 6000

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Gemini 3 Pretraining Success + Tesla Optimus 3 First Photo/Video

2. OpenAI Parental Controls/Privacy & UX Backlash + Salesforce AI Layoffs

3. AI Video/Image Editing Workflows & Showcases: nano banana, Wan 2.2, Qwen, Local SD


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Reasoning Benchmarks and Open Models

2. Kernel Kung Fu and Low-Bit Training

3. Agentic Patterns, Program Synthesis, and Eval Infra

4. Student and Builder Tools Shipping

5. Search and Deep Research: Fast, Cheap, and Funded


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


Cursor Community Discord


OpenRouter Discord


LM Studio Discord


Eleuther Discord


GPU MODE Discord


HuggingFace Discord


Yannick Kilcher Discord


Latent Space Discord


Nous Research AI Discord


DSPy Discord


Moonshot AI (Kimi K-2) Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Comet for students


Perplexity AI ▷ #general (1157 messages🔥🔥🔥):

Perplexity Pro, GPT-5, Comet Browser, Filter Overreach


Perplexity AI ▷ #sharing (3 messages):

Perplexity Browser Claims


Perplexity AI ▷ #pplx-api (1 messages):

breakingclover: I’m interested, I tried to send you a message but it looks like you have them off!


LMArena ▷ #general (895 messages🔥🔥🔥):

Gemini 3 Hype, LM Arena Login System, LM Arena staying open, LM Arena Site Outage, LM Arena FAQ


LMArena ▷ #announcements (1 messages):

Video Generation Contest


OpenAI ▷ #annnouncements (1 messages):

ChatGPT Free Projects, Larger File Uploads, Project Customization


OpenAI ▷ #ai-discussions (434 messages🔥🔥🔥):

AI Residency Program, Nano Banana in Photoshop, AI Car Game with ChatGPT, Cognitive Mesh AI Design, Liquid Neural Networks


OpenAI ▷ #prompt-engineering (36 messages🔥):

Prompt Leaking, API discussion location, Context Contamination, Custom GPTs Reliability, Prompt Priority Level


OpenAI ▷ #api-discussions (36 messages🔥):

Anti-Prompt Leaks, GPT Reliability, Prompt Engineering, Agent Instructions, Model Temperatue


Cursor Community ▷ #general (284 messages🔥🔥):

Anthropic API, Cursor Auto model quality, Cursor chat history loss, Fine-tuning recommendations for chatbot, Gemini 2.5 Reasoning


Cursor Community ▷ #background-agents (2 messages):

Background agent transfer, API state transfer summary


OpenRouter ▷ #app-showcase (2 messages):

Nano Banana Discord Bot, vibe coded bot


OpenRouter ▷ #general (230 messages🔥🔥):

DeepSeek models, Claude-sonnet-4 problems, DeepSeek 3.1 cheapest price, Submodel.ai promotion, OpenRouter billing questions


OpenRouter ▷ #discussion (5 messages):

Google Antitrust, Yahoo Chrome, Minor UI suggestion


LM Studio ▷ #general (156 messages🔥🔥):

LM Studio 4k context, LM Studio Nightly, AI Girlfriend goon chamber, LM Studio auto naming stories, Granite 4 memory


LM Studio ▷ #hardware-discussion (77 messages🔥🔥):

Limiting GPU Power Draw, DDR4 Server for LLMs, GPU Recommendations (3060, Titan Xp, A4000), MoE offload for Qwen3-235B, Multi-GPU bandwidth with dual CPUs


Eleuther ▷ #general (14 messages🔥):

3D modeling, Latent knowledge in large foundation models, SPAR applications


Eleuther ▷ #research (128 messages🔥🔥):

Normalizing Flows, CoT/RL limitations, Diffusion Models, Adaptive Computation, Fixed-Point Problems


Eleuther ▷ #lm-thunderdome (34 messages🔥):

LM Eval Harness, MMLU Task Configuration, Steering Vectors Implementation, Attention Heads Steering, Model's Response Recording


GPU MODE ▷ #general (21 messages🔥):

PyTorch Conference, CUDA data pipelines, MLPerf benchmarks, Hardware optimizations, NVIDIA Hopper Architecture


GPU MODE ▷ #triton (3 messages):

Microsoft Teams Meeting, Meeting Details


GPU MODE ▷ #cuda (5 messages):

Intra-device Parallelization, CUDA-level FSDP, Register and Shared Memory Usage, NVCC Half-Precision Optimization


GPU MODE ▷ #torch (12 messages🔥):

Torch.compile and Triton Kernels, Kernel Fusion with Torch.compile, Triton OP Registration


GPU MODE ▷ #jobs (1 messages):

Sony Computer Vision Job Posting


GPU MODE ▷ #torchao (20 messages🔥):

TorchAO Installation, cu128 image, torch2.8, MXFP8 Training


GPU MODE ▷ #irl-meetup (1 messages):

Project Contributions


GPU MODE ▷ #intel (1 messages):

Gaudi 2, Gaudi performance


GPU MODE ▷ #self-promotion (6 messages):

AI-Generated Metal Kernels, PyTorch to Low-Level Kernels, MPS Eager & torch.compile Backend, Kernel LLM Generation, BackendBench Correctness Checking


GPU MODE ▷ #thunderkittens (1 messages):

B200 attention kernel


GPU MODE ▷ #submissions (11 messages🔥):

MI300x8, amd-all2all leaderboard


GPU MODE ▷ #factorio-learning-env (7 messages):

Game Feedback, Request for Guidance


GPU MODE ▷ #amd-competition (17 messages🔥):

Submitting HIP Kernels, iris library, UI exit code info


GPU MODE ▷ #cutlass (4 messages):

CUTLASS 2.x interfaces, Hopper F8 performance, kind::mxf4nvf4.block_scale vs kind::mxf8f6f4.block_scale, Github bug reports


GPU MODE ▷ #multi-gpu (20 messages🔥):

Multi-GPU Development, Distributed Kernels, AMD Challenge 2025, NVLink vs PCIe, Fused Kernels


GPU MODE ▷ #low-bit-training (1 messages):

MXFP8 pre-training, TorchAO MXFP8, Crusoe B200 Cluster


HuggingFace ▷ #general (93 messages🔥🔥):

Deepseek API, Llama 3.2 vision 11B on Macbook Pro M4, Quantized Models, HF Spaces, Python Learning


HuggingFace ▷ #today-im-learning (3 messages):

link request, language studies


HuggingFace ▷ #i-made-this (2 messages):

Datatune Agents release, DeepResearch AI Agents, Token Optimization


HuggingFace ▷ #computer-vision (2 messages):

Detectron2 setup, Automated test cases


HuggingFace ▷ #NLP (1 messages):

cakiki: <@596574356327628850> please don't cross-post


Yannick Kilcher ▷ #general (66 messages🔥🔥):

Automated weapons, Public transportation in the US, Drones as a cheap attacking option, DeepMind's potential with massive funding, Quantum Physics of AGI


Yannick Kilcher ▷ #paper-discussion (16 messages🔥):

Mamba Weakness, Online learning, Neuromorphic architecture, Bad ML learning resources


Yannick Kilcher ▷ #ml-news (3 messages):

Ladybird Browser, FOSS browser alternative to Chrome


Latent Space ▷ #ai-general-chat (59 messages🔥🔥):

Agentic Design Patterns, Claude Code, AI Compute Arms Race, Open Source Deep Research Agent, Exa Series B


Latent Space ▷ #genmedia-creative-ai (5 messages):

AI-generated worlds, Immersive storytelling, Higgsfield platform, Future of Sci-Fi


Nous Research AI ▷ #announcements (1 messages):

Husky Hold’em Bench, OS pokerbots eval, Claude 4 Sonnet, Hermes 4 405B


Nous Research AI ▷ #general (37 messages🔥):

Hermes 4 vs other models, SillyTavern and Hermes, Prompt Compliance, LLM game benchmarks


Nous Research AI ▷ #research-papers (1 messages):

Generative UI, AI-First Interaction Patterns, Transformation Design, Business Applications


Nous Research AI ▷ #research-papers (1 messages):

Generative UI Research, AI-First Interaction Patterns, Transformation Design Case Study, Business Impact of Generative UI


DSPy ▷ #general (34 messages🔥):

DSPy Data Leakage Concerns, DSPy Data Splits Clarification (train/val/dev/test), MLflow Integration with DSPy, Context Compression Experiments, Improving dspy.Image Reliability


DSPy ▷ #examples (2 messages):

DSPy for Tax Automation, Amazon Purchase Extraction


Moonshot AI (Kimi K-2) ▷ #announcements (1 messages):

Kimi Voucher Giveaway, New Kimi Coding Model, Scammer Alert


Moonshot AI (Kimi K-2) ▷ #general-chat (31 messages🔥):

Kimi K2 Model performance, Slide Generation feature, Kimi K2 turbo Coder Pro plan, Model releases for VRAM


Modular (Mojo 🔥) ▷ #mojo (5 messages):

Mojo SIMD, Rust AVX, Standard Library net/http module, community driven HTTP library, lightbug_http


Modular (Mojo 🔥) ▷ #max (13 messages🔥):

Mojo GPU on GTX 1080, Max backend for Torch, Turing minimum architecture, Reaching out to Modular team


Manus.im Discord ▷ #general (7 messages):

Website deployment on basic plan, Grok as a Tool vs. Agent, Comparison of Grok and Manus


tinygrad (George Hotz) ▷ #general (6 messages):

GPU Crash with HIPC Code, Multi-GPU Training Distress, Pyrender Testing, Uops Test Updates


aider (Paul Gauthier) ▷ #general (4 messages):

Codex vs. Aider, OpenAI KYC, GPT-5 Streaming