Frozen AI News archive

Figma's $50+b IPO

**OpenAI**'s stealth model **horizon-alpha** on **OpenRouter** sparks speculation as a precursor to **GPT-5**, showing strong reasoning and SVG generation capabilities, comparable to **Gemini 2.5 Pro**. **Alibaba** released the **Qwen3-Coder** family, including a fast **Qwen3-Coder-Flash (30B-A3B)** variant with agentic features and 1M context length support via **UnslothAI**. **Cohere** launched **Command A Vision**, a 111B parameter open-weights vision-language model outperforming **GPT-4.1** and **Llama 4 Maverick** on enterprise benchmarks. **Black Forest Labs** introduced **FLUX.1 Krea [dev]**, an open-weights photorealism model compatible with fine-tuning tools like **diffusers** and **ostrisai**. **Zhipu AI** unveiled **GLM-4.5**, a hybrid reasoning open model with agentic capabilities available on **Together AI**. Discussions highlight the rising importance of **inference-time training** and **reasoning model generalization**. **Mistral AI** released the technical report for **Voxtral**, continuing its open science efforts.


A happy outcome for a generational web platform.

AI News for 7/30/2025-7/31/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 5332 messages) for you. Estimated reading time saved (at 200wpm): 471 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

As you know, we try to keep things technical, but significant tech business stories do break through. A newly public software decacorn is likely both to solidify Figma's position as a web design platform (with some AI work) and to mint lots of millionaires who will fund the next wave and cycle of tech.


AI Twitter Recap

Model Releases, Updates, and Performance

AI Tooling, Frameworks, and Infrastructure

AI-Generated Media and Content

Industry, Funding, and Geopolitics

Broader Discourse and Developer Culture

Humor and Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-Coder-30B-A3B and Flash Model Announcements and Benchmarks

2. Chinese Open-Source AI Model Momentum and Global Rankings

3. Upcoming and Potential Benchmark Innovations: Deepseek ACL 2025

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI GPT-5 and Stealth Model Developments

2. Wan2.2 and Flux: New Model Releases and Benchmarks

3. Steampunk Video Game Concepts and Prompt Techniques


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1: Models Muscle Up with New Releases

Theme 2: Hardware Hustles for AI Speed

Theme 3: Censorship Clashes in Model Mayhem

Theme 4: Agents Arm Up with Security Shields

Theme 5: Education Explodes with Study Groups


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LMArena Discord


HuggingFace Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Latent Space Discord


Notebook LM Discord


LM Studio Discord


LlamaIndex Discord


GPU MODE Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


aider (Paul Gauthier) Discord


Yannic Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord


MCP (Glama) Discord


DSPy Discord


Nous Research AI Discord


Cohere Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1152 messages🔥🔥🔥):

R1 1776 Removal, Comet for Android, Gemini 2.5 Pro Speed, OpenRouter API for R1 1776, Daily Message Cap Limits


Perplexity AI ▷ #pplx-api (4 messages):

Enterprise API pricing, Deep Research API


Unsloth AI (Daniel Han) ▷ #general (670 messages🔥🔥🔥):

Hater Spams, GLM 4.5 Air, Qwen3-30B, OpenAI Censorship, Unsloth and Llama 3.1


Unsloth AI (Daniel Han) ▷ #introduce-yourself (6 messages):

Unsloth introduction, Low end-to-end latency in TTS voice cloning


Unsloth AI (Daniel Han) ▷ #off-topic (9 messages🔥):

Gemma 3 4B Fine-Tuning, Custom Tokens, Watermark Removal, RoPE FTW, Language Translation


Unsloth AI (Daniel Han) ▷ #help (64 messages🔥🔥):

Phi-4 Generate Flags Error, GGUF Conversion and Quantization of Fine-Tuned Models, Llama-CLI Performance Issues, RuntimeError in Google Colab, Unsloth BNB 4-bit Conversion


Unsloth AI (Daniel Han) ▷ #research (8 messages🔥):

Quantization optimization, Dynamic 4bit quantization, Hi-Fi Gan replacement, Autoregressive models, Mels dislike


Unsloth AI (Daniel Han) ▷ #unsloth-bot (102 messages🔥🔥):

GRPO trainer batch size, SFTTrainer validation error, Model fine-tuning parameters, Llama 3.2 data preparation, Gemma 3 fine-tuning


Cursor Community ▷ #general (475 messages🔥🔥🔥):

MCP Browser, Parallel Agent Tasks, VSCode Marketplace with Cursor, Automatic Scrolling, Sonnet Model


Cursor Community ▷ #background-agents (8 messages🔥):

Background Agent Commands, Docker Build Cache, Port Hijacking, Background Agents for Research


LMArena ▷ #general (413 messages🔥🔥🔥):

dot.lol data, GPT-5 Release, EU GDPR impact, Zenith Model Relaunch, Video Arena Channels


LMArena ▷ #announcements (1 message):

Video Arena, LMArena bot, Staff AMA


HuggingFace ▷ #general (290 messages🔥🔥):

HF Space restarts, P104-100 GPU, LLM Deployment, Qwen 30B, SmolLM3


HuggingFace ▷ #cool-finds (4 messages):

Muon Optimizer, Smithery


HuggingFace ▷ #i-made-this (5 messages):

Petite Elle Model, Gradio MBTI App, Video Editing MCP Server, Github Python Dataset


HuggingFace ▷ #reading-group (2 messages):

Diffusion Models, Flow Matching, MIT curriculum


HuggingFace ▷ #computer-vision (1 message):

hedi1421: Thanks 😅


HuggingFace ▷ #NLP (1 message):

Fixing transformers issue, DeepSpeed Integration


HuggingFace ▷ #smol-course (1 message):

DuckDuckGo deprecation, Smolagents merge


HuggingFace ▷ #agents-course (3 messages):

RAG System Construction, Tool Definition Problems in Unit 1


OpenAI ▷ #ai-discussions (261 messages🔥🔥):

Study and Learn feature, GPT-5, Copilot, Gemini vs ChatGPT, AI Ecosystems


OpenAI ▷ #gpt-4-discussions (24 messages🔥):

GPT-5 versions, O4 mini vs 4o, Missing Chat History, ChatGPT memory issues


OpenAI ▷ #prompt-engineering (10 messages🔥):

Personalized GPT setup, AI Memory Format, Optimized AI VM


OpenAI ▷ #api-discussions (10 messages🔥):

GPT project guidance, Personalized AI models, AI memory format


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

DeepTrail, DeepSecure, AI agent authorization, Agent delegation, Policy enforcement


OpenRouter (Alex Atallah) ▷ #general (152 messages🔥🔥):

NotebookLLM, OpenRouter Pricing, Blocking quants via API, Becoming a provider, API Key Issues


OpenRouter (Alex Atallah) ▷ #new-models (2 messages):



OpenRouter (Alex Atallah) ▷ #discussion (63 messages🔥🔥):

Quantized Providers, Groq's Quantization, Deepinfra Pricing, Vertex for Claude, OpenRouter's Ori bot


Latent Space ▷ #ai-general-chat (188 messages🔥🔥):

Metislist ranking, Arcee.ai Releases AFM-4.5B, NotebookLM video overviews, Claude abuse, ryOS release


Latent Space ▷ #ai-announcements (17 messages🔥):

Anthropic Fellows Papers, LLM Paper Club, Social Media Engagement


Notebook LM ▷ #use-cases (32 messages🔥):

Tableau Vizql NLP Orchestration, Gemini Agentic Framework Prototype, NotebookLM Podcast Creation, Obsidian and NotebookLM Integration, NotebookLM Usage Analytics


Notebook LM ▷ #general (156 messages🔥🔥):

NotebookLM Video Overview Limits, Studio UI Changes, Video Generation Length, NotebookLM Rollout, Missing the new Notebook LM Feature


LM Studio ▷ #general (117 messages🔥🔥):

LM Studio model renaming, Earthquake off the coast of Russia, LM Studio copy/paste conversation feature request, LM Studio Model stuck in time, Qwen 30B garbage output


LM Studio ▷ #hardware-discussion (64 messages🔥🔥):

GPU Usage, Strix Halo, Threadripper vs Epyc, Soldered RAM, 9070 XT Performance


LlamaIndex ▷ #blog (4 messages):

LlamaCloud document agents, LlamaCloud Managed Embeddings, Automated Asset Manager Fund Analysis, LexiconTrail agentic AI systems


LlamaIndex ▷ #general (126 messages🔥🔥):

LlamaCloud PDF detection issues, Character AI architecture, Neo4j Knowledge Graph issues, Flowmaker Gemini 2.5 Pro bug


LlamaIndex ▷ #ai-discussion (3 messages):

RAG debugging, Sparse retrieval, Semantic drift, Chunking collapse, Memory breakdowns


GPU MODE ▷ #general (5 messages):

Expert Parallelism (EP) vs Tensor Parallelism (TP), Merge Sort troubles on GitHub


GPU MODE ▷ #triton (17 messages🔥):

Torch Compile, Triton Code Generation, PTX Code Extraction, Inductor Configuration, GEMM Autotuning


GPU MODE ▷ #cuda (9 messages🔥):

livestream review, request accepted


GPU MODE ▷ #torch (2 messages):

CUPTI metrics in kineto, torch.profiler metrics


GPU MODE ▷ #beginner (6 messages):

CUDA streams, Megatron-LM, Group GEMM, NYC Hackathon, Beginner Hackathon Tips


GPU MODE ▷ #irl-meetup (1 message):

ali_8366: Anyone here from Montreal? Would love to have a coffee chat


GPU MODE ▷ #webgpu (1 message):

vishomaru: Hello, anybody here was successful in profiling compute shaders with AMD GPU Profiler?


GPU MODE ▷ #self-promotion (3 messages):

AI Hackathon, CuTeDSL Blogpost, Software Pipelining


GPU MODE ▷ #general-leaderboard (3 messages):

Popcorn-cli DeserializationError, BadCredentialsException on MI300, B200 Timeout Issues, Discord Run Errors


GPU MODE ▷ #factorio-learning-env (5 messages):

Benchmarking Explanation


GPU MODE ▷ #cutlass (20 messages🔥):

gmem synchthreads, cp.async.cg vs cp.async.ca, cutedsl ptx wrapper, nvvm wrapper, cutedsl older drivers


GPU MODE ▷ #multi-gpu (1 message):

Distributed Training, LLMs, Distributed memory tricks


Eleuther ▷ #general (38 messages🔥):

GPU Inference vs M3 Ultra, LLMs Offshore with low latency and bad internet, Topological data analysis experts, Speech-LLM models and audio instruction-following capabilities, Manipulating vector embeddings for machine translation


Eleuther ▷ #research (2 messages):

REST models, Compute cost


Eleuther ▷ #interpretability-general (17 messages🔥):

In-Context Learning (ICL), Interpretability Tools, Sparse Autoencoders (SAEs), Lucas Critique, Activation Distributions


Eleuther ▷ #lm-thunderdome (1 message):

Model Evaluation Metrics


Eleuther ▷ #multimodal-general (1 message):

Diffusion Models Study Group, Flow Matching, MIT Curriculum


Eleuther ▷ #gpt-neox-dev (4 messages):

MoE Implementation, grouped_mm, Low Precision Training, Float8 Training


Modular (Mojo 🔥) ▷ #general (7 messages):

CUDA generalization paper, TLS handshake EOF error, Mojo package installation, Region-specific access issues


Modular (Mojo 🔥) ▷ #mojo (41 messages🔥):

Mojo external calls vs libc, Mojo to Python overhead, Embedding CPython in Mojo binaries, Python performance, Mojo and hot loops


aider (Paul Gauthier) ▷ #general (38 messages🔥):

Aider site framework, Deepseek-chat OpenRouter Issues, SWE-bench Leaderboard, Aider's Role in Model Training, Qwen3 Coder 30B-A3B announcement


aider (Paul Gauthier) ▷ #questions-and-tips (3 messages):

Open Model Selection, Hardware Considerations for Aider, Runpod Credits, R1 Model, Qwen Coder Model


Yannic Kilcher ▷ #general (29 messages🔥):

LLM Safety Alignment Research, AI alignment blogs, CUDA with Claude, Z.AI 54 open source repos, Math in paper discussion


Yannic Kilcher ▷ #ml-news (1 message):

Qwen3, GPT-4o


Moonshot AI (Kimi K-2) ▷ #general-chat (30 messages🔥):

Kimi chatbot, Moonshot AI vibe, OpenHands, Training dataset of Kimi, Scale AI


Manus.im Discord ▷ #general (25 messages🔥):

Lume vs Suna, Manus' Comic Creation, The Future of Manus


MCP (Glama) ▷ #general (22 messages🔥):

MCP Server Security, BDD Testing with LLMs and MCP, Windows-MCP issues with CursorTouch and Claude, FastMCP tool selection, Hosted MCP server


MCP (Glama) ▷ #showcase (2 messages):

DeepTrail, DeepSecure, Open Source Auth, Delegation Layer for AI agents, Secure Multi-agent workflows


DSPy ▷ #general (18 messages🔥):

DSPy learnable parameters proposal, Signature implementation using f-strings, DSPy vs GEPA


Nous Research AI ▷ #general (15 messages🔥):

AMD vs Nvidia for gaming, Qwen coding model release, RLVR discussion


Cohere ▷ #🧵-general-thread (3 messages):

MRC model comparison, Summer school channel request, Senior Software Engineer remote job


Cohere ▷ #🔌-api-discussions (1 message):

Langchain-Cohere citation mode, langchain_cohere.ChatCohere


Cohere ▷ #👋-introduce-yourself (6 messages):

AI Safety, LLM Bias Mitigation, GPU Kernel Optimization


Torchtune ▷ #general (2 messages):

LoRA-style adapter in Torchtune, Merged weights in Torchtune


Torchtune ▷ #papers (2 messages):

ACL Paper Award, Glianorex finetunes


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Certificate Declaration Form


MLOps @Chipro ▷ #events (2 messages):

Diffusion Models Study Group, MIT Diffusion Models Curriculum, Flow Matching, Generative AI, AI Education


Nomic.ai (GPT4All) ▷ #general (1 message):

Deploying custom language models, Hugging Face deployment, GUI for user queries