Frozen AI News archive

not much happened today

**Chinese AI labs** have released powerful open-source models like **GLM-4.5** and **GLM-4.5-Air** from **Zhipu AI**, **Qwen3 Coder** and **Qwen3-235B** from **Alibaba**, and **Kimi K2** from **Moonshot AI**, highlighting a surge in permissively licensed models. **Zhipu AI's GLM-4.5** is a 355B parameter MoE model competitive with **Claude 4 Opus** and **Gemini 2.5 Pro**. **Alibaba's Qwen3 Coder** shows strong code generation performance with a low edit failure rate, while **Moonshot AI's Kimi K2** is a 1 trillion-parameter MoE model surpassing benchmarks like **LiveCodeBench**. In video and image generation, **xAI** launched **Grok Imagine**, and **Wan2.2** impressed with innovative image-to-video generation. Robotics advances include **Figure's Figure-01 and Figure-02** humanoid robots and **ViTPose++** for pose estimation in basketball analysis. **SmolLM3** training and evaluation code was fully released under Apache 2.0. **OpenAI** introduced **Study Mode** in **ChatGPT** to enhance interactive learning, and **Runway** rolled out **Runway Aleph**, a new in-context video model for multi-task visual generation. The community notes a competitive disadvantage for organizations avoiding these Chinese open-source models. *"Orgs avoiding these models are at a significant competitive disadvantage,"* noted by @corbtt.

Canonical issue URL

a quiet day.

AI News for 7/29/2025-7/30/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 5378 messages) for you. Estimated reading time saved (at 200wpm): 467 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A lot of hype about GPT5 releasing tomorrow due to random Twitter anon speculation.


AI Twitter Recap

Model Releases and Performance

AI Agents, Tooling & Applications

Infrastructure, Efficiency & Optimization

Research, Techniques & Evaluation

Industry & Broader Discourse

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-30B-A3B and Related Model Launches and Performance Discussion

2. GLM4.5 Model Launches, Benchmarks, and User Impressions

3. Meta Superintelligence Strategy and Community Reactions

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI GPT-5 Anticipation and Evidence

2. WAN 2.2 Animation Model Release and Community Tools

3. Anthropic Claude Feature Expansion and Community Usage


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-3-mini

Theme 1. New Model Releases and Comparisons

Theme 2. API and Integration Hurdles

Theme 3. Performance Optimization Tactics

Theme 4. Data Privacy and Security Debates

Theme 5. Community Education and Events


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LMArena Discord


HuggingFace Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Latent Space Discord


Notebook LM Discord


LM Studio Discord


LlamaIndex Discord


GPU MODE Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


aider (Paul Gauthier) Discord


Yannick Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


MCP (Glama) Discord


DSPy Discord


Nous Research AI Discord


Cohere Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1152 messages🔥🔥🔥):

R1 1776 Removal, Comet for Android, Gemini 2.5 Pro Speed, OpenRouter API for R1 1776, Daily Message Cap Limits


Perplexity AI ▷ #pplx-api (4 messages):

Enterprise API pricing, Deep Research API


Unsloth AI (Daniel Han) ▷ #general (670 messages🔥🔥🔥):

Hater Spams, GLM 4.5 Air, Qwen3-30B, OpenAI Censorship, Unsloth and Llama 3.1


Unsloth AI (Daniel Han) ▷ #introduce-yourself (6 messages):

Unsloth introduction, Low end-to-end latency in TTS voice cloning


Unsloth AI (Daniel Han) ▷ #off-topic (9 messages🔥):

Gemma 3 4B Fine-Tuning, Custom Tokens, Watermark Removal, RoPE FTW, Language Translation


Unsloth AI (Daniel Han) ▷ #help (64 messages🔥🔥):

Phi-4 Generate Flags Error, GGUF Conversion and Quantization of Fine-Tuned Models, Llama-CLI Performance Issues, RuntimeError in Google Colab, Unsloth BNB 4-bit Conversion


Unsloth AI (Daniel Han) ▷ #research (8 messages🔥):

Quantization optimization, Dynamic 4bit quantization, Hi-Fi Gan replacement, Autoregressive models, Mels dislike


Unsloth AI (Daniel Han) ▷ #unsloth-bot (102 messages🔥🔥):

GRPO trainer batch size, SFTrainer validation error, Model fine-tuning parameters, Llama 3.2 data preparation, Gemma 3 fine-tuning


Cursor Community ▷ #general (475 messages🔥🔥🔥):

MCP Browser, Parallel Agent Tasks, VSCode Marketplace with Cursor, Automatic Scrolling, Sonnet Model


Cursor Community ▷ #background-agents (8 messages🔥):

Background Agent Commands, Docker Build Cache, Port Hijacking, Background Agents for Research


LMArena ▷ #general (413 messages🔥🔥🔥):

dot.lol data, GPT-5 Release, EU GDPR impact, Zenith Model Relaunch, Video Arena Channels


LMArena ▷ #announcements (1 messages):

Video Arena, LMArena bot, Staff AMA


HuggingFace ▷ #general (290 messages🔥🔥):

HF Space restarts, P104-100 GPU, LLM Deployment, Qwen 30B, SmolLM3


HuggingFace ▷ #cool-finds (4 messages):

Muon Optimizer, Smithery


HuggingFace ▷ #i-made-this (5 messages):

Petite Elle Model, Gradio MBTI App, Video Editing MCP Server, Github Python Dataset


HuggingFace ▷ #reading-group (2 messages):

Diffusion Models, Flow Matching, MIT curriculum


HuggingFace ▷ #computer-vision (1 messages):

hedi1421: Thanks 😅


HuggingFace ▷ #NLP (1 messages):

Fixing transformers issue, DeepSpeed Integration


HuggingFace ▷ #smol-course (1 messages):

DuckDuckGo deprecation, Smolagents merge


HuggingFace ▷ #agents-course (3 messages):

RAG System Construction, Tool Definition Problems in Unit 1


OpenAI ▷ #ai-discussions (261 messages🔥🔥):

Study and Learn feature, GPT-5, Copilot, Gemini vs ChatGPT, AI Ecosystems


OpenAI ▷ #gpt-4-discussions (24 messages🔥):

GPT-5 versions, O4 mini vs 4o, Missing Chat History, ChatGPT memory issues


OpenAI ▷ #prompt-engineering (10 messages🔥):

Personalized GPT setup, AI Memory Format, Optimized AI VM


OpenAI ▷ #api-discussions (10 messages🔥):

GPT project guidance, Personalized AI models, AI memory format


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

DeepTrail, DeepSecure, AI agent authorization, Agent delegation, Policy enforcement


OpenRouter (Alex Atallah) ▷ #general (152 messages🔥🔥):

NotebookLLM, OpenRouter Pricing, Blocking quants via API, Becoming a provider, API Key Issues


OpenRouter (Alex Atallah) ▷ #new-models (2 messages):

``


OpenRouter (Alex Atallah) ▷ #discussion (63 messages🔥🔥):

Quantized Providers, Groq's Quantization, Deepinfra Pricing, Vertex for Claude, OpenRouter's Ori bot


Latent Space ▷ #ai-general-chat (188 messages🔥🔥):

Metislist ranking, Arcee.ai Releases AFM-4.5B, NotebookLM video overviews, Claude abuse, ryOS release


Latent Space ▷ #ai-announcements (17 messages🔥):

Anthropic Fellows Papers, LLM Paper Club, Social Media Engagement


Notebook LM ▷ #use-cases (32 messages🔥):

Tableau Vizql NLP Orchestration, Gemini Agentic Framework Prototype, NotebookLM Podcast Creation, Obsidian and NotebookLM Integration, NotebookLM Usage Analytics


Notebook LM ▷ #general (156 messages🔥🔥):

NotebookLM Video Overview Limits, Studio UI Changes, Video Generation Length, NotebookLM Rollout, Missing the new Notebook LM Feature


LM Studio ▷ #general (117 messages🔥🔥):

LM Studio model renaming, Earthquake off the coast of Russia, LM Studio copy/paste conversation feature request, LM Studio Model stuck in time, Qwen 30B garbage output


LM Studio ▷ #hardware-discussion (64 messages🔥🔥):

GPU Usage, Strix Halo, Threadripper vs Epyc, Soldered RAM, 9070 XT Performance


LlamaIndex ▷ #blog (4 messages):

LlamaCloud document agents, LlamaCloud Managed Embeddings, Automated Asset Manager Fund Analysis, LexiconTrail agentic AI systems


LlamaIndex ▷ #general (126 messages🔥🔥):

LlamaCloud PDF detection issues, Character AI architecture, Neo4j Knowledge Graph issues, Flowmaker Gemini 2.5 Pro bug


LlamaIndex ▷ #ai-discussion (3 messages):

RAG debugging, Sparse retrieval, Semantic drift, Chunking collapse, Memory breakdowns


GPU MODE ▷ #general (5 messages):

Expert Parallelism (EP) vs Tensor Parallelism (TP), Merge Sort troubles on GitHub


GPU MODE ▷ #triton (17 messages🔥):

Torch Compile, Triton Code Generation, PTX Code Extraction, Inductor Configuration, GEMM Autotuning


GPU MODE ▷ #cuda (9 messages🔥):

livestream review, request accepted


GPU MODE ▷ #torch (2 messages):

CUPTI metrics in kineto, torch.profiler metrics


GPU MODE ▷ #beginner (6 messages):

CUDA streams, Megatron-LM, Group GEMM, NYC Hackathon, Beginner Hackathon Tips


GPU MODE ▷ #irl-meetup (1 messages):

ali_8366: Anyone here from Montreal? Would love to have a coffee chat


GPU MODE ▷ #webgpu (1 messages):

vishomaru: Hello, anybody here was successful in profiling compute shaders with AMD GPU Profiler?


GPU MODE ▷ #self-promotion (3 messages):

AI Hackathon, CuTeDSL Blogpost, Software Pipelining


GPU MODE ▷ #general-leaderboard (3 messages):

Popcorn-cli DeserializationError, BadCredentialsException on MI300, B200 Timeout Issues, Discord Run Errors


GPU MODE ▷ #factorio-learning-env (5 messages):

Benchmarking Explanation


GPU MODE ▷ #cutlass (20 messages🔥):

gmem synchthreads, cp.async.cg vs cp.async.ca, cutedsl ptx wrapper, nvvm wrapper, cutedsl older drivers


GPU MODE ▷ #multi-gpu (1 messages):

Distributed Training, LLMs, Distributed memory tricks


Eleuther ▷ #general (38 messages🔥):

GPU Inference vs M3 Ultra, LLMs Offshore with low latency and bad internet, Topological data analysis experts, Speech-LLM models and audio instruction-following capabilities, Manipulating vector embeddings for machine translation


Eleuther ▷ #research (2 messages):

REST models, Compute cost


Eleuther ▷ #interpretability-general (17 messages🔥):

In-Context Learning (ICL), Interpretability Tools, Sparse Autoencoders (SAEs), Lucas Critique, Activation Distributions


Eleuther ▷ #lm-thunderdome (1 messages):

Model Evaluation Metrics


Eleuther ▷ #multimodal-general (1 messages):

Diffusion Models Study Group, Flow Matching, MIT Curriculum


Eleuther ▷ #gpt-neox-dev (4 messages):

MoE Implementation, grouped_mm, Low Precision Training, Float8 Training


Modular (Mojo 🔥) ▷ #general (7 messages):

CUDA generalization paper, TLS handshake EOF error, Mojo package installation, Region-specific access issues


Modular (Mojo 🔥) ▷ #mojo (41 messages🔥):

Mojo external calls vs libc, Mojo to Python overhead, Embedding CPython in Mojo binaries, Python performance, Mojo and hot loops


aider (Paul Gauthier) ▷ #general (38 messages🔥):

Aider site framework, Deepseek-chat OpenRouter Issues, SWE-bench Leaderboard, Aider's Role in Model Training, Qwen3 Coder 30B-A3B announcement


aider (Paul Gauthier) ▷ #questions-and-tips (3 messages):

Open Model Selection, Hardware Considerations for Aider, Runpod Credits, R1 Model, Qwen Coder Model


Yannick Kilcher ▷ #general (29 messages🔥):

LLM Safety Alignment Research, AI alignment blogs, CUDA with Claude, Z.AI 54 open source repos, Math in paper discussion


Yannick Kilcher ▷ #ml-news (1 messages):

Qwen3, GPT-4o


Moonshot AI (Kimi K-2) ▷ #general-chat (30 messages🔥):

Kimi chatbot, Moonshot AI vibe, OpenHands, Training dataset of Kimi, Scale AI


Manus.im Discord ▷ #general (25 messages🔥):

Lume vs Suna, Manus' Comic Creation, The Future of Manus


MCP (Glama) ▷ #general (22 messages🔥):

MCP Server Security, BDD Testing with LLMs and MCP, Windows-MCP issues with CursorTouch and Claude, FastMCP tool selection, Hosted MCP server


MCP (Glama) ▷ #showcase (2 messages):

DeepTrail, Deepsecure, Open Source Auth, Delegation Layer for AI agents, Secure Multi-agent workflows


DSPy ▷ #general (18 messages🔥):

DSPy learnable parameters proposal, Signature implementation using f-strings, DSPy vs GEPA


Nous Research AI ▷ #general (15 messages🔥):

AMD vs Nvidia for gaming, Qwen coding model release, RLVR discussion


Cohere ▷ #🧵-general-thread (3 messages):

MRC model comparison, Summer school channel request, Senior Software Engineer remote job


Cohere ▷ #🔌-api-discussions (1 messages):

Langchain-Cohere citation mode, langchain_cohere.ChatCohere


Cohere ▷ #👋-introduce-yourself (6 messages):

AI Safety, LLM Bias Mitigation, GPU Kernel Optimization


Torchtune ▷ #general (2 messages):

LoRA-style adapter in Torchtune, Merged weights in Torchtune


Torchtune ▷ #papers (2 messages):

ACL Paper Award, Glianorex finetunes


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Certificate Declaration Form


MLOps @Chipro ▷ #events (2 messages):

Diffusion Models Study Group, MIT Diffusion Models Curriculum, Flow Matching, Generative AI, AI Education


Nomic.ai (GPT4All) ▷ #general (1 messages):

Deploying custom language models, Hugging Face deployment, GUI for user queries