Frozen AI News archive

not much happened today

**Chinese labs** have released a wave of powerful, permissively licensed models in July, including **Zhipu AI's GLM-4.5** and **GLM-4.5-Air**, **Alibaba's Qwen3 Coder** and **Qwen3-235B**, and **Moonshot AI's Kimi K2**. These models feature large-scale Mixture of Experts architectures with active parameters ranging from 3B to 32B and context windows up to 256K tokens. **Zhipu AI's GLM-4.5** competes with **Claude 4 Opus** and **Gemini 2.5 Pro** in benchmarks. **Moonshot AI's Kimi K2** is a 1 trillion-parameter MoE model surpassing other open-weight models on **LiveCodeBench** and **AceBench**. In video and image generation, **xAI** launched **Grok Imagine**, and **Wan2.2** impressed with its Image-to-Video approach. **Ideogram** released a character consistency model. Robotics advances include **Figure's Figure-01 and Figure-02** humanoid robots and **ViTPose++** for pose estimation in basketball analysis. The **SmolLM3** training and evaluation code was fully released under an Apache 2.0 license. *"Orgs avoiding these Chinese open-source models are at a significant competitive disadvantage,"* noted by @corbtt.

Canonical issue URL

a quiet day.

AI News for 7/28/2025-7/29/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 6913 messages) for you. Estimated reading time saved (at 200wpm): 556 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

In the absence of major news, you might want to check out the Search and Retrieval track which is now fully released, of which the most popular talk so far has been Jerry Liu's talk on Knowledge Work Agents.

This track is a nice complement to similar topics on GraphRAG, RecSys, and MCP.


AI Twitter Recap

Model Releases and Performance

AI Agents, Tooling & Applications

Infrastructure, Efficiency & Optimization

Research, Techniques & Evaluation

Industry & Broader Discourse

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-30B-A3B-Instruct-2507 Model Release and Community Impressions

2. GLM 4.5 Model Launches, Benchmarks, and Ecosystem Integration

3. Meta Observations on AI Model Progress (Memes and Commentary)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Wan 2.2 Model Release Benchmarks and Comparisons

2. OpenAI GPT-5 and Study Mode Announcements

3. AI Impact on Jobs and Society: Industry Predictions and Ethical Concerns


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Emerging AI Models & Performance Dynamics

Theme 2. AI Development & Infrastructure Challenges

Theme 3. AI Platform & Ecosystem Innovations

Theme 4. Ethical AI & User Experience Concerns

Theme 5. Advancements in AI Research & Techniques


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


OpenAI Discord


LM Studio Discord


Moonshot AI (Kimi K-2) Discord


Notebook LM Discord


Latent Space Discord


HuggingFace Discord


Eleuther Discord


GPU MODE Discord


Nous Research AI Discord


Yannick Kilcher Discord


MCP (Glama) Discord


DSPy Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


Gorilla LLM (Berkeley Function Calling) Discord


Torchtune Discord


MLOps @Chipro Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

R1 1776 removal, Claude 4.0 Sonnet, Sonar model


Perplexity AI ▷ #general (1101 messages🔥🔥🔥):

Comet browser cloud sync, Qwen3 30B model, unified memory, Nvidia 5080, OpenRouter


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity Deep Research API Support, Sonar Deep Research Output Issues


Unsloth AI (Daniel Han) ▷ #general (757 messages🔥🔥🔥):

vibe coding, WASM, FIPS 140-3, trl breaks everything, GLM 4.5 Air


Unsloth AI (Daniel Han) ▷ #off-topic (17 messages🔥):

HuggingFace Tokenizers, Windows 7 lifetime extension, Gemma 3 4B fine-tuning, RoPE Positional Encoding


Unsloth AI (Daniel Han) ▷ #help (106 messages🔥🔥):

Gemma 3 finetuning issues, TRL downgrade for Unsloth, Qwen2-VL tokenizer issues, GGUF conversion problems, InternVL model loading errors


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

marioz_70065: My humble exploit of unsloth has been published http://dx.doi.org/10.1002/isaf.70011


Unsloth AI (Daniel Han) ▷ #research (32 messages🔥):

LLMs Vision, Video encoders, Audio image relation, QwenOmni, Gemma 3 vision quantization


Unsloth AI (Daniel Han) ▷ #unsloth-bot (97 messages🔥🔥):

Memory usage variability, Verifying LoRA Weight Updates, CUDA error debugging, ImportError TRL library, Unsloth training LLM


LMArena ▷ #general (591 messages🔥🔥🔥):

GPT-5, GLM 4.5, Qwen, Data Privacy, Model Evaluation


LMArena ▷ #announcements (1 messages):

GLM-4.5, LMArena


Cursor Community ▷ #general (463 messages🔥🔥🔥):

O3 performance, Windsurf vs Cursor, Context window usage, Cursor Auto mode improvements, Cursor models and pricing


Cursor Community ▷ #background-agents (8 messages🔥):

Background Agent UI, Background Agent Snapshots, Local Background Agents, Background Agent Formatting, Docker Build Cache


Cursor Community ▷ #announcements (1 messages):

Cursor 1.3 Release, Terminal Sharing with Agent, Context Usage in Chat, Faster Edits


OpenRouter (Alex Atallah) ▷ #app-showcase (11 messages🔥):

AgentSmith launch, OpenRouter integration, Agent templates


OpenRouter (Alex Atallah) ▷ #general (347 messages🔥🔥):

GLM vs Kimi pricing, Model Settings being deleted randomly, 401 error with Deepseek, Qwen3 as architect, GPT 4.1 web search issues


OpenRouter (Alex Atallah) ▷ #discussion (9 messages🔥):

OpenRouter PR, Model Quality Transparency, Standard Lane Routing, DeepSeek Model Complaints


OpenAI ▷ #annnouncements (1 messages):

ChatGPT Study Mode, AI and Education, Step-by-Step Learning


OpenAI ▷ #ai-discussions (225 messages🔥🔥):

Copilot Vision in Edge, GPT-5 Release Date, Reasoning depth slider, Gemini models in Google Drive, AI Agency for automation


OpenAI ▷ #gpt-4-discussions (3 messages):

Scholar ChatGPT, GPT-5 Versions, Zenith Coding Model


OpenAI ▷ #prompt-engineering (6 messages):

GPT project resources, Personalized Model Interactions, AI Memory Format Prompt


OpenAI ▷ #api-discussions (6 messages):

GPT project resources, Personalized model interaction, New memory format


LM Studio ▷ #general (196 messages🔥🔥):

Voxtral Mini usage, LM Studio Performance drops, GLM 4.5 Tool Support, OpenWebUI setup with LM Studio, Qwen3 model


LM Studio ▷ #hardware-discussion (43 messages🔥):

AMD Ryzen AI MAX+ 395, Llama 70b Q6 performance, Devstral model, Qwen2.5-Coder, Gemma models


Moonshot AI (Kimi K-2) ▷ #general-chat (150 messages🔥🔥):

DeepSeek R1 launch, OpenAI monopoly, Kimi K2 love, API Key Errors, Kimi and Emojis


Notebook LM ▷ #announcements (3 messages):

Featured Notebooks rollout, New Studio UI, Video Overviews rollout


Notebook LM ▷ #use-cases (23 messages🔥):

Nursing Materials in NotebookLM, Gemini Agentic Framework, Audio Overview in NotebookLM, Obsidian and NotebookLM Integration, RFP reading with NLP


Notebook LM ▷ #general (119 messages🔥🔥):

PDF Upload Issues for Paid Users, Nursing Materials on NotebookLM, Podcast Personalization, NotebookLM RAG System, Character AI


Latent Space ▷ #ai-general-chat (138 messages🔥🔥):

Zenith AI models, LlamaIndex Oxylabs Integration, AI Pricing Models, Fireworks AI Valuation, AFM-4.5B Model


HuggingFace ▷ #general (83 messages🔥🔥):

Hugging Face support, Dalle-mini troubles, Hamilton-Norwood scale model training, ragflow production environment, low-latency deployment techniques for LLMs


HuggingFace ▷ #today-im-learning (13 messages🔥):

DRL Chapter 1, LLMs course Chapter 2, Transformers, LLM inference, Learnpytorch.io subscription


HuggingFace ▷ #cool-finds (1 messages):

cakiki: <@920321842013675620> Please don't cross-post and keep channels on topic.


HuggingFace ▷ #i-made-this (3 messages):

Model Loading Problems, Lyzr AI Launch


HuggingFace ▷ #reading-group (1 messages):

Diffusion Models Study Group, MIT's Diffusion Models Curriculum, Generative AI


HuggingFace ▷ #computer-vision (2 messages):

Pretrained Model for Image Similarity, Orientation Sensitivity in Image Matching


HuggingFace ▷ #agents-course (3 messages):

RAG system for long conversations, Filter out less important tokens


Eleuther ▷ #general (34 messages🔥):

LLMs as interp agents, Transluce's mech interp work, Modelscope vs Hugging Face, Diffusion Reading Group recordings, Low latency LLM deployment in marine environments


Eleuther ▷ #research (9 messages🔥):

ArXiv's experimental LaTeX rendering, AI Peer Pressure research


Eleuther ▷ #scaling-laws (1 messages):

LLM Based data-compression, Scaling Laws, Non-text data compression


Eleuther ▷ #interpretability-general (20 messages🔥):

MATS 9.0 Applications, Circuit Discovery for POS, ICL Breaking Interpretability Tools, SAE Generalization Failures, Lucas Critique and LLM Safety


Eleuther ▷ #lm-thunderdome (8 messages🔥):

SQuAD F1 Score, HalluLens Implementation, lm-harness Metrics Configuration


Eleuther ▷ #multimodal-general (1 messages):

Diffusion Models Study Group, Flow Matching, MIT's Diffusion Models Curriculum


Eleuther ▷ #gpt-neox-dev (3 messages):

TokenSmith Release, MoE Implementation, Grouped GEMM, Low Precision MoE training


GPU MODE ▷ #general (7 messages):

Passing args by pointer, Cloud provider with single b200


GPU MODE ▷ #triton (19 messages🔥):

Profiling Triton kernels with Nsight Compute, Getting Triton and PTX code from torch compile, Forcing pure Triton in Torch Inductor, GEMM with a ping-pong schedule in Triton


GPU MODE ▷ #cuda (6 messages):

CUBIN files, ELF fatbinaries, nvidia sdk


GPU MODE ▷ #beginner (2 messages):

GPU Mode Leaderboard Challenges, Learning Resources


GPU MODE ▷ #thunderkittens (1 messages):

level_04 bug, missing zero(C_accum)


GPU MODE ▷ #general-leaderboard (1 messages):

fido01698: 33342 with sample trimul.py get from template command


GPU MODE ▷ #hardware (1 messages):

SLURM vs k8s, Multi-GPU training, Kubeflow, HPC Forums


GPU MODE ▷ #factorio-learning-env (3 messages):

can_place_entity bug


GPU MODE ▷ #cutlass (3 messages):

TV-layout visualizer, cute-dsl, gist.github.com


GPU MODE ▷ #multi-gpu (24 messages🔥):

DTensor Learning, Single GPU Distributed, manual_seed_all for ranks


Nous Research AI ▷ #general (56 messages🔥🔥):

MoE vs Dense Models, Local LLM finetuning, GLM Model Architectures, Anthropic API restrictions


Nous Research AI ▷ #interesting-links (1 messages):

kneeanderthul: https://github.com/ProjectPAIE/sovereign-file-tracker


Yannick Kilcher ▷ #general (29 messages🔥):

Sparsity in CPUs/GPUs, MoE Performance, Kimi K2 vs Claude, Optimal Active Parameter Count


Yannick Kilcher ▷ #ml-news (10 messages🔥):

YouTube shorts, TikTok's algorithm, Personalized Content, ChatGPT study mode


MCP (Glama) ▷ #general (22 messages🔥):

Blockchain Authorization, Monday.com AI duo, MCP User Context Separation, MCP Server on EC2, BDD Side Project


MCP (Glama) ▷ #showcase (6 messages):

VS Code MCP Extension, MCPJam Inspector, Nexus mobile app store


DSPy ▷ #show-and-tell (1 messages):

dhar007: That's the new DSPy optimizer, isn't it 🙂


DSPy ▷ #papers (1 messages):

optimizer


DSPy ▷ #general (23 messages🔥):

GEPA: Reflective Prompt Evolution, Optimizing tool response feedback, dspy.Variable and dspy.Parameter, AI Engineer specializing in agentic systems


Modular (Mojo 🔥) ▷ #mojo (19 messages🔥):

external_call usage in Mojo, C ABI function calls, Mojo standard library development, File descriptor features, Mojo module naming feature request


Modular (Mojo 🔥) ▷ #max (4 messages):

PyTorch 2.7 dependency, Max's PyTorch version, Nightly Builds, Minimum PyTorch Version


LlamaIndex ▷ #announcements (1 messages):

FlowMaker, LlamaIndex office hours, Document Agents for Finance, S3VectorStore, LlamaParse header and footer detection


LlamaIndex ▷ #blog (5 messages):

Agent Design Patterns, Web Scraping AI Agents, LlamaCloud Nodes for n8n, AI Document Agents, LlamaCloud Managed Embeddings


LlamaIndex ▷ #general (13 messages🔥):

Export Flowmaker to Python, Llamacloud PDF Detection Issue, File Extension Naming Conventions


Manus.im Discord ▷ #general (15 messages🔥):

Grok for Prompting, Manus Credit System, Agentic Systems Comparison (Lume vs. Suna)


tinygrad (George Hotz) ▷ #general (9 messages🔥):

Tensor Implementation in Tinygrad, Closed PR #11410 Analysis, Alternative Implementation Ideas


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (4 messages):

BFCLv4, Open Source Agent Systems, API Key Offerings, Multi-Agent Systems


Torchtune ▷ #general (1 messages):

LoRA-style adapter, TorchTune support


Torchtune ▷ #dev (2 messages):

RL Tests, CI Debugging


MLOps @Chipro ▷ #events (2 messages):

Diffusion Models, Study Group, Generative AI, MIT Curriculum


Nomic.ai (GPT4All) ▷ #general (2 messages):

Nomic dataset Access, contrastors repo, model Selection


Cohere ▷ #👋-introduce-yourself (1 messages):

Introductions, Community Hopes