Frozen AI News archive

not much happened today

**OpenAI** has fully rolled out its ChatGPT agent to all Plus, Pro, and Team users and is building hype for the upcoming **GPT-5**, which reportedly outperforms **Grok-4** and can build a cookie clicker game in two minutes. **Alibaba's Qwen** team released the open-source reasoning model **Qwen3-235B-Thinking**, achieving an **89%** win rate over **gpt4-0314** using a new RL algorithm called **Group Sequence Policy Optimization (GSPO)**. **Runway** introduced **Runway Aleph**, a state-of-the-art in-context video model for editing and generating video content. **Hugging Face** highlights the growing momentum of open-source AI, especially from Chinese teams. Other updates include **Kling's** upgrades for image-to-video generation and **Google's Imagen 4 Ultra** being recognized as a top text-to-image model. **Anthropic** integrated **Claude** with **Canva** for branded visual designs but faces stability issues. The **PyTorch** team released optimized checkpoints for **SmolLM3** to speed up inference.


a good day for Open Source AI

AI News for 7/24/2025-7/25/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (226 channels, and 8449 messages) for you. Estimated reading time saved (at 200wpm): 595 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

It's worth looking at Qwen 3 Thinking and the AIE SWE Agents track, which is now fully released.


AI Twitter Recap

Major Model Releases & Updates (Open Source vs. Closed Source)

AI Tooling, Frameworks, and Agents

Technical Insights & Research

Robotics & Industry Commentary

AI Applications & Use Cases

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-235B Model and Benchmark Performance Release Wave

2. Qwen3 Model Variants: Thinking, Instruct, and Smaller Models

3. AI Coding and Code Benchmark Performance (SWE-Bench, GLM-4.1V)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI Agent Mode and GPT-5 Rumors and Releases

2. Claude Code and Anthropic Feature Updates

3. Wan 2.x Model Advances and Community Benchmarks


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. The New Model Onslaught and GPT-5 Rumor Mill

Theme 2. Performance Praises, Pitfalls, and Outright Bugs

Theme 3. In the Trenches of Fine-Tuning, Quantization, and RAG

Theme 4. The Expanding AI Developer Toolkit and Infrastructure

Theme 5. AI Consciousness, Censorship, and a "Woke" White House


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Moonshot AI (Kimi K-2) Discord


LM Studio Discord


Eleuther Discord


Latent Space Discord


Nous Research AI Discord


GPU MODE Discord


Yannic Kilcher Discord


Manus.im Discord


Cohere Discord


Notebook LM Discord


aider (Paul Gauthier) Discord


LLM Agents (Berkeley MOOC) Discord


LlamaIndex Discord


Torchtune Discord


MCP (Glama) Discord


Nomic.ai (GPT4All) Discord


Modular (Mojo 🔥) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 message):

Perplexity AI, AMA, Residency Program, r/csMajors


Perplexity AI ▷ #general (1202 messages🔥🔥🔥):

Comet Browser, GPT-5, Perplexity Max, Battery Temperature on iOS, Huawei trifold


Perplexity AI ▷ #sharing (2 messages):

Perplexity AI Search URLs


Perplexity AI ▷ #pplx-api (1 message):

vikvang: hey! it should be working now. are you still experiencing problems?


OpenAI ▷ #annnouncements (1 message):

ChatGPT agent rollout


OpenAI ▷ #ai-discussions (1013 messages🔥🔥🔥):

Agent mode, AI wedding planner, Consciousness, OpenRouter, Qwen3


OpenAI ▷ #gpt-4-discussions (18 messages🔥):

GPT-5 LLM Arena, O3 fake sources, ChatGPT PDF issues, Codex Git error


OpenAI ▷ #prompt-engineering (12 messages🔥):

Introspective Thought Structuring with Prompts, Emotional Framing in Prompts, Prompt Engineering vs Creative Tooling, AI Language and Output, Custom Instructions and Model Behavior


OpenAI ▷ #api-discussions (12 messages🔥):

Prompt Engineering for Personal Reflection, Emotional Structuring with AI, AI therapist, anti-sycophancy custom instructions


LMArena ▷ #general (878 messages🔥🔥🔥):

GPT-5, Qwen 3, O3 Alpha, Lobster, Zenith and Summit


LMArena ▷ #announcements (1 message):

Video Arena Bot, AI Video Models, LMArena bot


Unsloth AI (Daniel Han) ▷ #general (704 messages🔥🔥🔥):

Magistral release hype, Qwen3 Coder Setup, Fine-tuning vs RAG, GRPO for vision models, Qwen3 Thinking GGUFs


Unsloth AI (Daniel Han) ▷ #introduce-yourself (9 messages🔥):

Hardware Acceleration for ML Models, Community Introductions


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

ELiTA and TaMeR research, Singing voice style replication, Fourier spectrum colors


Unsloth AI (Daniel Han) ▷ #help (81 messages🔥🔥):

Qwen 1.7B for tool calling, Gemma 3 1B GRPO notebook issues, vLLM support for Gemma 3, Gemma3-27b-it for GRPO training, Unsloth and Hugging Face transition scores


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

Unsloth fine-tuning video, Gemma-3n:2e, Llama-3.1, Donald Trump AI, AI presidential debate


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

LLMs for classifying social media posts, Seq2Seq models like FLAN-T5


Unsloth AI (Daniel Han) ▷ #unsloth-bot (50 messages🔥):

Fine-tuning methods for LLMs, Saving and pushing models to Hugging Face, QAT support in Unsloth, Changing RoPE max positional embeddings, Dynamic 2.0 file for Qwen3 Coder models


Cursor Community ▷ #general (463 messages🔥🔥🔥):

Cursor file deletion bug, Frustrations with Cursor's 'auto' model, Context Usage Feature, Claude Swarm vs Cursor, Alternative coding tools


Cursor Community ▷ #background-agents (5 messages):

Background agents waiting for start script, Fetching inline GitHub pull request comments, Monitoring of [email protected]


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

Personality.gg, AI Translation, Slang translation, Contextual understanding


OpenRouter (Alex Atallah) ▷ #general (269 messages🔥🔥):

Qwen SimpleQA Drama, Qwen3 Coder vs Free, Deepseek V3 Base Model Gone?, Deepseek as Dipsy, OpenAI blocking China


OpenRouter (Alex Atallah) ▷ #new-models (1 message):

Readybot.io: OpenRouter - New Models


OpenRouter (Alex Atallah) ▷ #discussion (109 messages🔥🔥):

OpenRouter Serverless Architecture, Cloudflare R2 Storage, Large File Support, WandB Inference as Competitor, Compute Exchange


HuggingFace ▷ #general (243 messages🔥🔥):

LLMs for legal work, Hugging Face Inference API, Fine-tuning LLMs, GPUs for FOSS AI Hosting, Qwen3-Thinking Model


HuggingFace ▷ #today-im-learning (2 messages):

LLM fine-tuning, LoRA, Whisper, Danish speech data


HuggingFace ▷ #i-made-this (2 messages):

Rhapsody project, Quantized models, HQQ quants, llama3.1-8B, torchao library


HuggingFace ▷ #computer-vision (11 messages🔥):

nnunet SOTA, Google's SAM2 models, Danbooru dataset dimensions, Image embedding model training, 8-dim output semantic meaning


HuggingFace ▷ #agents-course (5 messages):

smolagents, llamaindex, Course Submission Limits


Moonshot AI (Kimi K-2) ▷ #general-chat (156 messages🔥🔥):

Kimi K2 pricing model, Kimi K2 coding-specialized version, Kimi K2 + Reasoning + Vision, Serverless Kimi K2, Kimi K2 use cases


LM Studio ▷ #general (131 messages🔥🔥):

MCP Servers for Online Search, LLM Plugins Development, Changing Model Download Location, Remote LM Studio Setup, LLM Tier Lists and Quantization


LM Studio ▷ #hardware-discussion (17 messages🔥):

4090, iGPU for video output, Budget-friendly GPUs, 5070ti, VRAM limitations


Eleuther ▷ #general (74 messages🔥🔥):

Validation Set Corruption, Algoverse AI program, Human-like AI Personality, Hyperparameter Gaming, SOAR program vs Algoverse


Eleuther ▷ #research (17 messages🔥):

HRM loops, Causality in models, KV Caching strategies, Qwen finetuning


Eleuther ▷ #gpt-neox-dev (3 messages):

Security Vulnerability Reporting, Async Checkpointing for NeoX


Latent Space ▷ #ai-general-chat (69 messages🔥🔥):

Qwen3 Model, GPT-5 Launch, Claude Opus Rate Limits, Nitter Rate Limiting, Tidbit AI Tool


Nous Research AI ▷ #announcements (1 message):

Psyche office hours, Discord event space


Nous Research AI ▷ #general (46 messages🔥):

Stage Channel Creation, Psyche Office Hours, Hermes 3-405B, Anthropic Reliability, Atropos Updates


Nous Research AI ▷ #research-papers (2 messages):

Dataset Publishing, Unknown Architecture


Nous Research AI ▷ #interesting-links (11 messages🔥):

Codex I, Nvidia Cutlass, Higgs Audio TTSEE, Philosophical AI discussion


Nous Research AI ▷ #research-papers (2 messages):

Dataset Architecture, Dataset Publishing


GPU MODE ▷ #general (16 messages🔥):

AutoCite app feedback, VSCode vs Overleaf, Hackathon sleep arrangements, NYC Hackathon


GPU MODE ▷ #triton (10 messages🔥):

Triton Masking, Triton block_ptr deprecation, Triton vector @ matrix multiplication, GEMV Kernel, GEMM implementation


GPU MODE ▷ #cuda (1 message):

Nsight Copilot


GPU MODE ▷ #torch (2 messages):

Torch uint8 workaround, Triton


GPU MODE ▷ #announcements (1 message):

NYC Hackathon, Jane Street, Tri Dao, Soumith Chintala, Coreweave


GPU MODE ▷ #cool-links (2 messages):

ChipBenchmark, Tilderesearch Tweet


GPU MODE ▷ #jobs (1 message):

AMD Global Hiring, US-Based Interns


GPU MODE ▷ #beginner (2 messages):

HF Hub vs Repo for Model Weights


GPU MODE ▷ #torchao (3 messages):

Weight Pruning Research, Wanda & Wanda++ for weight pruning, Adaptive Pruning and Tuning (APT), Custom Kernels like Squared-ReLU


GPU MODE ▷ #self-promotion (1 message):

Warp specialization, CuTeDSL Tile Scheduler, Persistent GEMM kernel, Hopper TMA and WGMMA, Cluster-based TMA load


GPU MODE ▷ #thunderkittens (1 message):

bf16 high error rates, matmul kernels


GPU MODE ▷ #status (1 message):

VS Code syntax highlighting, PyTorch Load Inline Highlighter


GPU MODE ▷ #factorio-learning-env (13 messages🔥):

Sonnet Benchmarking, Action Space Context, OpenRouter


GPU MODE ▷ #cutlass (1 message):

CuTe, shared memory, swizzle, Layout, make_tiled_copy


Yannic Kilcher ▷ #general (22 messages🔥):

NeurIPS reviews, Karpathy on academic paper inflation, Alternative paper platforms, LLM Context Management, Downvote Politics


Yannic Kilcher ▷ #paper-discussion (9 messages🔥):

Paper Discussion, Arxiv Sharing, Mathy Papers, Large-Scale Evening Meeting


Yannic Kilcher ▷ #ml-news (9 messages🔥):

Grok Training Data, DEI in AI Models, Industrial Policy, Gemini Model Controversy, Imagen-4 and Gemini 2.5


Manus.im Discord ▷ #general (17 messages🔥):

Spam bots, Server issues, Vibe Coding AI, Scientific Manus paper


Cohere ▷ #🔌-api-discussions (11 messages🔥):

Helicone.ai integration with Cohere models, Command R+ vs. Command A, On-prem deployment of Cohere models


Cohere ▷ #👋-introduce-yourself (3 messages):

Crafted Logic Lab, Cognitive OS Assistant, Helicone.ai gateway, Humanist AI Values


Cohere ▷ #🧭-status-feed (1 message):

Cohere Model Outage, Command models down


Cohere ▷ #🔬-research (1 message):

Command R+, Humanity's Last Exam test, Hummingbird Anatomy Question


Notebook LM ▷ #use-cases (8 messages🔥):

ChatGPT agent login issues, Missing Share button, Metadata in Source


Notebook LM ▷ #general (7 messages):

Podcast Generation, File Uploading Error


aider (Paul Gauthier) ▷ #general (8 messages🔥):

GPT5, Textual 5.0.0, Qwen3-coder, Aider and testing


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (8 messages🔥):

Agents class at Berkeley, Certificate Issues, Article Submission


LlamaIndex ▷ #blog (3 messages):

LLM APIs vs Production Document Parsing, Screenshot Parsing Gaps, Accuracy Issues in Parsing, Natural Language Git Commands, S3 Vector Storage Integration


LlamaIndex ▷ #general (4 messages):

Docx Parsing with Images, LlamaIndexOpenTelemetry Traces


Torchtune ▷ #general (5 messages):

Large Scale PEFT, LoRA/Q-LoRA hooks, Scheduler knobs, RL alignment


Torchtune ▷ #dev (2 messages):

FSDP+TP Issues, NCCL Timeout, HuggingFace DCP Saver


MCP (Glama) ▷ #general (5 messages):

Memory Hallucinations, MCP Server Recommendations, Macaw Security Beta, Cloudflare Pay-Per-Crawl, Agentic Commerce


MCP (Glama) ▷ #showcase (1 message):

MCP OAuth, OAuth flow


Nomic.ai (GPT4All) ▷ #general (4 messages):

GPU Recommendations, RX 9060 XT vs RX 6800 XT, Vector Storage Limitations


Modular (Mojo 🔥) ▷ #mojo (1 message):

Modular's choice of Nanobind/Pybind over Cython for Python interop, Cython's limitations at scale, Approachability of Cython vs. Nanobind/Pybind


MLOps @Chipro ▷ #events (1 message):

bamiji: alright then, thanks for responding


Codeium (Windsurf) ▷ #announcements (1 message):

Qwen3-Coder release, Windsurf Server Tags