Frozen AI News archive

not much happened today

**Anthropic** introduces durable agents and MCP tasks for long-running workflows, with practical engineering patterns and integrations like Prefect. **Booking.com** deploys a large-scale agent system improving customer satisfaction using LangGraph, Kubernetes, GPT-4 Mini, and Weaviate. **Perplexity** rolls out user-level memory and virtual try-on features. **Claude Opus 4.5** leads on LisanBench and Code Arena WebDev benchmarks with mixed community feedback on its "thinking" and "non-thinking" modes, while improving cost-efficiency and UX with batch APIs and context compaction. Research on multi-agent systems shows **LatentMAS** reduces communication tokens by 70-84% and improves accuracy using Qwen3 models, and reasoning trace distillation achieves significant token reduction with maintained accuracy, highlighting the importance of reasoning trace style.

Canonical issue URL

Happy thanksgiving!

AI News for 11/25/2025-11/26/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 9014 messages) for you. Estimated reading time saved (at 200wpm): 713 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We're taking the last round of signups for the 2025 Dev Writers Retreat. Join us in San Diego after NeurIPS!


AI Twitter Recap

Agent systems: long-running harnesses, MCP tasking, and production deployments

Claude Opus 4.5: evals, cost/UX learnings, and new skills

Efficient reasoning and multi-agent communication

Beyond gradients and scaling systems

Multimodal and generative modeling updates

Open ecosystem, evaluation, and governance

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Alibaba Text-to-Image Model Launch

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Opus 4.5 Model Success Stories

2. New AI Model Announcements and Benchmarks

3. Humorous AI and Tech Memes


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. Next-Gen Image and Video Models Hit Production Workflows

2. Agentic UX, Code Assistants, and Chat Frontends Evolve

3. GPU Kernels, Distributed Inference, and Training Tricks

4. Open Tools, Protocols, and Model Routing Infrastructure

5. Safety, Robustness, Data Economics, and Evaluation Reality Checks


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


GPU MODE Discord


OpenAI Discord


LM Studio Discord


OpenRouter Discord


Nous Research AI Discord


Eleuther Discord


Latent Space Discord


Yannick Kilcher Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


Moonshot AI (Kimi K-2) Discord


DSPy Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1279 messages🔥🔥🔥):

Cameo Word Choice, Pro Grounding, Flux 2 Models, LMarena Updates, NB Pro


LMArena ▷ #announcements (4 messages):

Image Edit Update, New Model Update, Leaderboard Update, Flux-2-pro, Flux-2-flex


Perplexity AI ▷ #general (1082 messages🔥🔥🔥):

AI doom, Palantir Technologies, Nvidia and Open AI partnership, Bypassing AI Detectors, Perplexity limits


Unsloth AI (Daniel Han) ▷ #general (182 messages🔥🔥):

FP8 RL Documentation, Optimization Techniques, Qwen3VL vs 30B-A3B, AI GPU Kernels, Embedding Models


Unsloth AI (Daniel Han) ▷ #off-topic (173 messages🔥🔥):

Claude Opus 4.5, wakeword solution, MS or PhD interviews, Long context training, Humanoid stamina


Unsloth AI (Daniel Han) ▷ #help (103 messages🔥🔥):

IPEX vs llama.cpp Vulkan, HF model to GGUF conversion, Continued pretraining vs Fine-tuning, Qwen3 8B Fine-tuning issues, AMD GPU support for bitsandbytes


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

ERNIE AI Developer Challenge, Baidu ERNIE, Unsloth finetuning, AMD notebooks


Unsloth AI (Daniel Han) ▷ #research (12 messages🔥):

Evolutionary Strategies at Scale, LESA: Learnable LLM Layer Scaling-Up, Efficient Training on CPU


Cursor Community ▷ #general (371 messages🔥🔥):

Haiku documentation accuracy, Cursor agent's plan markdown storage, Free Agent Review, Education discounts


GPU MODE ▷ #general (7 messages):

Triton Kernels, Partially Trainable Embedding, Logits Softmax Operation, Curriculum Learning


GPU MODE ▷ #triton-gluon (5 messages):

Proton vs Nsight Systems, Tensor Descriptors, Auto Tune Parameters, Tritonparse, Persistent Matmul Tutorial


GPU MODE ▷ #cuda (17 messages🔥):

GEMM with tensor cores, NVIDIA Tensor Cores performance optimization resources, BF16 matrix multiplication, CUDA implementation details, Matrix data loading strategies


GPU MODE ▷ #torch (3 messages):

Gradient Checkpointing, Torch Differentiation, Boolean Flagging


GPU MODE ▷ #beginner (13 messages🔥):

Contributing to XLA, GPU/CUDA Benchmarking Warmup Runs, Kernel Characteristics Affecting Warmup Time, Thermal Limits in Benchmarking, nvbench thermal states


GPU MODE ▷ #jax-pallas-mosaic (2 messages):

jax.pmap vs jitting on single GPU, Multi vs single GPU systems


GPU MODE ▷ #off-topic (1 messages):

Memes


GPU MODE ▷ #irl-meetup (1 messages):

szymonoz: I'll be coming to NeurIPS and traveling to SF afterwards, hmu if you want to chat gpus 😄


GPU MODE ▷ #intel (1 messages):

2bit Dequantization on Intel GPU, GPU Dequantization Methods, Torch Performance on Intel GPU


GPU MODE ▷ #self-promotion (1 messages):

aerlabs: https://x.com/aerlabs_/status/1993561244196868370


GPU MODE ▷ #🍿 (1 messages):

LLM initiatives, LLM Kernel Generation, Agentic Systems


GPU MODE ▷ #thunderkittens (10 messages🔥):

CUDA kernels, Flash Attention, MoE kernels, Linear Attention backwards, FFT conv backwards


GPU MODE ▷ #submissions (114 messages🔥🔥):

NVIDIA leaderboard submissions, nvfp4_gemv leaderboard, Personal bests, Successful submissions


GPU MODE ▷ #factorio-learning-env (3 messages):

Factorio Learning Environment Docs, Jack Hopkins, Github Pages


GPU MODE ▷ #cutlass (2 messages):

SIMT loads, Tiled_mma documentation


GPU MODE ▷ #singularity-systems (3 messages):

picograd, aten-like Op intermediate representation, Device runtimes


GPU MODE ▷ #multi-gpu (3 messages):

LLM Inference, NVRAR algorithm, PAT Algorithm, Bruck algorithm, Recursive doubling algorithm


GPU MODE ▷ #nvidia-competition (159 messages🔥🔥):

CuTeDSL packed FP16, eval.py issues, cudaStreamSynchronize(), LLM-only challenges, sfa_permuted purpose


GPU MODE ▷ #hf-kernels (5 messages):

Metal Kernels Release, MacOS Compatibility Issues


GPU MODE ▷ #robotics-vla (8 messages🔥):

7x Laundry Folding Robot, No-Action Filtering, Qwen3-VL Optimization, Classic Binning vs FAST Tokenizer


OpenAI ▷ #ai-discussions (263 messages🔥🔥):

ChatGPT Biases, Nano Banana Pro, Commercial Use of AI Generated Images, GPT 5.0 mini, OpenAI UI Design


OpenAI ▷ #gpt-4-discussions (10 messages🔥):

GPT 5.1, GPT 4.1, Chat reference memory, Anime writing


OpenAI ▷ #prompt-engineering (1 messages):

mx_fuser: <@1256251788454268953>


OpenAI ▷ #api-discussions (1 messages):

mx_fuser: <@1256251788454268953>


LM Studio ▷ #general (46 messages🔥):

Unsupported API Endpoints in LM Studio, Image Captioning Issues with LM Studio, Vision Models, ROCm 7 Update for RDNA 3, Mint Opportunity Partnership with OpenSea


LM Studio ▷ #hardware-discussion (217 messages🔥🔥):

Q8 Cache, GPU Fans at 0% During Inference, Memory Pricing Issues, DLSS and RT Testing, Hardware Devaluation


OpenRouter ▷ #app-showcase (2 messages):

Color Picker Issues, RapidaAI Open Source


OpenRouter ▷ #general (196 messages🔥🔥):

Opus Overload, Model Fallback Bug, Deepseek R1 Model Gone, Meganova Chat Buzz, OpenRouter Pricing and Features


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (5 messages):

Arrakis AI model, Text-to-Video Leaderboard, Kling 2.5 Turbo, Google Veo 3


Nous Research AI ▷ #announcements (1 messages):

Psyche Office Hours


Nous Research AI ▷ #general (146 messages🔥🔥):

Suno Warner Music Partnership, Data vs Compute Cost, Blackwell Architecture, Z-Image Model, AI Disclosure on Steam


Nous Research AI ▷ #ask-about-llms (2 messages):

LLM benchmarks, pre-training data contamination, private benchmarks


Nous Research AI ▷ #interesting-links (2 messages):

History of Information Retrieval, RAG, Library of Alexandria


Eleuther ▷ #general (81 messages🔥🔥):

Hallucinations in Multi-Stage LLMs, AI and Collaborative Work, LLMs as Golden Retrievers, Verifying AI Claims, AI fact checking misinformation


Eleuther ▷ #research (37 messages🔥):

SGD shuffling, PIQA paper typo, Emergent Misalignment paper replication, AI for Drug Discovery


Eleuther ▷ #scaling-laws (1 messages):

junktown_24268: https://papers.cool/arxiv/2509.24406 - section 3, pictures in 5.1 etc etc


Latent Space ▷ #ai-general-chat (69 messages🔥🔥):

Claude Code’s upgraded Plan Mode, DeepMind Documentary, Jeff Dean’s 15-Year ML Retrospective & Gemini 3.0, AI Generated Slides, OpenAI vs Claude


Latent Space ▷ #ai-announcements (2 messages):

SOTA Vision, RF-DETR Paper, NeurIPS, Dev Writers Retreat 2025


Latent Space ▷ #genmedia-creative-ai (31 messages🔥):

Black Forest prompting guide, Wisprflow new funding, SGLang diffusion, Whisper Thunder vs VideoGen, AI Image Realism Showdown


Yannick Kilcher ▷ #general (61 messages🔥🔥):

Information Retrieval History, Genesis AI platform by Department of Energy, Curriculum Learning for Pretraining LLMs, MIT Study on AI Replacing Jobs, Trumpcoin Protocol for Zero Knowledge Proofs


Yannick Kilcher ▷ #paper-discussion (24 messages🔥):

Adobe AI summaries, LLM Summarization Limitations, ADHD and Autism in AI/CS, Posting papers without understanding


Yannick Kilcher ▷ #ml-news (6 messages):

Nano Banana Pro, Tencent Hunyuan, MAGA pushback on AI datacenters, AI replacing US workforce


HuggingFace ▷ #general (19 messages🔥):

Hugging Face Inference API, Christmas gift drop, Error in Hugging Face, Genesis Mission, PDF reader model for LLMStudio


HuggingFace ▷ #cool-finds (1 messages):

aboodj_: epic


HuggingFace ▷ #i-made-this (8 messages🔥):

RapidaAI Open Source, French Books Dataset, AI Sci-Fi Short Film


HuggingFace ▷ #reading-group (3 messages):

Chunking, GNN presentation, Structured data


HuggingFace ▷ #agents-course (1 messages):

dodrawat: let's connect


Modular (Mojo 🔥) ▷ #general (2 messages):

Mojo repo, Copybara, Repo Sync


Modular (Mojo 🔥) ▷ #max (20 messages🔥):

MAX examples for newbies, MAX written in Python, Mojo API in MAX, Migrating Python MAX code to Mojo MAX, Performance gains in MAX with Mojo


tinygrad (George Hotz) ▷ #learn-tinygrad (19 messages🔥):

TinyJit internals, Non tinygrad Python operations, Randomness functions in Tinygrad, Tinygrad JIT tutorial, PyTorch compiler history


Moonshot AI (Kimi K-2) ▷ #general-chat (14 messages🔥):

Kimi's Limits, Chatbots vs Canvases, Conversational Fallacy


DSPy ▷ #show-and-tell (4 messages):

dspy-cli tool, DSPy projects, FastAPI endpoints, MCP tools, Docker hosting


DSPy ▷ #general (9 messages🔥):

ReAct Module Trajectory Injection, Web Search API Implementation in DSPy, Anthropic Web Search API, Latency issues with web search API calls


MCP Contributors (Official) ▷ #general (11 messages🔥):

New Protocol Version, UI SEP Release, MCP Namespace Collision


Manus.im Discord ▷ #general (8 messages🔥):

AI Engineer introduction, API Issues, Telegram channel


aider (Paul Gauthier) ▷ #general (3 messages):

Benchmark Updates, Opus 4.5 vs Sonnet 4.5