Frozen AI News archive

not much happened today

**Google's Project Suncatcher** prototypes scalable ML compute systems in orbit using solar energy with Trillium-generation TPUs surviving radiation, aiming for prototype satellites by 2027. **China's 50% electricity subsidies** for datacenters may offset chip efficiency gaps, with **Huawei** planning gigawatt-scale SuperPoDs for DeepSeek by 2027. **Epoch** launched an open data center tracking hub, and **Deutsche Telekom** and **NVIDIA** announced a $1.1B Munich facility with 10k GPUs. In agent stacks, **MCP** (Model Context Protocol) tools gain traction with implementations like **LitServe**, **Claude Desktop**, and **Reka's MCP server** for VS Code. Anthropic emphasizes efficient code execution with MCP. Context engineering shifts focus from prompt writing to model input prioritization, with reports and tools from **Weaviate**, **Anthropic**, and practitioners highlighting instruction-following rerankers and embedding approaches. DeepMind's **IMO-Bench** math reasoning suite shows **Gemini DeepThink** achieving high scores, with a ProofAutoGrader correlating strongly with human grading. Benchmarks and governance updates include new tasks and eval sharing in lighteval.


a quiet day.

AI News for 11/3/2025-11/4/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (200 channels, and 6479 messages) for you. Estimated reading time saved (at 200wpm): 551 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

4th quiet day in a row...


AI Twitter Recap

Compute, energy, and AI datacenters


Agent stacks, MCP, and context engineering


Reasoning, math, and evaluation


Robotics and Physical AI


Local inference and dev tooling


Multimodal and video generation


Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen Model Ecosystem Impact

2. llama.cpp WebUI Release

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Communication Innovations

2. AI in Media and Advertising

3. AI in Personal and Educational Contexts


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1: Models Muscle Up Rankings

Theme 2: Hardware Heats Up Debates

Theme 3: Tools Tackle AI Workflows

Theme 4: Benchmarks Bash Flaws

Theme 5: Legal and Safety Storms Brew


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


OpenRouter Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


Cursor Community Discord


GPU MODE Discord


Nous Research AI Discord


HuggingFace Discord


OpenAI Discord


tinygrad (George Hotz) Discord


DSPy Discord


Latent Space Discord


Yannick Kilcher Discord


Manus.im Discord


aider (Paul Gauthier) Discord


Moonshot AI (Kimi K-2) Discord


Eleuther Discord


Windsurf Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1126 messages🔥🔥🔥):

Lithiumflow's fate, Minimax M2 ranking, GPT-5 Juice, BlackHawk


LMArena ▷ #announcements (1 messages):

WebDev Leaderboard, MiniMax-M2, Open Models


Perplexity AI ▷ #general (1044 messages🔥🔥🔥):

Comet browser, GPT Go free, Accessibility on the web, Model Comparisons


Perplexity AI ▷ #pplx-api (2 messages):

Perplexity Sonar Pro Search, Perplexity API


OpenRouter ▷ #announcements (3 messages):

OpenRouter Charts, Activity Grouping, Filtering Options


OpenRouter ▷ #app-showcase (2 messages):

fenic, OpenRouter Integration, LLM ETL, AI Workflows


OpenRouter ▷ #general (527 messages🔥🔥🔥):

ComfyUI for Free, AMD vs Nvidia for LLMs, Model Context Limits, Deepseek and Roleplay


OpenRouter ▷ #discussion (100 messages🔥🔥):

Google's AI Model Dislike, Bedtime Fable Animation Engine, Gemma Models Solve Captchas, Provider Feedback System, Movement Labs Allegations


Unsloth AI (Daniel Han) ▷ #general (206 messages🔥🔥):

BIM in Architecture, Vision Model Finetuning with Text, AI alignment discussion, Uncensored Joke Generation, TRL Notebook vs. Unsloth Notebook


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Blockchain Trust Systems, AI Problem Solving, Industry Transformation


Unsloth AI (Daniel Han) ▷ #off-topic (178 messages🔥🔥):

Dental Procedures Cost, HP vs Asus boxes, Non-Reasoning Instruct Models, SFT vs RL, Data Entry Nightmares


Unsloth AI (Daniel Han) ▷ #help (133 messages🔥🔥):

Unsloth OCR Deepseek Integration, EuroLLM-9B-Instruct Compatibility, llama.cpp GPU Allocation, Unsloth Cross Entropy Loss, GPT-OSS-20B and REINFORCE


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Unsloth channel rules, Showcase channel scope


Unsloth AI (Daniel Han) ▷ #research (6 messages):

Roblox PII Classifier, Open Sourcing, Data set


LM Studio ▷ #general (204 messages🔥🔥):

Qwen 30B vs 32B, GPU Recommendations for LLMs and Gaming, ComfyUI Integration with LM Studio, LM Studio CUDA Issues and Runtime Updates, Qwen3-Next 80B MoE


LM Studio ▷ #hardware-discussion (247 messages🔥🔥):

3090 prices, MI50, DDR5 EPYC system, ROCm Support, 3000rpm noctua fans


Cursor Community ▷ #general (420 messages🔥🔥🔥):

Web search in Cursor, Models for UI creation, Notes panel disappearance, Team account billing issue, Frequent Cursor updates


Cursor Community ▷ #background-agents (4 messages):

Background Agent UTF8 support, Cloud Agent Plans, Mobile Web UI Crashes, Background Agent Bug


GPU MODE ▷ #general (13 messages🔥):

Sonnet 4.5 vs Opus, CUDA lecture notes, YouTube stream planned


GPU MODE ▷ #triton-gluon (4 messages):

Gluon in Triton, Tritex: LLM Pre-Training in Triton, Community Meetup


GPU MODE ▷ #cuda (19 messages🔥):

SMEM descriptor calculation for tcgen05/wgmma, Cutlass Tutorial wgmma Hopper, cutedsl hopper dense_gemm CTA Swizzle, Blackwell cards have a scheduler, Memory-bound matmuls


GPU MODE ▷ #torch (13 messages🔥):

vLLM build with newer pytorch, torch.compile cuda graph recapture, torch.compile + grouped_mm issue, UserWarning: Logical operators 'and' and 'or' are deprecated


GPU MODE ▷ #announcements (1 messages):

NVIDIA, kernel competition, NVFP4 kernels, Blackwell, CuTe DSL


GPU MODE ▷ #algorithms (1 messages):

chhillee: not required on blackwell afaik


GPU MODE ▷ #jobs (1 messages):

Mixlayer, AI inference platform, Rust, CUDA, Hiring founding engineer


GPU MODE ▷ #beginner (25 messages🔥):

High Dimensional Probability and Neural Nets, Compilers and Kernel Engineering, Nvidia Cuda Compiler (NVCC) based on LLVM, ncu setup in a public cloud, RL bug and accumulator type fixed at fp32


GPU MODE ▷ #torchao (7 messages):

TorchAO, fbgemm kernels, Weight-only float8 kernel, torch.compile


GPU MODE ▷ #irl-meetup (1 messages):

felixultimaforeverromanempire: I'll be there


GPU MODE ▷ #rocm (1 messages):

nod-ai, shark-ai, kernel optimization guide


GPU MODE ▷ #tilelang (3 messages):

TileLang, Spark, GPU Support


GPU MODE ▷ #metal (3 messages):

Torchao Metal Kernels, Nikita Metal talk, Manuel Metal Talk, Quantization


GPU MODE ▷ #self-promotion (4 messages):

Tritex LLM pre-training in Triton, Disaggregated Inference Retrospective, Symbolica AI Rust Hackathon


GPU MODE ▷ #thunderkittens (1 messages):

solimao.123: Hi 👋 is there a branch I can try for the b200 attn kernel forward pass?


GPU MODE ▷ #gpu模式 (3 messages):

Compute Limitations, Inference Optimizations, Chinese AI Expertise


GPU MODE ▷ #submissions (14 messages🔥):

VectorAdd Leaderboard Updates, Grayscale Leaderboard Updates, H100 Performance, B200 Performance, A100 Performance


GPU MODE ▷ #status (3 messages):

Nvidia Competition Submission Portal, Discord Bot Submissions, CLI Submissions, Web Submissions


GPU MODE ▷ #hardware (10 messages🔥):

GPU Cloud Pricing, Hyperscaler vs Neo Cloud, NvLink Bridges, Volume Discounts, AI/ML Infra Engineers


GPU MODE ▷ #factorio-learning-env (6 messages):

FLE infra, Sonnet Distillation, Qwen3-8b-VL-Thinking, Factorio RL


GPU MODE ▷ #amd-competition (6 messages):

Node Allocation, Runtime Overhead


GPU MODE ▷ #cutlass (13 messages🔥):

Early Returns in Cutedsl, Semaphore Implementation in CuteDSL, Make Tiled Copy Implementation


GPU MODE ▷ #mojo (2 messages):

Mojo GPU Puzzles, Video Tutorial Series


GPU MODE ▷ #singularity-systems (6 messages):

picograd, fuzzing


GPU MODE ▷ #general (1 messages):

Leaderboard, CUDA Implementation, Python vs CUDA


GPU MODE ▷ #low-bit-training (6 messages):

Deepseek-style FP8 Blockwise Training, Cutlass FP8 GEMM Implementations, Per-Expert Column Major Layout


GPU MODE ▷ #opencl-vulkan (2 messages):

clspv OpenCL kernels, GLSL compute shaders, SPIR-V


GPU MODE ▷ #cluster-management (2 messages):

Node Configuration Scripts, OS Image Preconfiguration, Lightweight Node Check Scripts, Continuous Configuration Monitoring


GPU MODE ▷ #helion (2 messages):

Lock mechanism in Helion, Fused linear cross entropy in Helion, atomic_cas and atomic_xchg in Helion


GPU MODE ▷ #nvidia-competition (191 messages🔥🔥):

Kernel Challenge Problems, GPU Competition Prizes, DSL Kernels, CUDA versions on cloud, B200 NVFP4 kernels


GPU MODE ▷ #hf-kernels (3 messages):

xenova.com


Nous Research AI ▷ #general (295 messages🔥🔥):

Peak AI, OpenAI Bubble, Anthropic Non-Open Source, Hyperstition, Gemini Uncensored


Nous Research AI ▷ #ask-about-llms (11 messages🔥):

Gesture-based Loom Interface, Frustrations with raising funding for gesture tech, Future of XR glasses and gestural interfaces, Repligate's Loom


Nous Research AI ▷ #research-papers (3 messages):

arXiv Paper Submission, arXiv Sponsor, Discord Sponsorship


Nous Research AI ▷ #interesting-links (2 messages):

Sparse Attention, Llama.cpp discussions


Nous Research AI ▷ #research-papers (3 messages):

Arxiv, Preprints, Sponsor, Discord


HuggingFace ▷ #general (198 messages🔥🔥):

Japanese AI Model, Open Source LLM, Polish Translation Peculiarities, AI Web Scraper, Sports Betting AI


HuggingFace ▷ #today-im-learning (5 messages):

Job Application Automation with Python, BERT Style Model Training, SetFit Contrastic Binary Classifier, Stealth Tactics for Web Scraping, HTML Selectors Debugging


HuggingFace ▷ #cool-finds (1 messages):

Agentic Engineering Meetup, Chicago AI Events


HuggingFace ▷ #i-made-this (22 messages🔥):

ComfyUI Workflows, LLM Evaluations, IFEval, Vulkan multi-gpu setups, Sparse Attention


HuggingFace ▷ #gradio-announcements (1 messages):

MCP 1st Birthday, Anthropic, Gradio, Hackathon, AI Agents


HuggingFace ▷ #agents-course (3 messages):

Hugging Face Agents Course channel confusion, API back up issues, Associated file errors


OpenAI ▷ #annnouncements (1 messages):

Sora Android App, Sora availability


OpenAI ▷ #ai-discussions (116 messages🔥🔥):

Sora 2 invite code, OpenAI bans medical advice, AI regulations, GPT-5 hate, OpenAI rerouting to GPT-5


OpenAI ▷ #gpt-4-discussions (17 messages🔥):

Custom GPT Knowledge Base issues, GPT-4o quality concerns, Fine-tuning requirements, GPT GO subscription management, Building ChatGPT apps


OpenAI ▷ #prompt-engineering (6 messages):

Meta-prompting, Behavioral Orchestration, Sora AI v2 prompt formatting


OpenAI ▷ #api-discussions (6 messages):

Meta-Prompting, Behavioral Orchestration, Prompt format for Sora AI v2


tinygrad (George Hotz) ▷ #announcements (1 messages):

tinybox pro v2, 8x 5090 workstation, rackable workstation


tinygrad (George Hotz) ▷ #general (76 messages🔥🔥):

Numpy Version Issues, M1 Metal Issue, Extropic's Probabilistic Hardware, VK_KHR_buffer_device_address, TinyBox Pro V2


DSPy ▷ #general (66 messages🔥🔥):

Accessing LLM in DSPy Modules, Switching LLMs with history transfer, Dspy Module documentation, Direct Interaction with OpenAI requests, Caching in DSPy


Latent Space ▷ #ai-general-chat (57 messages🔥🔥):

OpenAI Compute Strategy, Epoch AI critiques OSWorld AI computer-use benchmark, Butter-Bench for Evaluating LLM Controlled, Claude Code Web Credits, Windsurf Codemaps


Latent Space ▷ #ai-announcements (1 messages):

swyxio: new pod with <@367104793292046338> and <@194927177265840128> ! https://youtu.be/-gE1cesJF9M


Latent Space ▷ #genmedia-creative-ai (4 messages):

Hybrid AI, 3D Pipeline, 2026 Olympic Ad, AI Adoption


Yannick Kilcher ▷ #general (18 messages🔥):

Diffusion Model Inconsistency, Guidance Design in Diffusion Models, Improving Diffusion Sampling, Lévy Processes, Stochastic Interpolant Paper


Yannick Kilcher ▷ #paper-discussion (15 messages🔥):

Paper discussion scheduling, Crosscoder and circuit tracing research, LLM Flagship Destruction


Yannick Kilcher ▷ #ml-news (11 messages🔥):

Getty Images vs StabilityAI lawsuit, Guillotine Humor, Censorship on Discord


Manus.im Discord ▷ #general (17 messages🔥):

Manus Subscription Costs, Unauthorized Charges, Text to Video Tools, Twitter Webscraping, Hosting Services for Manus Apps


aider (Paul Gauthier) ▷ #general (8 messages🔥):

GPT-5 Access, Azure Credits Expiring, Aider's Future, Perplexity API Key, Model Testing


aider (Paul Gauthier) ▷ #questions-and-tips (7 messages):

ollama_chat/gpt-oss:20b reasoning effort, aider scripting capabilities, weak_model flag


Moonshot AI (Kimi K-2) ▷ #general-chat (13 messages🔥):

M2 vs GLM-4.6, Minimax as go-to AI, Kimi Emojis, Kimi iOS app


Eleuther ▷ #research (2 messages):

HuggingFace, Qwen models hallucinate long tail facts, IFEval


Eleuther ▷ #lm-thunderdome (1 messages):

Countdown Task, Adaptive Parallel Reasoning, lm-evaluation-harness PR


Eleuther ▷ #multimodal-general (2 messages):

Vision Language Models (VLMs), Vision Transformers, Positional Encodings


Windsurf ▷ #announcements (1 messages):

Codemaps, SWE-1.5, Sonnet 4.5, AI code understanding, Scaling productive output