Frozen AI News archive

not much happened today

**OpenAI** and **AWS** announced a strategic partnership involving a $38B compute deal to deploy hundreds of thousands of NVIDIA GB200 and GB300 chips, while **Microsoft** secured a license to ship NVIDIA GPUs to the UAE with a planned $7.9B datacenter investment. A 3-month NVFP4 kernel optimization competition on Blackwell B200s was launched by **NVIDIA** and GPU_MODE with prizes including DGX Spark and RTX 50XX GPUs. **vLLM** gains traction for local LLM serving, exemplified by PewDiePie's adoption. **Alibaba** previewed the Qwen3-Max-Thinking model hitting 100% on AIME 2025 and HMMT benchmarks, signaling advances in reasoning with tool use. The MIT-licensed MiniMax-M2 230B MoE model topped the Arena WebDev leaderboard, tying with Claude Sonnet 4.5 Thinking 32k. Critiques emerged on OSWorld benchmark stability and task validity. **LlamaIndex**'s LIGHT framework demonstrated significant improvements in long-term memory tasks over raw context and RAG baselines, with gains up to +160.6% in summarization at 10M tokens. **Amazon** introduced Chronos-2, a time-series foundation model for zero-shot forecasting. The MCP ecosystem expanded with new tools like mcp2py OAuth integration and Gemini Docs MCP server, alongside a build sprint by **Anthropic** and **Gradio** offering substantial credits and prizes. *"OSWorld doesn’t really exist—different prompt sets = incomparable scores"* highlights benchmarking challenges.

Canonical issue URL

a quiet day

AI News for 10/31/2025-11/3/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (199 channels, and 12068 messages) for you. Estimated reading time saved (at 200wpm): 1036 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

3rd "no news day" in a row. With only 1-2 more major model drops left for the rest of this year, it's become eerily quiet.

Bundle tickets and hotels for AIE CODE sell out soon!


AI Twitter Recap

Compute deals, hardware competitions, and serving infra

Reasoning LLMs, long-context memory, and benchmarks

Agent stacks, MCP ecosystem, and developer tooling

Training and systems engineering notes

Robotics: teleoperation now, autonomy later

Ecosystem and hiring

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Basketball Player Recognition Models

2. Google Gemma Model Controversy

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Linear Attention Mechanism Innovations

2. AI Industry Partnerships and Developments

3. AI Memes and Anecdotes


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The AI Agent & Developer Tooling Wars

Theme 2: Model Mayhem: Performance, Bugs, and Bold Claims

Theme 3: The Bleeding Edge of Hardware & Optimization

Theme 4: Platform Problems: From Pricing Puzzles to Privacy Panics

Theme 5: The Speculation Station: Market Trends and Future Gazing


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


LM Studio Discord


OpenRouter Discord


OpenAI Discord


Nous Research AI Discord


Cursor Community Discord


Modular (Mojo 🔥) Discord


GPU MODE Discord


Yannick Kilcher Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


Eleuther Discord


tinygrad (George Hotz) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


MCP Contributors (Official) Discord


DSPy Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1110 messages🔥🔥🔥):

Comet Browser Ads, Perplexity Partnership Payouts, Bounty program, Claude Max, Accessibility and AI


Perplexity AI ▷ #sharing (6 messages):

Czech Games, Web Development Mentorship, Handstand Pushups, Optimai Network


Perplexity AI ▷ #pplx-api (6 messages):

Perplexity API Cost, Perplexity API Pricing, Sonar Pro Search, Perplexity API Pro Search


LMArena ▷ #general (1020 messages🔥🔥🔥):

Minimax M2, Qwen 3 Max Thinking, Gemini 3, Sora 2, Video generation issues


LMArena ▷ #announcements (2 messages):

October Contest, WebDev Leaderboard, MiniMax-M2


LM Studio ▷ #general (229 messages🔥🔥):

AMD MI60 on Windows, Installers that play music, LM Studio 0.3.30 Crashing, Connecting ComfyUI with LM Studio, LM Studio and MCP Servers


LM Studio ▷ #hardware-discussion (855 messages🔥🔥🔥):

DDR4 RAM scaling, AI Bubble Burst Speculation, Nvidia Competitors, MI50 Windows Drivers, Airflow Optimization


OpenRouter ▷ #announcements (3 messages):

OpenRouter charts, Activity grouping, Filtering Options


OpenRouter ▷ #app-showcase (11 messages🔥):

Fun Website with API key, Frontend AI, OpenRouter Integration


OpenRouter ▷ #general (684 messages🔥🔥🔥):

GLM 4.6, DeepInfra quantization, Sapphira-L3.3-70b-0.1, OpenRouter Presets, OpenRouter embedding models


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (91 messages🔥🔥):

Qwen3 Max, STT -> LLM -> TTS, Video Models, GPT-5.1 Testing, Feedback buttons for models


OpenAI ▷ #ai-discussions (577 messages🔥🔥🔥):

Self-learning AI, Sora 2 Code Requests, AGI, AI Data Centers, AI Consciousness


OpenAI ▷ #gpt-4-discussions (38 messages🔥):

ChatGPT Go limits, GPT age verification, ChatGPT formatting issues, Guest Mode chats, custom GPT monetization


OpenAI ▷ #prompt-engineering (26 messages🔥):

Custom Kernels for Python, AI generated Michael Jackson videos, AI Personalities, Vibe Coding, Meta-Prompting


OpenAI ▷ #api-discussions (26 messages🔥):

Platform kernels, Prompt engineering, AI videos, Meta-prompt personas, Vibe coding


Nous Research AI ▷ #general (635 messages🔥🔥🔥):

FP16 training, BF16 vs FP16, Kolmogorov-Arnold Network, Model Benchmarking, AI Consciousness


Nous Research AI ▷ #ask-about-llms (4 messages):

Monad chatbot, DeepSeek v3, NVIDIA Nims, Inference engines


Nous Research AI ▷ #research-papers (3 messages):

LLMs report subjective experience, Emergent Introspective Awareness, Consciousness denial


Nous Research AI ▷ #interesting-links (5 messages):

Travel Blogs, Teknium's Blog


Nous Research AI ▷ #research-papers (3 messages):

LLMs, consciousness, self-reference, emergent awareness


Cursor Community ▷ #general (637 messages🔥🔥🔥):

Cursor Agent limitations, System Prompt structure, Student Verification issues, Legacy Pricing Transition


Cursor Community ▷ #background-agents (5 messages):

PR descriptions, Mobile Web UI, Background Agents, UTF8 support, Cursor plans


Modular (Mojo 🔥) ▷ #general (70 messages🔥🔥):

Spark iGPU PCIE GPU support, Heterogeneous compute DB engine, HDF5 rewrite in Mojo, UnsafePointer v2 proposal, LLMs bad at Mojo


Modular (Mojo 🔥) ▷ #mojo (322 messages🔥🔥):

Mojo Origins vs Rust Lifetimes, Mojo installation issues on M1 Mac, UnsafePointer issue in Mojo Lists, Mojo's native Python collections, GPU puzzles help


Modular (Mojo 🔥) ▷ #max (11 messages🔥):

MAX roadmap, Op Staging Time, ComfyUI Mojo benchmark


GPU MODE ▷ #general (20 messages🔥):

SOTA Claude model, CUDA lecture notes, Discord invite issues


GPU MODE ▷ #triton (12 messages🔥):

Source Attribution in Triton MLIR, nvfp4 Compilation Issues, triton_bwd Library, Gluon gl.load equivalent to Triton tl.load


GPU MODE ▷ #cuda (9 messages🔥):

FA4 implementation on RTX50, Nsight Compute Kernel Measurement, FP4e2m1 type missing, smem descriptors for tcgen05/wgmma


GPU MODE ▷ #torch (1 messages):

VRAM usage, torch CUDAGraphs, dynamo and inductor passes, OOM bug, dynamo graph size


GPU MODE ▷ #announcements (1 messages):

NVFP4 kernel optimization, NVIDIA Blackwell B200, CuTe DSL, CUTLASS 4.0, Dell Pro Max GB300


GPU MODE ▷ #algorithms (2 messages):

Hopper FP8 Implementation, Blackwell Quantized Kernels


GPU MODE ▷ #cool-links (1 messages):

Opportunistic Parallel Lambda Calculus, Opal Scripting Language, LLM Performance Optimization


GPU MODE ▷ #beginner (29 messages🔥):

Dusty's Retirement, Pip Index URL Correction, Performance Profiling Tools, CUDA Advent of Code Optimization, High Dimensional Probability and Neural Nets


GPU MODE ▷ #jax-pallas (1 messages):

Pallas:MGPU matmul kernel, NVLINK comms, all-gather collective matmul, all-to-all -> grouped GEMM


GPU MODE ▷ #torchao (16 messages🔥):

TorchAO FP8 quantization bug, GemLite Performance, Profiling Inference Optimization, MXFP/NVFP large batch sizes, cudagraphs


GPU MODE ▷ #metal (6 messages):

Metal for GPU programming, M5 chip Tensor API, Metal for iOS platforms, Torchao metal kernels, Metal talks by Nikita and Manuel


GPU MODE ▷ #self-promotion (2 messages):

K&R C Exercises, TensorDiagram Python Library


GPU MODE ▷ #thunderkittens (2 messages):

2cta matmul b200 performance, pipeline stalls, sparsity


GPU MODE ▷ #edge (3 messages):

Python Serving for Large Models, TorchScript Overhead, vLLM Custom Model API, torch.compile with reduced-overhead


GPU MODE ▷ #gpu模式 (3 messages):

Compute Limitations, Inference Optimizations, Chinese AI Community


GPU MODE ▷ #status (3 messages):

Nvidia competition submission portal, Submission via Discord bot


GPU MODE ▷ #hardware (10 messages🔥):

GPU prices, Neo Clouds vs Hyperscalers, NvLink bridges, Hyperscaler support, Voltage Park support


GPU MODE ▷ #tpu (2 messages):

Protobuf Size Limit, JIT Disabling Effects


GPU MODE ▷ #factorio-learning-env (6 messages):

pip install errors, RL work with Factorio, Sonnet 4.5 distillation, Qwen3-8b-VL-Thinking SFT


GPU MODE ▷ #amd-competition (2 messages):

a2a solution, theoretical throughput


GPU MODE ▷ #cutlass (10 messages🔥):

CUTE_DSL_LINEINFO, kernel_cutlass_kernel_flash_attncuteflash_fwdFlashAttentionForwardSm90, TiledCopy, make_layout_tv, raked_product and right_inverse


GPU MODE ▷ #low-bit-training (1 messages):

matt.pd: Defeating the Training-Inference Mismatch via FP16 https://arxiv.org/abs/2510.26788


GPU MODE ▷ #llmq (1 messages):

LLMQ, Python Bindings, Multi-threaded backend


GPU MODE ▷ #helion (14 messages🔥):

Helion Autotuning, Helion Performance, Determinism Control


GPU MODE ▷ #nvidia-competition (39 messages🔥):

Kernel Challenge, GPU Mode YouTube Channel, CUDA DSL Kernels, PMPP Book


Yannick Kilcher ▷ #general (101 messages🔥🔥):

Citations in research, AI-assisted research, Multi-head attention, Matrix vs Discord, Diffusion Model training inconsistency


Yannick Kilcher ▷ #paper-discussion (17 messages🔥):

Paper reading recordings, Linear Interpretable Feature Evolution, Analog vs Digital, Awesome World Models


Yannick Kilcher ▷ #agents (2 messages):

Advancing Agents, Reasoning, Memory, AGI discussions


Yannick Kilcher ▷ #ml-news (10 messages🔥):

AI bubble, Economic downturn, Weakening labor


Latent Space ▷ #ai-general-chat (110 messages🔥🔥):

Kimi CLI, Agent Mode, DeepAgents CLI, Poolside valuation, Redpanda Data AI


Latent Space ▷ #ai-announcements (1 messages):

swyxio: new pod with <@367104793292046338> and <@194927177265840128> ! https://youtu.be/-gE1cesJF9M


Latent Space ▷ #genmedia-creative-ai (5 messages):

X-Ware.v0, OpenAI X post, YouTube video


Moonshot AI (Kimi K-2) ▷ #general-chat (98 messages🔥🔥):

Kimi K2 Research & OK Computer Reset, K2 Think Model vs. Cerebras Confusion, Qwen QWQ Finetune, Minimax vs. GLM for Daily Tasks, Claude Code Max vs. Cursor Pro+ Usage Limits


Eleuther ▷ #general (10 messages🔥):

Multi-Head Attention, EleutherAI Contribution, Mentorship


Eleuther ▷ #research (14 messages🔥):

RL Collapse, Flash Attention, Gradient Normalization, LLM-RL Libraries, HF models hallucinating


Eleuther ▷ #interpretability-general (26 messages🔥):

Transformers on Sequence Space, End-to-End LLM Outputs, Mech Interp and Probing Work, LLM Privacy / Security Claims, Activation Sharing Risks


Eleuther ▷ #lm-thunderdome (1 messages):

MMLU Benchmark, Image Analysis


Eleuther ▷ #multimodal-general (2 messages):

VLM image description order, VLMs describing image collages


tinygrad (George Hotz) ▷ #general (45 messages🔥):

setup.py vs pyproject.toml, UOp.pyrender() bug, Tenstorrent backend bounty, Tiled matrix multiplication, PyTorch backend without hacks


Manus.im Discord ▷ #general (40 messages🔥):

Manus credit costs, Claude Code vs Manus, Manus image generation quality, Manus assistance with Instagram reels, Manus custom domain pricing


aider (Paul Gauthier) ▷ #general (33 messages🔥):

Brokk AI Power Ranking, Perplexity MCP, aider vs aider-ce, Aider Community, Entangled Pair Quantum Eraser


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

aider-ce, reasoning-effort, weak_model


MCP Contributors (Official) ▷ #general (21 messages🔥):

MCPB, OCI, DXT, MCP Registry, server.json


MCP Contributors (Official) ▷ #general-wg (7 messages):

SEP-1442, SEP-1686, Statelessness proposals, Task storage


DSPy ▷ #show-and-tell (1 messages):

DSPyGen, DSLModel, Contributions to DSPy


DSPy ▷ #general (19 messages🔥):

dspy.Tool with simple Predict, Rate limit issues for Gemini, DSCloj channel in DSPy, Force finish the ReAct, Accessing LM a module is using