Frozen AI News archive

GPT 5.1 in ChatGPT: No evals, but adaptive thinking and instruction following

**OpenAI** launched **GPT-5.1** with improvements in conversational tone, instruction following, and adaptive reasoning. **GPT-5.0** is being sunset in 3 months. ChatGPT introduces new tone toggles for personalization, serving over **800 million users**. **Waymo** rolls out freeway driving for public riders in major California cities, showcasing advances in autonomous driving. **Anthropic**'s Project Fetch explores LLMs as robotics copilots using **Claude**. **Perceptron** releases a new API and Python SDK for multimodal perception-action apps supporting **Isaac-0.1** and **Qwen3VL-235B**. **Code Arena** offers live coding evaluations supporting **Claude**, **GPT-5**, **GLM-4.6**, and **Gemini**. **LangChain** introduces middleware for agent governance with human-in-the-loop controls. **LlamaIndex** releases a structured extraction template for SEC filings using LlamaAgents. **NousResearch** promotes ARC Prize benchmarks for generalized intelligence evaluation.

Canonical issue URL

an incremental step.

AI News for 11/11/2025-11/12/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (201 channels, and 5148 messages) for you. Estimated reading time saved (at 200wpm): 423 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

GPT 5.1 launched in ChatGPT today, with API availability "later this week":

GPT5.0 move to a "legacy model", and will be sunset in 3 months.

There is mention of AIME and Codeforces, but no evals made it to this particular blogpost, which some are criticising.

ChatGPT also gets new tone toggles for personalization. Fidji Simo's blog says "with more than 800 million people using ChatGPT, we’re well past the point of one-size-fits-all."

Mobile app personalization screen showing different tone and style options for AI interactions, with "Quirky" currently selected as the base style.


AI Twitter Recap

Autonomy and Physical AI: Waymo freeway rollout, Anthropic’s Project Fetch, and Perceptron’s platform

Agent evals and control: Code Arena, LangChain middlewares, and LlamaIndex SEC agent

Systems and infra: cross-container covert channel, edge LM IPW harness, and inference infra

Model UX and product updates: Gemini Live, GPT‑5.1 persona, and AI privacy

Research and theory notes

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. AELLA Open-Science Initiative

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. GPT-5.1 Release and Features

2. AI in Personal Legal Success Stories

3. Creative AI Experiments


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview 05-20

Theme 1. Next-Gen AI Models Spark Hope and Frustration

Theme 2. Developer Tooling Navigates Complex AI Landscapes

Theme 3. Hardware Challenges Drive AI Performance Optimization

Theme 4. AI's Ethical Battlegrounds and Licensing Quandaries

Theme 5. Advancing LLM Research and Development Practices


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Cursor Community Discord


GPU MODE Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


OpenAI Discord


LM Studio Discord


Eleuther Discord


Nous Research AI Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


DSPy Discord


HuggingFace Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Windsurf Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1258 messages🔥🔥🔥):

Riftrunner vs other models, GPT 5.1 benchmarks, Gemini 3 Pro speculation, AI model sycophancy, Riftrunner Game Development


LMArena ▷ #announcements (1 messages):

Code Arena, WebDev Arena, LMArena Leaderboard


Perplexity AI ▷ #general (670 messages🔥🔥🔥):

Perplexity referral program, Gemini 2.5, GPT-5 mini vs GPT-5, Comet issues, Comet for Android


Perplexity AI ▷ #sharing (3 messages):

Sourcify Event, Open Source, Forks, PRs and Chaos, Threads Shareable


Cursor Community ▷ #general (534 messages🔥🔥🔥):

Cursor IDE Cost Awareness, Cursor vs Copilot preference, Exploiting ChatGPT, Cursor 'Max' Mode, Cursor Rules


GPU MODE ▷ #general (10 messages🔥):

popcorn-cli syntax, status 400 grayscale, CPU-focused community, leaderboard/eval selection, export command quotes


GPU MODE ▷ #cuda (18 messages🔥):

CUDA compiler options, warp tiling, TMEM allocation, PTXAS optimization, Mutex locking via TMEM


GPU MODE ▷ #jobs (1 messages):

HippocraticAI Hiring, LLM Inference Engineer Role, CUDA/CUTLASS/Triton expertise, NVIDIA B200s, AMD MI355, and Google TPUs


GPU MODE ▷ #beginner (4 messages):

Atomic Max for FP32, PTX Documentation Inaccuracy


GPU MODE ▷ #off-topic (3 messages):

Skunk on a car, Dune Meme


GPU MODE ▷ #rocm (3 messages):

hipkittens, image0.jpg


GPU MODE ▷ #self-promotion (5 messages):

hipkittens, FSDP Implementation, AMD Open Source AI Week Recap


GPU MODE ▷ #🍿 (4 messages):

Building from source, Nvidia competition submissions


GPU MODE ▷ #thunderkittens (2 messages):

Hipkittens launch, Other Kittens on X


GPU MODE ▷ #submissions (66 messages🔥🔥):

Leaderboard Submissions, GEMV Cheating Accusations, Benchmark Input Sizes


GPU MODE ▷ #cutlass (1 messages):

kinming_32199: impressive animation👍


GPU MODE ▷ #general (2 messages):

GPU MODE Leaderboard, Submission process


GPU MODE ▷ #multi-gpu (1 messages):

DMA Documentation, RDMA Documentation, Wikipedia, ChatGPT, Vendor websites


GPU MODE ▷ #helion (9 messages🔥):

Triton Errors, Auto Skip Triton Errors, Helion Configs BC-compatible


GPU MODE ▷ #nvidia-competition (212 messages🔥🔥):

Cutlass and CuTe 4.3.0, Popcorn-cli submission errors, Kernel global caching allowed, CuTe DSL is not requirement, torch doesn't support sm100


GPU MODE ▷ #xpfactory-vla (3 messages):

RLinf, Qwen3-VL VLA-adapter training


Unsloth AI (Daniel Han) ▷ #general (179 messages🔥🔥):

VibeVoice finetuning for Bulgarian, Intel autoround quants vs BNB 4-bit quants for training, QAT in Unsloth, MoE models and output quality, Aquif-3.5-Max-42B-A3B


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Fan encounter, Identity


Unsloth AI (Daniel Han) ▷ #off-topic (79 messages🔥🔥):

ONNX Runtime, Free Threading, Translation Model Prompt, Context Loss in SLMs, GPT-5-1 Em Dashes


Unsloth AI (Daniel Han) ▷ #help (37 messages🔥):

Low loss architecture, Broken Ollama links in docs, Dependency issues finetuning VibeVoice, Training script for GPT-OSS models locally, Nonsense generations from GPT-OSS-20b


OpenRouter ▷ #announcements (1 messages):

MiniMax M2, Paid Endpoint Migration


OpenRouter ▷ #general (268 messages🔥🔥):

OpenRouter API Issues, Gemini 3 Speculation, Free Model Scarcity, Local AI Hardware Recommendations, OpenRouter Chat Functionality


OpenRouter ▷ #discussion (3 messages):

Search Box UI, GPT Reasoning


OpenAI ▷ #annnouncements (4 messages):

NYT vs OpenAI, Free ChatGPT Plus, GPT-5.1 Release


OpenAI ▷ #ai-discussions (182 messages🔥🔥):

GitHub classifies Gemini 2.5 Pro vs ChatGPT, AI Chatbot Infestation on Social Media, GPT-5.1 vs GPT-5 vs GPT-4o, AI and Job Market, Sora 2


OpenAI ▷ #gpt-4-discussions (21 messages🔥):

GPT-5.1, Model preference, downgrading models


OpenAI ▷ #prompt-engineering (13 messages🔥):

Database re-attachment, Prompt engineering jobs, Sora issues, Prompt engineering lessons


OpenAI ▷ #api-discussions (13 messages🔥):

Prompt Engineering Jobs, Reattaching Databases, Prompt Engineering Tips, Sora Prompting


LM Studio ▷ #general (95 messages🔥🔥):

LM Studio MacOS Admin Privileges, Lightweight Chat Model Recommendations, Gemini 2.5 Pro vs Sonnet 4.5, LM Studio Hub Search Issues, MCP Resource Support in LM Studio


LM Studio ▷ #hardware-discussion (89 messages🔥🔥):

GPU memory distribution, Vulkan vs CUDA performance/stability, Driver issues and BSODs, Hardware troubleshooting (VRAM), Context length and VRAM usage


Eleuther ▷ #general (41 messages🔥):

Numpy vs Einops, MLE interviews, Implementing Multi-Head Attention, Interpretability and VLMs


Eleuther ▷ #research (75 messages🔥🔥):

Zyda-2, ClimbLab, Nemotron-CC-v2, Complex Values Attention, NVIDIA Dataset License


Eleuther ▷ #interpretability-general (22 messages🔥):

Concept Probes Training, Divergent Concepts, Model's Internal Activations, Probabilistic Polysemantic System, Class Distribution and Accuracy


Eleuther ▷ #lm-thunderdome (10 messages🔥):

Summarization Task Evaluation, lm-eval-harness, XSum Dataset Evaluation, UnitXT Integration


Nous Research AI ▷ #general (55 messages🔥🔥):

Autonomous AI accident, WeiboAI Model, Baguettotron benchmark, Importing GGUF files into Nous Chat


Nous Research AI ▷ #ask-about-llms (78 messages🔥🔥):

GGUF files, Nous Chat, local AI, Ollama, Computer Science Degree


Nous Research AI ▷ #interesting-links (2 messages):

ixlinx-8b, SOTA small model, local hackathon


Latent Space ▷ #ai-general-chat (109 messages🔥🔥):

RL envs, Windsurf Next's stealth models, OpenAI metrics, Spatial Intelligence as AI’s Next Frontier, Character.AI’s Kaiju model design for speed


Latent Space ▷ #genmedia-creative-ai (5 messages):

Magic Patterns 2.0, AI Design Tool, Series A Funding


Modular (Mojo 🔥) ▷ #general (34 messages🔥):

Dynamic Type Reflection in Mojo, Error Handling: try-catch vs. Monadic, Mojo Metaprogramming, C-FFI Pain Points, Public and Private Members


Modular (Mojo 🔥) ▷ #mojo (49 messages🔥):

Optional Mutability, MOJO_PYTHON_LIBRARY standardization, Metal Compiler failing, comptime Bird


DSPy ▷ #show-and-tell (1 messages):

Taxonomy Creation, Structured Generation


DSPy ▷ #general (68 messages🔥🔥):

DSPy vs Prompting, Signatures vs Prompts, GEPA optimization, Agentic Search with DSPy, Complex systems with DSPy


HuggingFace ▷ #general (56 messages🔥🔥):

ZeroGPU problems, Reuben banned, Custom loss function with SFTTrainer, Video cutting with AI, Audio tokens in multimodal LLMs


HuggingFace ▷ #today-im-learning (1 messages):

quantumharsh: from where your are learning machine learning


HuggingFace ▷ #cool-finds (1 messages):

Render times, Progress labels


HuggingFace ▷ #i-made-this (3 messages):

Tokenflood Load Testing Tool, SMOLTRACE Benchmarking Framework, SmolVLM Blogpost


HuggingFace ▷ #computer-vision (1 messages):

ConvNeXt-Tiny Model, Model Architectures, Computer Vision Tasks


HuggingFace ▷ #NLP (4 messages):

Random Data Generation, PII Detection and Randomization, Data Cleaning Techniques


HuggingFace ▷ #gradio-announcements (1 messages):

MCP 1st Birthday, Anthropic, Gradio, AI hackathon, API credits


HuggingFace ▷ #agents-course (1 messages):

Study Group


Moonshot AI (Kimi K-2) ▷ #general-chat (63 messages🔥🔥):

Researcher mode errors, Kimi Coding Plan API Quota, Kimi API setup, Kimi K2 Thinking vs GLM 4.6, GPT 5.1 rollout


Yannick Kilcher ▷ #general (32 messages🔥):

Semantic shifts in language, Google Colab vs Lambda Labs, FID scores for DiT models, ICLR review process, Whisper model usage


Yannick Kilcher ▷ #paper-discussion (8 messages🔥):

Kimi K2, Thinkprm, Indian Names, Memorization to Reasoning


Yannick Kilcher ▷ #ml-news (8 messages🔥):

Elevenlabs Speech to Text, GPT-5 Release?


MCP Contributors (Official) ▷ #general (18 messages🔥):

timezone information MCP clients to MCP servers, SEP draft for timezone, Anthropics Claude desktop host team, connectivity issues between Claude.ai and MCP Servers, JSON data from mcp tool call results


Manus.im Discord ▷ #general (14 messages🔥):

AI Automation Integration, Spanish Language Section Suggestion, Generative Engine Optimization, Manus System Error, Manus Support Channel Closure


aider (Paul Gauthier) ▷ #general (13 messages🔥):

Code Snippets in Markdown Files with Aider, Aider Conventions Configuration, Aider Vim Mode, aider-ce and Session Management, Aider Development Status


tinygrad (George Hotz) ▷ #general (4 messages):

OpenCL errors, package_data