Frozen AI News archive

not much happened today

**GPT-5** leads Sudoku-Bench solving 33% of puzzles but 67% remain unsolved, highlighting challenges in meta-reasoning and spatial logic. New training methods like **GRPO fine-tuning** and "Thought Cloning" show limited success. Research on "looped LLMs" suggests pretrained models benefit from repeated computation for better performance. **Baidu's ERNIE-4.5-VL-28B-A3B-Thinking** offers lightweight multimodal reasoning with Apache 2.0 licensing, outperforming **Gemini-2.5-Pro** and **GPT-5-High** on document tasks. **Databricks ai_parse_document** preview delivers cost-efficient document intelligence outperforming GPT-5 and Claude. **Pathwork AI** uses **LlamaCloud** for underwriting automation. **Gemini File Search API** enables agentic retrieval augmented generation (RAG) with MCP server integration. **Together AI** and **Collinear** launch **TraitMix** for persona-driven agent simulations integrated with **Together Evals**. Reports highlight risks in long-running code agents like **Claude Code** reverting changes, emphasizing guardrails. Community consensus favors multiple code copilots including Claude Code, Codex, and others.

Canonical issue URL

a quiet day

AI News for 11/10/2025-11/11/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (201 channels, and 5180 messages) for you. Estimated reading time saved (at 200wpm): 465 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

AIE CODE Side Events listing are now up: https://ai.engineer/code#events

If you're in NYC, you can attend any of these without an AI Engineer ticket! Enjoy.


AI Twitter Recap

Reasoning benchmarks and training techniques

Multimodal and document intelligence

Agents, retrieval, and production strategy

Open data, models, and tools

Systems, kernels, and robotics

Safety, consent, and platform quality

On-device multimodal models

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. VibeThinker 1.5B Model and Benchmark Performance

2. Egocentric-10K Dataset Launch

3. GPT-OSS-120B on Cerebras Satirical Analysis

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. AI-Generated Content and Detection

2. AI Model and Tool Innovations

3. Humorous AI Memes and Content


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The AI Model Arms Race Heats Up

Theme 2: Performance Tuning, Hardware Battles, and Framework Philosophies

Theme 3: Framework Frustrations and Persistent Bugs

Theme 4: AI Applications, User Experience, and Ethical Quandaries

Theme 5: The Data-Driven Frontier of Training and Interpretability


Discord: High level Discord summaries

Perplexity AI Discord


GPU MODE Discord


LMArena Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


LM Studio Discord


OpenAI Discord


Yannick Kilcher Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


HuggingFace Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


tinygrad (George Hotz) Discord


Manus.im Discord Discord


DSPy Discord


aider (Paul Gauthier) Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (917 messages🔥🔥🔥):

Perplexity Referral Program Controversy, Comet Browser Issues, Perplexity Pro Limits, Fraudulent Activity Accusations, Sonnet 4.5 Model Bug Fix


Perplexity AI ▷ #sharing (2 messages):

The Orbits, Debut Single Release, Shareable Threads


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity API, Python SDK exception handling


GPU MODE ▷ #general (1 messages):

_therealpilot: Anyone here attending Neurips in San Diego?


GPU MODE ▷ #cuda (19 messages🔥):

Ampere GEMM tricks, Cutlass examples, smem->rmem pipelining, CUDA compiler options, Warptiling


GPU MODE ▷ #beginner (7 messages):

tinygrad bounties, async loads without TMA, atomic max for float in CUDA


GPU MODE ▷ #torchao (8 messages🔥):

TorchAO MXFP8 MOE, Activation Checkpointing in TorchTitan, GB200 Cluster Performance, Llama4 Scout Optimization


GPU MODE ▷ #off-topic (11 messages🔥):

Multiple Monitors, Vertical Dual Monitors, Monitor Resolutions


GPU MODE ▷ #rocm (2 messages):

hipkittens, Cool Cats, Social Media Shares


GPU MODE ▷ #intel (7 messages):

Intel GPUs, Bank Conflicts, Shared Local Memory (SLM)


GPU MODE ▷ #self-promotion (5 messages):

Penny Faster Than NCCL, VoxCPM Text-to-Speech CoreML Port, Hipkittens AI Hack


GPU MODE ▷ #avx (5 messages):

AVX2 benefits, tiktoken regex


GPU MODE ▷ #🍿 (1 messages):

Popcorn-cli, WSL, GLIBC_2.39


GPU MODE ▷ #thunderkittens (3 messages):

TK support on B200


GPU MODE ▷ #submissions (138 messages🔥🔥):

NVIDIA Leaderboard, Caching values, LLMs Moral Compass


GPU MODE ▷ #status (2 messages):

Benchmark Caching, CUDA Streams


GPU MODE ▷ #hardware (3 messages):

TPU database upkeep, Volunteer-based databases


GPU MODE ▷ #factorio-learning-env (3 messages):

Factorio VSCode Extension, Factorio Modding Source Access


GPU MODE ▷ #cutlass (2 messages):

Nvidia GEMM Kernel competition, CUTLASS FP8 attention example on Blackwell


GPU MODE ▷ #helion (26 messages🔥):

Autotuning Helion Kernels, AOT Kernel Wrapper, Persistent Kernels, Limiting Autotuning Search Space, Timeout Slow Kernels


GPU MODE ▷ #nvidia-competition (774 messages🔥🔥🔥):

Triton Do Bench, NVFP4 Optimization, CUDA Graphs, Model Load, Cutlass 4.3


GPU MODE ▷ #xpfactory-vla (8 messages🔥):

Google Paper, Microsoft VITRA, ContextVLA, VLA-Adapter with Qwen3-VL


LMArena ▷ #general (756 messages🔥🔥🔥):

Gemini 3 release date, Nano Banana 2 Release, Viper model identity (Grok), AI generated images in schools, OpenAI censorship


LMArena ▷ #announcements (1 messages):

User Login, Email Login


Cursor Community ▷ #general (391 messages🔥🔥):

Cursor Agents View, Cursor speed issues on Intel MBP, Claude 4.5 API errors, Cursor indexing home directory, Cursor crashing on Windows


Cursor Community ▷ #background-agents (3 messages):

environment.json, Cloud Agents API, Slack integration, Repo dependencies


Unsloth AI (Daniel Han) ▷ #general (180 messages🔥🔥):

Unsloth's dynamic 2.0 quant models with vLLM, Fine-tuning Kimi K2 model, Ring T1 model in the Unsloth Zoo, Unsloth GGUF vs Qwen GGUF, Custom loss function


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Discord fan, Little one


Unsloth AI (Daniel Han) ▷ #off-topic (88 messages🔥🔥):

LLM Efficiency, Dataset size vs model size, Data quality importance, Perplexity checks for data grooming, Effective batch size tuning


Unsloth AI (Daniel Han) ▷ #help (53 messages🔥):

Fine-tuning Llama 3 for script writing, Dataset size for fine-tuning, Chat template issues, Unsloth documentation on fine-tuning, Continued pre-training


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

FameSumm Implementation, Medical Term Logit Penalization


OpenRouter ▷ #announcements (1 messages):

MiniMax M2, Paid Endpoint Migration, OpenRouter Announcements


OpenRouter ▷ #app-showcase (15 messages🔥):

Smol AI, Anti-bot measures, News Scraper


OpenRouter ▷ #general (268 messages🔥🔥):

Output price filtering for models, Gemini latest news, Minecraft chat moderation, GPT-5 pricing, OpenRouter account with $10 billing


OpenRouter ▷ #discussion (15 messages🔥):

Meta AI's Low-Coverage Language Project, Data Acquisition for 500 Languages, OpenRouter Search Bar Removal, UI/UX choices


LM Studio ▷ #general (44 messages🔥):

AWS for LLM hosting, Low cost private LLM hosting, LM Studio plugin for external LLMs, LM Studio admin privileges on macOS


LM Studio ▷ #hardware-discussion (186 messages🔥🔥):

AMD GPU vs Nvidia GPU performance, CUDA vs Vulkan performance differences, Multi-GPU setups for local LLMs, Model routing for efficient LLM usage, Power requirements for multi-GPU rigs


OpenAI ▷ #ai-discussions (114 messages🔥🔥):

Sora 3 Status, D.A.T.A™ Avatar, AI coding shortfalls, AI Smartwatch game, Gemini 2.5 Pro vs GPT-5


OpenAI ▷ #gpt-4-discussions (40 messages🔥):

GPT models training limitations, GPT-4.1 vs GPT-5 rerouting, Alternative AI companies, Teen safety and privacy changes, Sora2 invite code


OpenAI ▷ #prompt-engineering (11 messages🔥):

Centralized interactive tool development, API features for custom GPTs, Verbatim quotes and citations


OpenAI ▷ #api-discussions (11 messages🔥):

Lock Verbatim of Comments, External System API Features for Custom GPTs, Prompt Engineering Jobs


Yannick Kilcher ▷ #general (121 messages🔥🔥):

Physics Informed Neural Nets (PINNs), Researcher Communication Strategies, JSON Data Model, Paper Selection Filters, Self Attention Scaling


Yannick Kilcher ▷ #paper-discussion (4 messages):

Reproducibility, ThinkPRM, Kimi K2


Yannick Kilcher ▷ #ml-news (10 messages🔥):

PromptFlux Malware, Yann LeCun Leaving Meta, Country Songs Analysis


Nous Research AI ▷ #general (124 messages🔥🔥):

UI feedback, Neurips Meetup, AWS Reinvent, Pretraining on Hermes, Peer Review Application


Nous Research AI ▷ #ask-about-llms (1 messages):

Salience full tour


Nous Research AI ▷ #interesting-links (2 messages):

GradientHQ Parallax, Live demo of Parallax


Modular (Mojo 🔥) ▷ #general (24 messages🔥):

Mojo vs Rust, Mojo phase 3, Mojo use cases, Mojo replacing C++, Mojo and dynamic type reflection


Modular (Mojo 🔥) ▷ #mojo (85 messages🔥🔥):

Implicit conversion to ref/mut, Raw vs Unsafe, GPU compilation error, Python 3.14


Eleuther ▷ #general (22 messages🔥):

Cost-effective hardware for AI training, TPUs vs physical hardware, Implementing multi-headed attention from scratch


Eleuther ▷ #research (29 messages🔥):

DCLM dataset, Zyda-2 dataset, Nemotron-CC-v2 dataset, RWKV datasets, Pretraining datasets


Eleuther ▷ #interpretability-general (25 messages🔥):

Concept Probes for Interpretability, Divergent Internal Concepts, Real-Time Concept Detection


HuggingFace ▷ #general (40 messages🔥):

Learning Python with LLMs, Out of Memory Errors with Diffusers, HF Spaces failing with io.EOF error, HF Responses API, Multi Headed Attention interview questions


HuggingFace ▷ #i-made-this (16 messages🔥):

SUP Toolbox, Muon Optimizer, NVIDIA's document layout detect model


HuggingFace ▷ #core-announcements (1 messages):

Diffusers MVP program


HuggingFace ▷ #NLP (3 messages):

Random data generation, PII Randomization


Latent Space ▷ #ai-general-chat (49 messages🔥):

SYNTH dataset, Meta Omnilingual ASR, Moonshot AI Kimi AMA, Gamma's Series B, Meta's GEM model


Latent Space ▷ #genmedia-creative-ai (10 messages🔥):

AI Progress, Magic Patterns 2.0, Series A Funding


Moonshot AI (Kimi K-2) ▷ #general-chat (50 messages🔥):

Opencode interleaved thinking, Kimi-cli auto start thinking mode, Moonshot Inference Cluster Slowdown, Kimi Coding Plan API Quota, Bug Report Guidelines


tinygrad (George Hotz) ▷ #general (20 messages🔥):

hatch vs setuptools, pyproject.toml, package data, ChatGPT suggestions for setup


tinygrad (George Hotz) ▷ #learn-tinygrad (13 messages🔥):

M4 Mac segfaults, torch private buffer issue, Tensor from URL


Manus.im Discord ▷ #general (17 messages🔥):

PowerPoint presentations with company brand, Manus invite, Publish button issues, AI engineer introductions, Mini AGI


DSPy ▷ #show-and-tell (2 messages):

DSPyMator Launch, Taxonomy Creation Blogpost


DSPy ▷ #general (11 messages🔥):

Transfer Learning, GEPA Prompting with Strong Models, Optimizing OCR pipeline with GEPA, Saving/Loading DSPy Modules


aider (Paul Gauthier) ▷ #general (9 messages🔥):

aider-ce improvements, webspp UI integration with aider-ce, Merge editor with conflict resolution, Code Snippets in MD Files


aider (Paul Gauthier) ▷ #questions-and-tips (2 messages):

LLM Preprocessing Scripts, LLM Summarization Scripts, LLM Planning


aider (Paul Gauthier) ▷ #links (2 messages):

Aider-CE Agent Mode, Chrome Devtools Integration, Context7


MCP Contributors (Official) ▷ #mcp-dev-summit (2 messages):

MCPConference Paris, Declarative Agents, Evals