Frozen AI News archive

not much happened today

**Claude Code Skills** gains attention with a published talk and Hugging Face's new "skill" enabling one-line fine-tuning pipelines for models from ~0.5B to 70B parameters, supporting SFT, DPO, and GRPO, costing as low as ~$0.30 for small runs. **Zhipu AI** launches multimodal models **GLM-4.6V** (106B params MoE) and **GLM-4.6V-Flash** (9B dense), featuring 128k context and native multimodal function calling, with free Flash variant and API pricing detailed. **Jina AI** releases **Jina-VLM (2B)**, a compact multilingual VLM excelling in diagrams and documents with top benchmark scores. At **NeurIPS 2025**, research highlights include Google's post-Transformer sequence architectures (Moneta, Yaad, Memora) showing up to 20% gains in long-context retrieval, **AxiomProver**'s autonomous Lean system solving 9/12 Putnam 2025 problems rapidly, and mechanistic interpretability advances discussed by Chris Olah emphasizing scalable tooling.

Canonical issue URL

a quiet day

AI News for 12/5/2025-12/8/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 16871 messages) for you. Estimated reading time saved (at 200wpm): 1319 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of excitement about Claude Code Skills, which now has a published talk, being able to finetune AI models.


AI Twitter Recap

Automating open LLM training with Claude Code + Hugging Face Skills

New multimodal models: Zhipu’s GLM‑4.6V and Jina‑VLM

NeurIPS 2025 research signals: new sequence architectures, interpretability, and formal methods

Agents in practice: evaluation, reliability, and knowledge grounding

Infrastructure and serving: open stacks and systems updates

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. GLM-4.6V Model Releases and Features

2. RAM Price Surge and OpenAI's Influence

3. Local LLM Builds and Vector Database Comparisons

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. GITAI Space Robotics and Lunar Base Feasibility

2. Nano Banana and Z-IMG Model Innovations

3. AI Predictions and Humorous AI Memes


AI Discord Recap

A summary of Summaries of Summaries by Gemini 3.0 Pro Preview Nov-18

Theme 1. Hardware wars: DRAM shortages, Blackwell quirks, and AMD contenders

Theme 2. Model Evaluation: Reasoning prowess, vision failures, and small-model wins

Theme 3. Developer Ecosystem: Broken APIs, new adapters, and framework struggles

Theme 4. Application Nightmares: Billing scams, bugs, and ban hammers

Theme 5. Research & Security: Fake papers, jailbreaks, and prize winners


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


BASI Jailbreaking Discord


LM Studio Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


OpenAI Discord


Eleuther Discord


GPU MODE Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


HuggingFace Discord


DSPy Discord


Yannick Kilcher Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1410 messages🔥🔥🔥):

AI Image Generation Costs, Nano Banana Flash's potential, Veo 4 video model, Algorithm Blessing, GPT-5.2 Speculation


Perplexity AI ▷ #general (1279 messages🔥🔥🔥):

Claude Opus overspending, Comet degradation, YouTube Recap, Gemini Pro lacking features, bypass limits


BASI Jailbreaking ▷ #general (802 messages🔥🔥🔥):

GPT-5 System Prompt Leak, Enterprise Claude Jailbreak, Project Genesis Concerns, Twitter Blue Check Hysteria, geopolitics


BASI Jailbreaking ▷ #jailbreaking (531 messages🔥🔥🔥):

Gemini 3 Jailbreak, UltraBr3aks Special Token Jailbreak, Deepseek Jailbreak, Claude Jailbreak, Grok Jailbreak


BASI Jailbreaking ▷ #redteaming (59 messages🔥🔥):

AI Red Teaming Tools, ChatGPT Jailbreak Prompt Revision, Agentic AI system security, OSINT of LLM models


LM Studio ▷ #general (942 messages🔥🔥🔥):

7900xtx, LM Studio Discord bot, Model Merging, Qwen 3 4b, Vulkan vs ROCm


LM Studio ▷ #hardware-discussion (687 messages🔥🔥🔥):

Micron consumer division shutdown impact on OSS models, Thunderbolt/USB PCIe adapter for extra GPU, Best AMD GPU for AI, GPU for code review suggestions, Kimi K2 Thinking AI model quality


Cursor Community ▷ #general (872 messages🔥🔥🔥):

VPN issues with Cursor, Shadow Workspace Creation, Sonnet ignoring project rules, Approval button fix, GPT model issues


Unsloth AI (Daniel Han) ▷ #general (671 messages🔥🔥🔥):

DDP guide, Unsloth Reddit, 4-bit or 8-bit fine tuning, Unsloth and Autoround, Mistral Large 3 GGML


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

CS Grad AI journey, Local Model Game Creation


Unsloth AI (Daniel Han) ▷ #off-topic (414 messages🔥🔥🔥):

RTX 5090, YankoviC, DeepSeek V3.2, React Security Vulnerabilities, WSL networking


Unsloth AI (Daniel Han) ▷ #help (183 messages🔥🔥):

Unsloth NaN bug, GGUF quants i1, VRAM usage, Ministral 3 models, dequantising 4-bit safetensors


Unsloth AI (Daniel Han) ▷ #research (77 messages🔥🔥):

EleutherAI server, Open datasets for training (OLMo), Synthetic data legality, HF Skills Training: automated dataset and model selection, Distillation with synthetic datasets


OpenRouter ▷ #announcements (1 messages):

Multi-Model Agents, Body Builder API, OpenRouter API


OpenRouter ▷ #general (366 messages🔥🔥):

Looksmaxxing mortality, OpenRouter Bug, Deepseek Versions, OpenRouter Account Compromised, OpenRouter on Chrome for Android Issue


OpenRouter ▷ #new-models (3 messages):

``


OpenRouter ▷ #discussion (87 messages🔥🔥):

Qwen3 TTS Update, Google Cloud TPUs, Gemini 2.5 Flash TTS, Narrator's Natural Voices, Grok 4.2 Stealth Release


OpenAI ▷ #ai-discussions (398 messages🔥🔥):

Sora Video Generation, Gemini 3 Pro vs ChatGPT-5.1 Codex, AI Ethics and Legal Compliance with Sora, AI and Humor Understanding, AI models for triangle counting


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

Chat splitting for complex projects, ChatGPT word limit issues, Deep Research in ChatGPT via API, GPT-4o-mini-TTS model issues, Breaking Bad roleplay


OpenAI ▷ #prompt-engineering (19 messages🔥):

GPT-5.1 vs Claude vs Gemini, Posture Persistence Experiment, Structural Synthesis, Differential Field, Stability Index


OpenAI ▷ #api-discussions (19 messages🔥):

Posture Persistence Experiment, GPT-5.1 vs Claude vs Gemini, Synapse-Lite, Structural Synthesis, Differential Field


Eleuther ▷ #general (220 messages🔥🔥):

Strategies for Reading Research Papers, Learning ML Efficiently, New Members, Interpretability with SGLang, ArXiv Endorsement


Eleuther ▷ #research (119 messages🔥🔥):

Sinusoidal Init, Adam analysis, Generalization, Muon-trained Model, Video Generation


Eleuther ▷ #interpretability-general (2 messages):

Task Optimized KV Caches, Task Optimized LoRAs


Eleuther ▷ #lm-thunderdome (4 messages):

Qwen3, anthropic


GPU MODE ▷ #general (17 messages🔥):

Multiple Mac Studios, B200 latency, Moore Threads, ML Infra


GPU MODE ▷ #cuda (39 messages🔥):

FP4, PTX 9.1, Async Sharp Operations, TileGym Autotuner, tcgen05.mma


GPU MODE ▷ #torch (1 messages):

Symmetric Memory, CUDA error: an illegal memory access, Distributed training issues


GPU MODE ▷ #cool-links (2 messages):

TTE-TPU, SGLang


GPU MODE ▷ #jobs (3 messages):

.NET Migration, PyTorch Operators, X.AI GPU Kernels


GPU MODE ▷ #pmpp-book (9 messages🔥):

CUDA API Docs, Book 3rd vs 4th edition differences, Book purchasing difficulties


GPU MODE ▷ #off-topic (1 messages):

jaefosho: Very Eastern European


GPU MODE ▷ #irl-meetup (1 messages):

szymonoz: I'll be in SF this week, anyone from the Bay up for a meetup?


GPU MODE ▷ #rocm (10 messages🔥):

AMD, MI355X, Strix Halo, RDNA 3.5, Linux


GPU MODE ▷ #tilelang (2 messages):

GB300 CUDA Cores


GPU MODE ▷ #liger-kernel (1 messages):

jaefosho: How does one break into the ml infra space?


GPU MODE ▷ #self-promotion (4 messages):

CUDA 13.1, CUDA Tile, Distributed Training, QuintNet, 3D Parallelism


GPU MODE ▷ #🍿 (1 messages):

Nvidia DRIVE AGX Thor, Kernel Optimization, Torch Models


GPU MODE ▷ #thunderkittens (1 messages):

Megakernel Implementation, Batched Llama Official, Instruction Generator Script, Blog Post Timings


GPU MODE ▷ #submissions (39 messages🔥):

nvfp4_gemm Leaderboard Updates, sort_v2 Leaderboard Domination, prefixsum_v2 Leaderboard Sweep


GPU MODE ▷ #factorio-learning-env (6 messages):

Factorio AI Development, Factorio-Learning-Environment Project, Moby 2.0 and CS Majors


GPU MODE ▷ #general (1 messages):

Image Analysis Achievements, Image Displayed


GPU MODE ▷ #llmq (2 messages):

pypi, Cross-entropy skipping, Chunked softmax calculation, CUDNN Workspace Chunking


GPU MODE ▷ #nvidia-competition (54 messages🔥):

CuTeDSL Talk, CuTile vs CuTeDSL, Modal and NCU, B200 Blockscaled GEMM, popcorn-cli Submissions


GPU MODE ▷ #robotics-vla (15 messages🔥):

behavior-1k, RoboCOIN, mobile bimanual tasks, VLA for behavior-1k


Nous Research AI ▷ #general (168 messages🔥🔥):

Pre-training sets ablations, Hermes 4.3 AWQ Quants, Consilience-40B Replacement, Multimodal RL Training, Video Game 3D Map Generation


Nous Research AI ▷ #ask-about-llms (7 messages):

Hermes 4.3, Sonnet 4.5, Llama.cpp prompt template


Nous Research AI ▷ #research-papers (1 messages):

theguywhogamesalot: Reminds me of a bit of Nvidia's Method:

https://arxiv.org/abs/2510.01265


Nous Research AI ▷ #interesting-links (15 messages🔥):

Humble Bundle, O'Reilly Book Packages, Langchain


Nous Research AI ▷ #research-papers (1 messages):

theguywhogamesalot: Reminds me of a bit of Nvidia's Method:

https://arxiv.org/abs/2510.01265


Modular (Mojo 🔥) ▷ #general (3 messages):

YouTube Live Stream, Video Upload Delay


Modular (Mojo 🔥) ▷ #announcements (4 messages):

MAX framework, Model API, Mojo Meetup, MMMAudio, Shimmer


Modular (Mojo 🔥) ▷ #mojo (143 messages🔥🔥):

Mojo compiler bugs, Lightbug status, Cutile relevance, MMMAudio presentation, ImplicitlyCopyable


Modular (Mojo 🔥) ▷ #max (16 messages🔥):

Bazel Integration, MAX API in Mojo, Heterogeneous CPU + GPU Graph Processing, Parametric traits and conditional conformance


HuggingFace ▷ #general (115 messages🔥🔥):

Zero GPU spaces, building a server capable of running 120b oss gpt model, huggingface pro billing question, Binary convolutional neural network, LLM compliance ruleset


HuggingFace ▷ #today-im-learning (1 messages):

Minimax M2, Claude Credits


HuggingFace ▷ #cool-finds (1 messages):

HRM vs TRM models, LLM compute costs, LLM environmental impact, LLM Alternatives


HuggingFace ▷ #i-made-this (18 messages🔥):

Alignment Constitution Red-Teaming, AMD GPU Monitoring Tool, Offline Dictation macOS App, Hugging Face Spaces Dashboard, Graph Database Implementation


HuggingFace ▷ #core-announcements (1 messages):

Image generation, Release announcements


HuggingFace ▷ #computer-vision (5 messages):

object size detection, depth estimation models


HuggingFace ▷ #smol-course (1 messages):

LORA, Fine tuning, Open Source, Metal, CUDA


HuggingFace ▷ #agents-course (7 messages):

Agent Course Certificate, GAIA Evaluation Agent Attachments, AI Agent Workshop


DSPy ▷ #show-and-tell (59 messages🔥🔥):

TOON adapter for DSPy, BAMLAdapter, GEPA optimizations, Compounding Engineering CLI, rec-praxis-rlm


DSPy ▷ #general (31 messages🔥):

Vision Language Models optimization with DSPy, Gemini 3 Pro, Claude code DSPy harness, Context Compression Experiments


Yannick Kilcher ▷ #general (67 messages🔥🔥):

Control Theory, Free Markets, Lyapunov functions for NNs, Streaming Audio Transcription, Catastrophic Forgetting


Yannick Kilcher ▷ #agents (2 messages):

Discord Paper Discussions, Copilot Identification


Yannick Kilcher ▷ #ml-news (20 messages🔥):

Echo-TTS, Qwen3-TTS, Anthropic Interview, OpenAI's Stargate Project, DDR5 RAM kits


Latent Space ▷ #ai-general-chat (72 messages🔥🔥):

Meta acquires Limitless, GPT-4o video generation, ARC Prize winners announced, Essential AI's open-source model, Google's Titans revisited


Latent Space ▷ #genmedia-creative-ai (9 messages🔥):

Nano Banana Pro, Prompt-to-Image Checklist, Contact-Sheet Prompting


Moonshot AI (Kimi K-2) ▷ #general-chat (45 messages🔥):

Black Friday Promotion, Kimi Slides Feature, Kaggle Competition, Username Length Limit, Kimi Markdown Issues


Manus.im Discord ▷ #general (29 messages🔥):

Manus Support Issues, Google Play Billing Bugs, Account Upgrade Problems, Credit Refund Requests, Understaffed Support Team


aider (Paul Gauthier) ▷ #general (7 messages):

Gemini CLI OAuth, Claude Opus 4.5, aider + Amazon bedrock


aider (Paul Gauthier) ▷ #questions-and-tips (1 messages):

ethan_15839: Is there any way to use the Gemini CLI oAuth in aider to use the gemini models?


tinygrad (George Hotz) ▷ #general (5 messages):

USB 2.0 Driver Support, Meeting #99 Agenda, asm2464pd-firmware


MLOps @Chipro ▷ #events (1 messages):

AI Agent Workshop, AI Engineering Bootcamp, GenAI for Beginners


MCP Contributors (Official) ▷ #general-wg (1 messages):

paoloricciuti: Is there anywhere I can ask for this info to get more answers? 😅