Frozen AI News archive

Claude Skills grows: Open Standard, Directory, Org Admin

**Claude Skills** are gaining significant traction since their launch in October, with a milestone of 100k views in one day for the Claude Skills talk, signaling growing adoption and importance. Announcements include org admin support, a new Skills Directory, and the move to an open standard named **Agent Skills**. In frontier model launches, **OpenAI** released **GPT-5.2-Codex**, touted as the best agentic coding model with improvements in native compaction, long-context reliability, and tool-calling, emphasizing real-world security impacts. **Google DeepMind** introduced **Gemini 3 Flash**, focusing on speed as a product feature impacting workflows and user engagement, alongside **FunctionGemma** and **T5Gemma 2**, emphasizing on-device deployment, fine-tuning, and multimodality.

Canonical issue URL

Skills are going the way of MCP!

AI News for 12/17/2025-12/18/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (207 channels, and 7381 messages) for you. Estimated reading time saved (at 200wpm): 603 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Some minorly interesting releases in 5.2 Codex and FunctionGemma, but the story you're most likely going to care about a year from now is the continued growth of Claude Skills. Launched in October, it was pretty universally ridiculed as a "folder of markdown" and/or pivot from MCP (now moved to Linux Foundation), but among insiders traction has grown and grown and grown. One way to gauge this growth is the Claude Skills talk crossing 100k views in 1 day - easily the fastest to that milestone in AIE history and probably the second millionaire talk of 2025.

Two AI engineers from Anthropic presenting at the Code Summit, with a banner suggesting "Don't Build Agents, Build Skills Instead"

The announcements today are:

All these seem incremental additions but the bigger picture is that Skills adoption is growing and serious and if our IRL conversations are any indication, you are probably also underestimating them.

The last time we made a non-news trend callout like this was Claude Code.


AI Twitter Recap

Frontier model launches: GPT-5.2-Codex, Gemini 3 Flash, and on-device Gemma variants


Agents: “Skills” standardization, harness UX, and long-running infra realities


Evals, regressions, and safety measurement: METR horizon fixes + OpenAI CoT monitorability


Systems & open tooling: MLX distributed on Macs, vLLM MoE throughput, and diffusion-LM toolchains


Multimodal generation & document intelligence: Kling motion control, Runway Gen-4.5, Mistral OCR 3

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Google's Gemma Models Update

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. GPT-5.2 Benchmark Achievements

2. Medical Advice and AI

3. Image Generation and Realism


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. Next‑Gen Frontier & Edge Models: Gemini 3 Flash, GPT‑5.2, FunctionGemma & Friends

2. Open‑Source Infra, Ranking, and JSON‑Safe APIs for LLMs

3. GPU Hardware, Kernel Competitions, and Practical Performance Tuning

4. Prompt, Context, and Program Optimization: From GEPA to Context‑Rot

5. New AI‑Native Products and Data Platforms Built on LLMs


Discord: High level Discord summaries

LMArena Discord


BASI Jailbreaking Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


Cursor Community Discord


LM Studio Discord


GPU MODE Discord


OpenAI Discord


HuggingFace Discord


Latent Space Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


DSPy Discord


Yannick Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


MCP Contributors (Official) Discord


tinygrad (George Hotz) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1112 messages🔥🔥🔥):

GPT-1.5 Censorship, Gemini Image Generation vs. GPT, Gemini 3 Flash cost/performance, Google vs OpenAI: compute


LMArena ▷ #announcements (4 messages):

Arena-Rank Open Source, Image Edit Leaderboard Updates, Search Leaderboard Updates, Text Leaderboard Updates


BASI Jailbreaking ▷ #general (941 messages🔥🔥🔥):

Browser mining extensions, Lottery bitcoin mining, Browser Exploitation Framework, ChatGPT Jailbreak subreddit ban, Fetch tokens sales


BASI Jailbreaking ▷ #jailbreaking (359 messages🔥🔥):

Gemini 5.2 jailbreak, LLM Jailbreaking Techniques, Nano Banana Pro restrictions, r/chatgptjailbreak ban, Gemini image generation


BASI Jailbreaking ▷ #redteaming (3 messages):

``


Perplexity AI ▷ #announcements (1 messages):

Gemini 3 Flash, Perplexity Pro, Perplexity Max


Perplexity AI ▷ #general (886 messages🔥🔥🔥):

GPT-5 Pro on Perplexity, Gemini 3 Pro vs ChatGPT vs Claude for coding, Ethically Sourced Music AI, Perplexity Pro Referral Program, Tilly Norwood and AI in Hollywood


Perplexity AI ▷ #pplx-api (10 messages🔥):

Perplexity Pro API key, Financial Modeling Prep, Realtime Price Feeds, Finnhub Data Provider


Unsloth AI (Daniel Han) ▷ #general (263 messages🔥🔥):

Qwen3-VL, Saving Embeddings, GLM-4.6V-Flash-GGUF repetition issues, RL model beta release, Finetuning on a phone


Unsloth AI (Daniel Han) ▷ #announcements (1 messages):

Unsloth updates - 3x faster, FunctionGemma, Nemotron 3, Mistral VLMs, GLM-4.6V


Unsloth AI (Daniel Han) ▷ #off-topic (405 messages🔥🔥🔥):

Overfitting tokenizers, Moving to Arch Linux, H100s on Google Colab, TTS Model for Multiple Languages, T5Gemma 2


Unsloth AI (Daniel Han) ▷ #help (46 messages🔥):

Qwen3 4B Instruct Errors, FBGEMM Warning, OCR Performance, PaddleOCRv5, Qwen3 VL


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

Progressive Disclosure of AI Context, Qwen3-4b-Deep-Beta Model Release, Savant Commander MOE Model


Unsloth AI (Daniel Han) ▷ #research (2 messages):

MoLA, Adapter Training, Reasoning, Token Budgeting


OpenRouter ▷ #announcements (1 messages):

JSON repair, Browser notifications, Long-context models, Fastest-growing AI infra


OpenRouter ▷ #app-showcase (44 messages🔥):

AI-made Discord Server List, Image Verification, LLM System Prompt Test Cases, OpenRouter Model Table


OpenRouter ▷ #general (430 messages🔥🔥🔥):

Gemini 3 Flash caching, Chutes crypto mining, deepseek v3 0324 context size, AI water usage, Openrouter Android/iOS app


OpenRouter ▷ #discussion (192 messages🔥🔥):

Mistral Large 3 Quality, OpenRouter Website Performance, Vision Function for Bots, AI Learning Resources for GTM Team, Gemini Model's Pixel-Perfect Bounding Boxes


Cursor Community ▷ #general (447 messages🔥🔥🔥):

Domain name pricing, Gemini 3 flash, Obsidian Copper and Carbon Monochrome themes, Free models in cursor, Student Discount Eligibility


LM Studio ▷ #general (129 messages🔥🔥):

Gemini's Deep Research in LM Studio, Open Source Models for API website operation, Smart Home setup, DDG MCP server, Cursor IDE


LM Studio ▷ #hardware-discussion (132 messages🔥🔥):

Radeon R9700 Scaling, AMD GPU lifespan, Multi-GPU scaling issues, AMD vs Nvidia for AI, W7800 48G GPU


GPU MODE ▷ #general (7 messages):

Lecture 1 Error, model definition, profiling code


GPU MODE ▷ #cuda (2 messages):

Spark Devs


GPU MODE ▷ #announcements (1 messages):

NVIDIA, cuTile, TileIR, Mehdi Amini, Jared Roesch


GPU MODE ▷ #job-postings (2 messages):

SemiAnalysis, clusterMAX, SLURM, GPUs, Kubernetes


GPU MODE ▷ #beginner (5 messages):

CUDA Setup, DL Projects, Visual Studio, VS Buildtools


GPU MODE ▷ #torchao (5 messages):

dtype deprecation in linear_quant_modules, ao namespacing PR


GPU MODE ▷ #off-topic (14 messages🔥):

AI Formal Verification, GPU Kernels Verification, PyTorch PRs, Open Source Contribution


GPU MODE ▷ #rocm (2 messages):

Training Models on Strix Halo, PyTorch Tutorials, GitHub Repositories


GPU MODE ▷ #self-promotion (9 messages🔥):

SonicMoE, NVIDIA Hopper GPUs, Princeton University, UC Berkeley, Together AI


GPU MODE ▷ #thunderkittens (1 messages):

kashimoo2_76983: <@1012256135761383465> did you folks write a decode kernel with mi300s or 355s?


GPU MODE ▷ #reasoning-gym (4 messages):

Reasoning-gym code, faker generator, robust tests


GPU MODE ▷ #submissions (13 messages🔥):

nvfp4_gemm benchmark, grayscale_v2 benchmark, H100 performance, NVIDIA performance


GPU MODE ▷ #hardware (3 messages):

Homelab Setup, GPU Training Differences, NVIDIA vs Other GPUs/NPUs, Intra-Node Interconnect Importance, NVIDIA's Software Role


GPU MODE ▷ #amd-competition (3 messages):

AMD-MLA-Decode leaderboard, Reproducing Kernels, MI300 Availability, AMD Developer Cloud


GPU MODE ▷ #general (2 messages):

Trimul competition, Kernel Runtime, Geometric Mean, Standard Deviation


GPU MODE ▷ #nvidia-competition (113 messages🔥🔥):

CuTeDSL L2 cache hint policies, Submission system timing out, Discord Bot usage for Submissions, MMA wrap optimization, TCGen05 instruction assistance


GPU MODE ▷ #robotics-vla (6 messages):

Hand Pose Estimation, Wrist Cameras, NVIDIA Cosmos Predict, Mimic-Video Paper


GPU MODE ▷ #career-advice (19 messages🔥):

Contributing to Open Source Projects, Keeping up with SoTA Research, AI Infra Engineer Demand, Kernel Competitions, Parallel Programming Passion


OpenAI ▷ #annnouncements (3 messages):

Pinned Chats, GPT-5.2-Codex, Chain-of-Thought Monitorability


OpenAI ▷ #ai-discussions (163 messages🔥🔥):

GPT-5.2 Hallucinations, Gemini 3.0 Flash, Sora 2 Discussions, AI Coherence over Time, ChatGPT App Store


OpenAI ▷ #gpt-4-discussions (8 messages🔥):

Gemini 3.0 Flash, GPT 5.2 High, deepseek r3.2, API date


OpenAI ▷ #prompt-engineering (1 messages):

Model provenance, AI writing style, Lack of provenance annotation tags


OpenAI ▷ #api-discussions (1 messages):

Model Detection, Provenance Annotation, Response Polishing


HuggingFace ▷ #general (103 messages🔥🔥):

Lightweight Vision Transformer Models, Model Choice for Structured Data Extraction, Fill Mask Techniques, Forward Pass in LLMs and Steering, Kaggle Runtime Disconnections


HuggingFace ▷ #i-made-this (2 messages):

Android voice assistant, Gemini 1.5 Flash, VoxCPM 1.5, Apple Neural Engine


HuggingFace ▷ #agents-course (4 messages):

Smolcourse Delays, AI Learning Resources


Latent Space ▷ #ai-general-chat (89 messages🔥🔥):

Exa AI People Search, Michael Truell and John Schulman LLM discussion, OpenAI potential $750B valuation, Pieter Abbeel Amazon AGI Head, Tomo AI


Latent Space ▷ #private-agents (2 messages):

vLLM Router, Intelligence Control Plane, Ollama / vLLM routing through semantic router


Latent Space ▷ #genmedia-creative-ai (8 messages🔥):

Black Forest Labs FLUX.2 Launch, xAI Grok Voice Agent API


Eleuther ▷ #general (14 messages🔥):

GPT-2 interpretability, 3D visualization of residual stream, SOTA Model Performance, Claude Opus 4.5 mistakes, Neuronpedia


Eleuther ▷ #research (3 messages):

Speech/NLP Research Collaboration, AI Research, NLP


Eleuther ▷ #interpretability-general (4 messages):

Anthropic's weight masking, Gemma 3 extreme activations, Adam's fault


Eleuther ▷ #multimodal-general (4 messages):

long range view synthesis, novel view synthesis, parallax effect, depth estimate


Eleuther ▷ #gpt-neox-dev (1 messages):

Custom Cross Entropy Function, Backwards Pass


Modular (Mojo 🔥) ▷ #general (19 messages🔥):

Mojo GPU usage, Rust GPU capabilities, Mojo std::offload


Modular (Mojo 🔥) ▷ #mojo (6 messages):

GPU issues with LLM build in MAX, C interop ideas from Rust, Mojo's array access quirks


aider (Paul Gauthier) ▷ #general (17 messages🔥):

Aider vs Aider-ce, OpenCode Accuracy vs Aider, Context Management and Token Efficiency, Task Bundling with Context


aider (Paul Gauthier) ▷ #links (3 messages):

Gemini 3 Flash, aider configurations, Litellm updates


Nous Research AI ▷ #general (6 messages):

Office Hours Recording, Sam's IOU Scheme, CUDA Performance on Linux


Nous Research AI ▷ #ask-about-llms (3 messages):

Open Source Server for Minos-v1, VLLM, SGLang


Nous Research AI ▷ #research-papers (3 messages):

LLM finetuning, Electronic schematics dataset


Nous Research AI ▷ #research-papers (3 messages):

LLM finetuning, Electronic schematics dataset


DSPy ▷ #general (12 messages🔥):

GEPA Optimization, Robots building robots, TreeOfThought Module, dspy.Refine feedback, GEPA Definition


Yannick Kilcher ▷ #general (10 messages🔥):

In-Context Learning Research, Draft Model Optimization, Training Cluster Pipelining, Vast.ai Issue, Nvidia's Brev vs Runpod


Yannick Kilcher ▷ #ml-news (1 messages):

ARC-AGI Benchmark, Toolathlon Benchmark, Training Data Mix


Moonshot AI (Kimi K-2) ▷ #general-chat (6 messages):

Kimi K2, Moonshot AI, Free Models


Manus.im Discord ▷ #general (5 messages):

DNS issues, Chat image limits, Manus revenue


MCP Contributors (Official) ▷ #general (4 messages):

MCP Prompts in Node Server, ChatGPT App Submissions


tinygrad (George Hotz) ▷ #general (3 messages):

JIT Refactor, Firmware Crushing, RDNA3 Assembly Backend