Frozen AI News archive

not much happened today

**MiniMax M2.1** launches as an **open-source** agent and coding Mixture-of-Experts (MoE) model with **~10B active / ~230B total parameters**, claiming to outperform **Gemini 3 Pro** and **Claude Sonnet 4.5**, and supports local inference including on **Apple Silicon M3 Ultra** with quantization. **GLM 4.7** demonstrates local scaling on **Mac Studios** with **2× 512GB M3 Ultra** hardware, highlighting system-level challenges like bandwidth and parallelism. The concept of **inference quality** is emphasized as a key factor affecting output variance across deployments. Yann LeCun's **VL-JEPA** proposes a **non-generative, non-autoregressive** multimodal model operating in latent space for efficient real-time video processing with fewer parameters and decoding operations. Advances in agentic reinforcement learning for coding include self-play methods where agents inject and fix bugs autonomously, enabling self-improvement without human labeling, and large-scale RL infrastructure involving massive parallel code generation and execution sandboxes.

Canonical issue URL

a quiet christmas

AI News for 12/26/2025-12/27/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (208 channels, and 2801 messages) for you. Estimated reading time saved (at 200wpm): 236 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

happy christmas.


AI Twitter Recap

Open-source models, local inference, and “inference quality” as the hidden variable

Non‑generative multimodal learning resurges: VL‑JEPA as an efficiency play

Agents, RL-for-coding, and the emerging “context engineering” discipline

Retrieval, memory, and evaluation: from “benchmarks” to operational reliability

Systems & hardware constraints: memory supply chain and the “DIY PC is dead” narrative

Assorted technical notes worth bookmarking (but less central than the above)

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. GPU VRAM Upgrade Advocacy

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Vision-Language Model Innovations

2. OpenAI Prompt Packs Launch

3. Humorous AI and Art


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. GPU Inference & DSL Arms Race

2. Efficient Training, Fine-Tuning & Reasoning Shortening

3. Jailbreaks, Red Teaming & Safety Bypasses

4. Lightweight Multimodal & Dev Tooling

5. Memory Architectures for Autonomous Agents


Discord: High level Discord summaries

BASI Jailbreaking Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LMArena Discord


Perplexity AI Discord


OpenRouter Discord


LM Studio Discord


OpenAI Discord


GPU MODE Discord


HuggingFace Discord


Nous Research AI Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


Latent Space Discord


aider (Paul Gauthier) Discord


Eleuther Discord


DSPy Discord


Manus.im Discord Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Modular (Mojo 🔥) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MCP Contributors (Official) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

BASI Jailbreaking ▷ #general (476 messages🔥🔥🔥):

Japanese words, H100 setup, AI manipulation hackathon, Replit agent use, NVIDIA H100 GPU


BASI Jailbreaking ▷ #jailbreaking (407 messages🔥🔥🔥):

SWAGGY challenge on hackai.lol, Poisoned training data, GPT account recovery, Jailbreaking Gemini 3


BASI Jailbreaking ▷ #redteaming (20 messages🔥):

Uncensored LLMs for ethical coding, Model 'obliteration' vs. 'ablation', Special tokens and whitelists in LLMs, Jailbreak Function Templates, Flagging Code in LLMs


Unsloth AI (Daniel Han) ▷ #general (288 messages🔥🔥):

DoRA in Unsloth, NVME SSD speed boost, Qwen 3 thinking VL model fine-tuning for shorter reasoning, Synthetic Data, GPT OSS model generation


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 messages):

SIMD, GPU programming, LLM Fine-tuning


Unsloth AI (Daniel Han) ▷ #off-topic (165 messages🔥🔥):

KCD2 and Cyberpunk discussion, ChaiNNer Updates, TPU Training Struggles, LLM API Advertising, Anthropic Soul Research


Unsloth AI (Daniel Han) ▷ #help (21 messages🔥):

Unsloth LLaVA 1.5 notebook, Unsloth-zoo file import issue, Ministral-3-3B-Instruct-2512 model compatibility, Qwen3-VL video dataset fine-tuning, Cross-posting warning


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

Unsloth Logo Feedback, Nano Banana Chart Inquiry, deepfabric-blender-mcp-gguf Model Repo


Cursor Community ▷ #general (260 messages🔥🔥):

Cursor Student Verification Issues, Auto Unlimited Removal, Opus vs GPT-5.2, Claude Code Integration, Antigravity as an Alternative


LMArena ▷ #general (251 messages🔥🔥):

LMArena image compression, Captcha verification loop, Grok censorship, Gemini bypass


Perplexity AI ▷ #general (219 messages🔥🔥):

Perplexity 2026, EU LLM, 112 context, comet, Elon Musk


OpenRouter ▷ #app-showcase (2 messages):

Sonnet Model, thinking tags, output text


OpenRouter ▷ #general (203 messages🔥🔥):

Free AI API, Gooning, Nano Banana Pro 4K generation, Choosing an LLM for copywriting, Prompt engineering tips


OpenRouter ▷ #discussion (12 messages🔥):

Jules improvements, Free AI API, Misleading throughput numbers, GLN aggressive batching


LM Studio ▷ #general (90 messages🔥🔥):

RAM allocation during inference, Claude Distills and Modern Thinking Models, ffmpeg-mcp for MacOS, lmstudio as OpenAI Endpoint client, Remote LM Studio


LM Studio ▷ #hardware-discussion (91 messages🔥🔥):

Connecting multiple GPUs to i3, Nvidia hotfix driver 591.67, PCIe lanes for inference, DDR5 ECC Price Discrepancy, Blackwell vs 3080 for VRAM


OpenAI ▷ #ai-discussions (97 messages🔥🔥):

ChatGPT Subscription Issues, AI Image Generation, Sora Monetization, AI Development Projects, AI Detection


OpenAI ▷ #gpt-4-discussions (4 messages):

AI for analyzing pitch, AI for analyzing tone, AI for analyzing body language, AI for linguistics


OpenAI ▷ #prompt-engineering (17 messages🔥):

CustomGPTs for Prompt Improvement, Evaluating Prompt Quality, Meta-Prompt Systems, Prompt Engineering for Coding vs. Conversational AI


OpenAI ▷ #api-discussions (17 messages🔥):

CustomGPTs for prompt improvement, Prompt evaluation metrics, Meta-prompt systems, Prompt engineering for coding agents, Rubrics for non-verifiable domains


GPU MODE ▷ #general (14 messages🔥):

DSLs, PTX, CDNA ISA, AGX assembly


GPU MODE ▷ #triton-gluon (4 messages):

cuTile Performance, GPU Programming Evolution, DSL Challenges


GPU MODE ▷ #off-topic (2 messages):

Scientific Note-Taking, Technical Note-Taking, LaTeX, Orgmode, Excalidraw


GPU MODE ▷ #submissions (9 messages🔥):

NVIDIA Leaderboard Updates, nvfp4_dual_gemm Leaderboard


GPU MODE ▷ #cutlass (1 messages):

Cute DSL, Competitions in channel


GPU MODE ▷ #teenygrad (3 messages):

Tensor Handles, OpNode IR, Runtime Buffer Allocators, OpCode.ADD, ascii printer for IR


GPU MODE ▷ #helion (2 messages):

Helion usage in vLLM


GPU MODE ▷ #nvidia-competition (1 messages):

Cute-DSL vs C++ Cute, Cute API limitations, Cute consensus preference


GPU MODE ▷ #career-advice (3 messages):

handwritten kernels, tinygrad IR interpreter, tilelang


HuggingFace ▷ #general (26 messages🔥):

Agentic AI Course, RAG learning Resources, ML Project Water Quality Testing, Open Source OCR for Medical Documents, Hottest LLM Model


HuggingFace ▷ #i-made-this (4 messages):

Coding models released, AI-powered Django REST API Generator, New model released


HuggingFace ▷ #agents-course (3 messages):

Agent course, Hugging Face Courses


Nous Research AI ▷ #general (21 messages🔥):

LLMs social deduction game, Claude one-shot UI, Smart watch data to external AI, GPT models and ads, Zai and Minimax IPO


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/openbmb/status/2004539303309750341?s=46


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/openbmb/status/2004539303309750341?s=46


Moonshot AI (Kimi K-2) ▷ #general-chat (23 messages🔥):

Kimi coding vs Gemini, Kimi Researcher Model Speculations, AI implementation in society, Fact checking with LLMs


Yannick Kilcher ▷ #ml-news (10 messages🔥):

Groq Inference Chips, NVIDIA NPU acquisition, Chinese chips


Latent Space ▷ #ai-general-chat (7 messages):

Karpathy Refactoring, AI Agent Programming, Rob Pike Opinion


Latent Space ▷ #private-agents (2 messages):

Torchax, Unified Memory, Godawful OS


aider (Paul Gauthier) ▷ #general (4 messages):

Claude, Codebase Explanation


aider (Paul Gauthier) ▷ #questions-and-tips (3 messages):

Explaining codebase to Claude, Automating Claude updates


Eleuther ▷ #general (5 messages):

Full Autonomy Systems for AIs, Limitations to Full AI Autonomous Systems, Context Management, Long-Term Memory Implementation


Eleuther ▷ #research (2 messages):

all-the-noises github, arxiv