Frozen AI News archive

not much happened today

**GLM-4.7** and **MiniMax M2.1** open-weight model releases highlight day-0 ecosystem support, coding throughput, and agent workflows, with GLM-4.7 achieving a +9.5% improvement over GLM-4.6 and MiniMax M2.1 positioned as an OSS Claude-like MoE model with 230B total parameters and 200K context. **Gemma Scope 2** from **google-deepmind** introduces sparse autoencoders and transcoders for interpretability across Gemma 3 models, aiming to provide shared infrastructure for safety and debugging. The **Medmarks v0.1** open medical evaluation suite and leaderboard launch addresses the need for open medical benchmarking across 15+ environments, engaging clinicians and researchers.

Canonical issue URL

a quiet day.

AI News for 12/23/2025-12/24/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (208 channels, and 4471 messages) for you. Estimated reading time saved (at 200wpm): 341 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!


AI Twitter Recap

Open-weight model releases: GLM‑4.7 and MiniMax M2.1 tighten the gap


Interpretability & mech interp infra: Gemma Scope 2 as a community substrate


Benchmarks & evaluation: medicine, agents, ARC, and API-invocation reality checks


Agents & developer workflows: simplification, context organization, and “skills” loops


Multimodal shipping: TTS, image editing acceleration, and visual-context architectures


Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. DGX Spark User Experience

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Qwen-Image-Edit-2511 Release and Analysis

2. AI Tools and User Experiences

3. AI in Popular Culture and Memes


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.2

1. Model Reliability, Hallucinations, and Benchmark vs Reality Gaps

2. Reasoning Tokens, Interleaved Thinking, and Tooling That Breaks When You Don’t Preserve State

3. Local-First Model Ops: GGUF Pipelines, Small Tool-Call Models, and Hardware Reality

4. GPU Kernel & Compiler Tooling: New Knobs, Faster Autotune, and Triton Ecosystem Pressure

5. New Benchmarks, Datasets, and Open-Source Drops (Plus Some Spicy Model Editing)


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Cursor Community Discord


BASI Jailbreaking Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenAI Discord


OpenRouter Discord


HuggingFace Discord


GPU MODE Discord


Latent Space Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


DSPy Discord


tinygrad (George Hotz) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MCP Contributors (Official) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1015 messages🔥🔥🔥):

GLM 4.7 Hallucinations, Haiku's low hallucination rate, Gemini 3 Pro grounding issues, Liabilities of LLM hallucinations, Fallibility of AI tools


Perplexity AI ▷ #general (881 messages🔥🔥🔥):

Google AI Pro referral code, Gemini in China, Perplexity AI and emails, Max vs Pro output, Voice of reason


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity API, 502 Bad Request


Cursor Community ▷ #general (594 messages🔥🔥🔥):

Cursor IDE Mac Integration Issues, Frustrations with AI Progress in 2025, Opus 4.5's reasoning skill issue, Free Composer-1 Promo, GLM Model Discussion


BASI Jailbreaking ▷ #general (433 messages🔥🔥🔥):

Visual System Introspection Dialogue, Shopify Money Burning, Google Privacy Concerns, MODIE Integration, Emacs and Vim Integration


BASI Jailbreaking ▷ #jailbreaking (137 messages🔥🔥):

ChatGPT 5.2 Jailbreak, Gemini Jailbreak, DAN Jailbreak, Grok Jailbreak, NSFW Content Generation


BASI Jailbreaking ▷ #redteaming (12 messages🔥):

Waiting room hell, red team exercises, publishing findings, Google's consent


Unsloth AI (Daniel Han) ▷ #general (154 messages🔥🔥):

GGUF Conversion, Merging Adapters, GLM 4.7 iMatrix, Quantization Algorithm, Smart Offloading


Unsloth AI (Daniel Han) ▷ #off-topic (283 messages🔥🔥):

AI music instrument extraction, SNAC Codec TTS, ElevenLabs latent space Replication, Vote with wallet?, HLE dataset


Unsloth AI (Daniel Han) ▷ #help (2 messages):

LangGraph, ReAct Agent, Structured Output


LM Studio ▷ #announcements (1 messages):

FunctionGemma, UnslothAI, GGUF conversion


LM Studio ▷ #general (124 messages🔥🔥):

MLX models for image analysis and coding, Gemma for Images, Qwen3 Model Optimization, Zero Data Retention, Functiongemma Model


LM Studio ▷ #hardware-discussion (211 messages🔥🔥):

GPU temperature issues, Thermal Paste Problems, Multi-GPU Setup, PCIe Lane Configuration, VRAM Temperature Degradation


OpenAI ▷ #ai-discussions (146 messages🔥🔥):

Structured output from GPT, GPT-5's Fluff, Knowledge tracking, LLM costs, Transformer bottlenecks


OpenAI ▷ #prompt-engineering (19 messages🔥):

PDF Visuals with ChatGPT, ToS Splitting Technique, Honesty Training, Agent Controlled Meta-cognition, Ecosystem Wide Extra Controls


OpenAI ▷ #api-discussions (19 messages🔥):

PDF Visuals Prompting, ToS Splitting Technique, Honesty Training, Meta-Cognition for Hallucination Control, Agent-Controlled Meta-Cognition Workflow


OpenRouter ▷ #announcements (1 messages):

MiniMax M2.1, OpenRouter, Interleaved Thinking Model, Reasoning Details


OpenRouter ▷ #app-showcase (16 messages🔥):

Latex Rendering, Code Highlighting, Waifurewolf, Kimi models, Free Trial


OpenRouter ▷ #general (63 messages🔥🔥):

OpenRouter for consumer use vs. OpenWRT, Gemini 3 Flash Preview 400 Errors, RooCode & Gemini Reasoning, OpenRouter Coin, Video model capabilities


OpenRouter ▷ #new-models (3 messages):

``


OpenRouter ▷ #discussion (38 messages🔥):

Benchmarking Claude Code, OpenBench Evals, Agent vs Raw LLM Benchmarking, Consensus Agents


HuggingFace ▷ #general (77 messages🔥🔥):

API Agent Selection, Reduce RAM Usage, Reverse Texture Generation, VLM for Image Description, Qwen Models for Node Graph Creation


HuggingFace ▷ #i-made-this (4 messages):

Embedding Tooling, GapTrack Job App, Amoeba Butterfly System


HuggingFace ▷ #core-announcements (1 messages):

2026 plans, Community support, Hugging Face Thanks Supporters


HuggingFace ▷ #agents-course (7 messages):

Reinforcement Learning Panel, HF Jobs replacement


GPU MODE ▷ #general (7 messages):

AI Systems Performance Engineering Book, Finding the right engineers, PMPP Relevance to Inference Kernels, Tensor Cores and Kernel Fusion, Mixed-Precision Math resources


GPU MODE ▷ #triton-gluon (1 messages):

cuTile Triton Adapter, LLVM Fork Necessity, Triton Roadmap, cuTile Hints


GPU MODE ▷ #cuda (10 messages🔥):

st.async, PTX 8.7, st.async.shared::cluster, PTX 8.1/sm_90, st.global vs st.async.global


GPU MODE ▷ #pmpp-book (1 messages):

mannythecreator: Currently reading Parallel Histogram


GPU MODE ▷ #off-topic (6 messages):

Vietnamese Noodles, C and C++ History, Beef bone broth names


GPU MODE ▷ #submissions (6 messages):

NVIDIA Leaderboard, nvfp4_dual_gemm leaderboard results


GPU MODE ▷ #cutlass (1 messages):

Cache Policy, Cute.Jit, TMA Copy, CacheEvictionPriority


GPU MODE ▷ #low-bit-training (1 messages):

kitsu5116: https://arxiv.org/abs/2511.05811


GPU MODE ▷ #helion (2 messages):

Helion, LFBO Pattern Search


GPU MODE ▷ #nvidia-competition (46 messages🔥):

FP16 issues, Negative scale clamping, Quickstart for contests, Cutedsl tmem allocator, Blackwell Pipelining


GPU MODE ▷ #career-advice (1 messages):

LLVM, MLIR, CUDA compilers, Mojo


Latent Space ▷ #ai-general-chat (15 messages🔥):

OpenAI frontierscience Dataset, Ivan Zhao Article, Google DeepMind 2026 Initiative


Latent Space ▷ #genmedia-creative-ai (23 messages🔥):

EgoX code release, AI Content Virality, Alibaba Qwen-Image-Edit-2511, AI Christmas Cartoon, Neo-Noir Cinematic Comic Style


Nous Research AI ▷ #general (15 messages🔥):

AI Model Commoditization, Qualitative Research Title Defense, AI Companion Risks, New Dataset Release


Nous Research AI ▷ #research-papers (4 messages):

Prompt context issues, Synthetic dataset training costs


Nous Research AI ▷ #research-papers (4 messages):

Prompt context limits, Synthetic dataset for model training, Cost of model training


Modular (Mojo 🔥) ▷ #general (2 messages):

Discord bug with Kapa AI, Mojo GPU Puzzles Channel


Modular (Mojo 🔥) ▷ #mojo (17 messages🔥):

package memory and UnsafePointer, implicit imports, safe behaviour of the language aka opt-in, multiple preludes, distributed database in Mojo


Eleuther ▷ #general (2 messages):

Open Source Project Feedback, Empirical Observation Projects, Performance Validation


Eleuther ▷ #research (9 messages🔥):

Non-TACL NLP Journals, Computational Linguistics Journal, TACL page limits, System Instruction Prompt tests


Eleuther ▷ #interpretability-general (8 messages🔥):

In-context learning, Fine-tuning, Interventions


Moonshot AI (Kimi K-2) ▷ #general-chat (17 messages🔥):

Kimi Glaring Issues, Gemini vs Kimi, Minimax 2.1 as digital employee


Manus.im Discord ▷ #general (10 messages🔥):

Manus Promo Code, Open Sourcing Manus, Freelance Engineer Intro, Overbilling Issue


DSPy ▷ #show-and-tell (3 messages):

GEPA runs, ML, AI, validation sets, DSPy


DSPy ▷ #general (5 messages):

DSPy Contributions, Anthropic Skills Optimization


tinygrad (George Hotz) ▷ #learn-tinygrad (7 messages):

Transformer Implementations, Attention Autograd Shapes, Causal Masking, tinygrad.gradient


Yannick Kilcher ▷ #general (1 messages):

AI/ML Learning, Collaboration Opportunities