Frozen AI News archive

Not much happened today

**Sakana AI** released **Reinforcement-Learned Teachers (RLTs)**, a novel technique using smaller 7B parameter models trained via reinforcement learning to teach reasoning through step-by-step explanations, accelerating **Chain-of-Thought** learning. **Mistral AI** updated **Mistral Small 3.2** improving instruction following and function calling with experimental FP8 quantization. **Google Magenta RealTime**, an 800M parameter open-weights model for real-time music generation, was released. **Arcee AI** launched **AFM-4.5B**, a sub-10B parameter foundation model extended from **Llama 3**. **OpenThinker3-7B** was introduced as a new state-of-the-art 7B reasoning model with a 33% improvement over **DeepSeek-R1-Distill-Qwen-7B**. The **STORM** text-video model compresses video input by 8x using **Mamba layers** and outperforms **GPT-4o** on MVBench with 70.6%. Discussions on reinforcement learning algorithms PPO vs. GRPO and insights on **DINOv2**'s performance on ImageNet-1k were also highlighted. *"A very quiet day"* in AI news with valuable workshops from **OpenAI**, **Amazon**, and **GDM**.

Canonical issue URL

a very quiet day.

AI News for 6/20/2025-6/23/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 12500 messages) for you. Estimated reading time saved (at 200wpm): 1080 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A good day to browse the new AIE videos rolled out this weekend, including:

What a good time to catch up!


AI Twitter Recap

Model & Technique Development

AI Agents & Tooling

Industry, Companies & Geopolitics

AI Safety & Research Philosophy

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

no localLlama posts met our bar today!

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Model and Agent Benchmarks and Releases

2. AI, Automation, and the Changing Nature of Work

3. Robotic and AI Mishaps in Healthcare Memes


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. AI Model Performance and Evaluation

Theme 2. AI Hardware and Low-Level Optimization

Theme 3. AI Tooling and Development Experience

Theme 4. Agents and Orchestration

Theme 5. Model Development and Research Techniques


Discord: High level Discord summaries

OpenAI Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LMArena Discord


HuggingFace Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Nous Research AI Discord


Eleuther Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Latent Space Discord


Notebook LM Discord


Yannick Kilcher Discord


Modular (Mojo 🔥) Discord


Torchtune Discord


MCP (Glama) Discord


Manus.im Discord Discord


LlamaIndex Discord


Cohere Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

OpenAI ▷ #ai-discussions (1018 messages🔥🔥🔥):

AI consciousness, LLM prompting, Custom GPT action code, o3 limits


OpenAI ▷ #gpt-4-discussions (15 messages🔥):

Expired file warnings, Text-to-voice speed control, Model Dumb-Down Conspiracy, Training GPTs with books


OpenAI ▷ #prompt-engineering (11 messages🔥):

ChatGPT-4o error debugging, Mimicking Deep Research report, PDF generation failures in ChatGPT


OpenAI ▷ #api-discussions (11 messages🔥):

ChatGPT-4o errors, error directive, hallucinations, Deep Research report format, PDF generation


Perplexity AI ▷ #general (1142 messages🔥🔥🔥):

ChatGPTs inability to maintain context in certain conditions, Gemini's awareness of its own hallucinations, Comparison of Kimi and Perplexity Labs, Samsung Galaxy's free Perplexity Pro promotion, AskPerplexity bot on X (formerly Twitter) not replying to users


Perplexity AI ▷ #sharing (6 messages):

earthquake, cross-origin-context-poisoning, US enters Iran war, MCP model context protocol sec, quantum tele


Perplexity AI ▷ #pplx-api (2 messages):

PPLX Devs Availability, API Support Inquiry


Unsloth AI (Daniel Han) ▷ #general (1006 messages🔥🔥🔥):

Gemma 3, Blackwell, Runpod MI300X, Deepseek Tool Calling


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

AI Companies Sponsor Digitization, Essential Web Data Size, Text-to-Music Proprietary Challenge, QAT Finetuning Library


Unsloth AI (Daniel Han) ▷ #help (183 messages🔥🔥):

Multigpu Support, TRL downgrade, Gemma 3 fix, Qwen3 notebook broken, llama3.2 empty output


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

ADV_AGI_FRAME on Hugging Face, Homeless developer shares link


Unsloth AI (Daniel Han) ▷ #research (11 messages🔥):

Vibe Coding Study, Gemini API Reward Functions, GRPO Reward Model Training, BNPO vs Dr.GRPO


Cursor Community ▷ #general (1007 messages🔥🔥🔥):

New Cursor Pricing, Rate Limits, Gemini vs Sonnet, MCP Tools, Background Agents


Cursor Community ▷ #background-agents (60 messages🔥🔥):

Background Agent Environment Setup, Docker Configuration for Background Agents, Background Agents and Secrets Management, Slack Integration with Background Agents, Background Agents API


LMArena ▷ #general (949 messages🔥🔥🔥):

Gemini vs O3, Grok 3.5, Stonebloom, Model Performance Evaluation, LLM AUPs


LMArena ▷ #announcements (1 messages):

AI Generation Contest, Cozy Desk Theme


HuggingFace ▷ #general (402 messages🔥🔥):

Flamesong model, Password Issues, SFT vs RLHF tuning, Running models with AMD cards, Finding non safety tuned models


HuggingFace ▷ #today-im-learning (1 messages):

devanshukoli: i'm entering the Mcp Course by hugging face.


HuggingFace ▷ #cool-finds (1 messages):

@techhjork:

technosourceressextraordinaire: bills messy like my dad


HuggingFace ▷ #i-made-this (43 messages🔥):

Proto-consciousness field, GridDB for IoT sensory data, Postgresql MCP server, Lunaris Codex, Biomimicry in AI


HuggingFace ▷ #reading-group (3 messages):

Reading Group Cadence, GNNs/Spectral Graph Theory Literature Review


HuggingFace ▷ #computer-vision (6 messages):

Midjourney Video Model, African Image Datasets, JAX Models, Optimum DETR


HuggingFace ▷ #NLP (5 messages):

Docker crash, Sentence Transformers, Input Embeddings


HuggingFace ▷ #gradio-announcements (2 messages):

LlamaIndex Choice Award, NASA Space Explorer Agent, Mistral AI Choice Award, OpenSorus Project, Hackathon Support


HuggingFace ▷ #smol-course (2 messages):

Ollama llama3.2, smol course


HuggingFace ▷ #agents-course (25 messages🔥):

Error 500 with OpenAIServerModel and TinyLlama, Smolagents Docstring Parsing Exception, Hugging Face Discord Access, Agent AI Learning Paths, Submitting Use-Case Work


LM Studio ▷ #general (219 messages🔥🔥):

LM Studio pull request, LM Studio default persona settings, Download model from huggingface, LM Studio Hardware tab & system requirements, LM Studio Qwen3 threads usage


LM Studio ▷ #hardware-discussion (180 messages🔥🔥):

Quantization impact on token generation speed, AMD Ryzen AI Max 395 vs 70b+ models, DDR5 RAM limitations with Intel 12th gen CPUs, 5090 vs 4090 price comparison


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Gemini 2.5 Pro, API Migration, Breaking Changes


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Mnemix app launch, AwesomeMCPs app launch


OpenRouter (Alex Atallah) ▷ #general (372 messages🔥🔥):

Deepseek R1T Chimera Disappearance, Deepinfra B200 Promo, Azure vs OVH Cost Comparison, OpenAI's Confusing Model Naming Strategy, Cohere Moderation Changes


OpenRouter (Alex Atallah) ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


Nous Research AI ▷ #general (172 messages🔥🔥):

Entropy and LLMs, Nano vLLM by DeepSeek, Effective response length of reasoning models, Humanizing AI agents


Nous Research AI ▷ #ask-about-llms (24 messages🔥):

LLM training with negative information, Nous API token count, Function calling implementation across models, Best RP model, Wallet connection


Nous Research AI ▷ #interesting-links (6 messages):

MCP System, Think Tank, Mesh Sharing, Data Tagging, Reward Models and Bias


Eleuther ▷ #general (62 messages🔥🔥):

Reputation in Discord, Joining OWL, Public Problem List, Language Diffusion Models, Prefix Caching


Eleuther ▷ #research (25 messages🔥):

Bottleneck Dimension Experiments, Token Pruning/Dropping Methods, Spectral Clipping, Imitation Learning in Racing Games, EAI Summer of Open AI Research


Eleuther ▷ #interpretability-general (4 messages):

k-shot steering vectors, ACL paper on feature interaction, EAI Summer of Open AI Research, NNsight pre-release


Eleuther ▷ #lm-thunderdome (78 messages🔥🔥):

HFLM model access for hooks, Log Likelihood Numbers, Llama3 GSM8k Reproduction, Lambada Target Token Issue


GPU MODE ▷ #general (17 messages🔥):

PSU and GPU Power, CUDA Server, GPU purchase 5070 vs 7800xt, Neutrino: Fine-grained GPU Kernel Profiling, Code Readability & Const Variables


GPU MODE ▷ #triton (3 messages):

Triton AOT Compile, Triton type hints


GPU MODE ▷ #cuda (19 messages🔥):

Nsight for CLion, memcpy_async details, control divergence, GFLOPS calculation, Nsight compute


GPU MODE ▷ #torch (8 messages🔥):

PyTorch gradient calculation for torch.clip, Quantization aware training for embedded systems, Capturing collective communication graphs in torchtitan, SimpleFSDP implementation in titan, Custom graph passes in inductor


GPU MODE ▷ #algorithms (4 messages):

Parallel Algorithms, Matrix Operations, Sorting Algorithms


GPU MODE ▷ #beginner (8 messages🔥):

CUDA Illegal Memory Access, Triton vs CUDA Learning Resources, SYCL Information


GPU MODE ▷ #rocm (19 messages🔥):

mi300x profiling, chisel-cli, rocprof integration, rocprofiler-sdk, nsight-compute


GPU MODE ▷ #intel (1 messages):

tri_nitr0_t0luene: where do I find the documentation on how to Code oneAPI SYCL for GPU?


GPU MODE ▷ #self-promotion (5 messages):

Fibonacci GPU Calculation, NVIDIA Thrust Library, MI300X Profiling Tool Chisel, CuTeDSL Introduction, NVIDIA CUTLASS Team


GPU MODE ▷ #🍿 (4 messages):

OSS datasets, Pytorch, Triton, KernelBot


GPU MODE ▷ #reasoning-gym (5 messages):

VRAM Requirements, KL Loss, FP32 Training, A6000 GPUs, Reasoning Gym


GPU MODE ▷ #gpu模式 (3 messages):

Chinese speakers in the channel, Multilingual AI research community, GPU Mode


GPU MODE ▷ #submissions (23 messages🔥):

MI300, H100, amd-fp8-mm Leaderboard, Grayscale Leaderboard, Histogram Leaderboard


GPU MODE ▷ #tpu (2 messages):

TPU Interaction, XLA Compiler, Pallas, StableHLO


GPU MODE ▷ #factorio-learning-env (22 messages🔥):

Self-Generating Tasks, Auto Verifiers, Factorio Source Code, Factory Bug Fixes


GPU MODE ▷ #cutlass (2 messages):

CuTeDSL PTX and sass code Emission, Cutlass Future Releases


aider (Paul Gauthier) ▷ #general (88 messages🔥🔥):

Benchmarking minimax/minimax-r1, Claude Code Sonnet Limits, Aider Context Management, Mcpm Aider tool, Gemini Code rewriting


aider (Paul Gauthier) ▷ #questions-and-tips (21 messages🔥):

aider skipping edits, Aider Interaction Guidelines, Claude 4 Sonnet not following CONVENTIONS.md, Loading custom typescript library, Recovering an /undo command


aider (Paul Gauthier) ▷ #links (9 messages🔥):

Claude Code API, Anthropic Subsidization, Terms of Service


tinygrad (George Hotz) ▷ #general (67 messages🔥🔥):

Tinygrad backward time, AMD GPU instability, IO_uring ZCRX DMA-BUF, tinygrad server, NVMe driver in userspace


tinygrad (George Hotz) ▷ #learn-tinygrad (38 messages🔥):

Tinygrad Async Data Transfer, RNN Performance in Tinygrad, LSTM performance, Unit Tests Wishlist, Device Availability Check Failing


Latent Space ▷ #ai-general-chat (95 messages🔥🔥):

MCP servers, Scarlet AI rewrite, Google Workspace automation, AI timeline, ElevenLabs 11ai


Notebook LM ▷ #use-cases (12 messages🔥):

GestaltView Ecosystem, NotebookLM as a Strategic Partner, Podcast Language Expansion, Solicitation Guidelines


Notebook LM ▷ #general (76 messages🔥🔥):

AI Engineering Study Tips with NotebookLM, NotebookLM vs Gemini, Audio Overview Limits, Image analysis, Gemini Model selection


Yannick Kilcher ▷ #general (23 messages🔥):

NLP kickstart with small LLMs, Anti-drone detection with YOLO, Stanford AI resource, Verifying reasoning traces, VLM research


Yannick Kilcher ▷ #paper-discussion (16 messages🔥):

Reading Group Info, RWKV-7 Goose, Mathematical Finance Papers


Yannick Kilcher ▷ #ml-news (40 messages🔥):

Agent2Agent Protocol, Vision Language Models, Computational Chemistry with Deep Learning, AI and its impact on learning, Genetic Engineering vs Automation


Modular (Mojo 🔥) ▷ #general (7 messages):

Latent Space Interview, AMD Support Announcement, End-to-End Rust Replacement, Hack Weekend Event, Self-Promotion Rule Violation


Modular (Mojo 🔥) ▷ #mojo (37 messages🔥):

Int vs int, Typed raises, Autodiff engine, Memory errors, Optional Tensor


Torchtune ▷ #dev (23 messages🔥):

Torchtune Transformers Alignment, Dataset Packing OOM Errors, Pre-Tokenized Packed Datasets, On-the-Fly Packing RFC, AdamW ScheduleFree


Torchtune ▷ #papers (13 messages🔥):

Optimized Newton-Schulz Kernel, Triton Matmul Tutorial, Muon Merges, Deepseek v3


MCP (Glama) ▷ #general (27 messages🔥):

MCP for semantic search, MCP for image creation and OCR, DestructiveHint ambiguity, Neo4j MCP outside Claude Desktop, List_tags tool implementation


MCP (Glama) ▷ #showcase (9 messages🔥):

MCP Validator Release, Glama Automations, AwesomeMCPs iOS App, mcp-server-webcrawl, Ilograph MCP Server


Manus.im Discord ▷ #general (28 messages🔥):

Manus credit usage, Manus video generation, Cloud Browser and Twitter, Manus and Stock Suggestions, Promotion of Manus


LlamaIndex ▷ #blog (1 messages):

Agents & MCP Hackathon, LlamaIndex


LlamaIndex ▷ #general (13 messages🔥):

Query Pipelines Deprecation, EU Region Latency Issues, LlamaIndex Free Features, Prompt Management Tools


Cohere ▷ #👋-introduce-yourself (6 messages):

ML Cybersecurity Integration, Model Compression, Deep Fake Detection, Adversarial ML


DSPy ▷ #show-and-tell (4 messages):

MCP-DSPy tool in VS Code, HF MCP tutorial, Context failures and fixes, @mcp.tool decorators


DSPy ▷ #general (1 messages):

bernhard_123: Hi. Are there any plans to migrate DSPy to other languages, e.g. Dart beside python ?


Nomic.ai (GPT4All) ▷ #general (5 messages):

GPT4All Build issues in WSL2, Qt Version Compatibility, GPT4All out of date


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 messages):

Course Certificates, Social Media Posts