not much happened today

**China's Xiaohongshu (Rednote) released dots.llm1**, a **142B parameter open-source Mixture-of-Experts (MoE) language model** with **14B active parameters** and a **32K context window**, pretrained on **11.2 trillion high-quality, non-synthetic tokens**. The release supports Docker, HuggingFace, and vLLM for inference and provides intermediate checkpoints every 1 trillion tokens, which enables flexible fine-tuning. Reported benchmarks claim it slightly surpasses **Qwen3 235B** on MMLU, though some concerns remain about benchmark selection and about verifying the no-synthetic-data claim. The release is notable for its truly open-source licensing and no synthetic data usage, and the community is optimistic about support in frameworks such as llama.cpp and mlx.
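
To put the quoted figures in proportion, here is a minimal arithmetic sketch in Python. It uses only the numbers stated in the summary above. The helper names are hypothetical, and it assumes checkpoints fall exactly on each 1-trillion-token boundary, which the actual release may not match.

```python
# Figures quoted in the summary above.
TOTAL_PARAMS_B = 142        # total parameters, in billions
ACTIVE_PARAMS_B = 14        # parameters active per token, in billions
PRETRAIN_TOKENS_T = 11.2    # pretraining tokens, in trillions
CHECKPOINT_EVERY_T = 1.0    # checkpoint interval, in trillions of tokens


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters used for any single token."""
    return active_b / total_b


def checkpoint_marks(total_t: float, every_t: float) -> list[float]:
    """Token counts (in trillions) at which a full interval has elapsed.

    Assumption: one checkpoint per completed interval, no extra final one.
    """
    count = int(total_t // every_t)
    return [every_t * (i + 1) for i in range(count)]


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    marks = checkpoint_marks(PRETRAIN_TOKENS_T, CHECKPOINT_EVERY_T)
    print(f"active share per token: {frac:.1%}")
    print(f"intermediate checkpoints under this assumption: {len(marks)}")
```

Under these assumptions, roughly a tenth of the parameters are active for each token, which is why a 142B MoE model can have inference costs closer to those of a much smaller dense model.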

a quiet day

AI News for 6/5/2025-6/6/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 7362 messages) for you. Estimated reading time saved (at 200wpm): 647 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

The MechInterp pod with Anthropic is worthwhile:

https://www.youtube.com/watch?v=9YQW2mH9FyA


AI Twitter Recap

pipeline down again!


AI Reddit Recap

/r/LocalLlama Recap

1. Rednote dots.llm Model Launch and Performance Benchmarks

2. Recent Efficient Edge and Open LLM Releases (OpenThinker3 & MiniCPM4)

3. On-device AI Application Showcases

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Gemini 2.5 Pro and Other Model Benchmark Results

2. Autonomous Delivery Robots and Figure's Innovations

3. OpenAI & Claude Model Privacy and Community Complaints


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mayhem: Gemini's Rollercoaster, Qwen's Ascent, and Claude's Expansion

Theme 2: Data Deluge: EleutherAI's Common Pile Sets New Open Standard

Theme 3: Dev Tool Drama: Cursor's $10B Boom & Bust, MCP's Many Faces, Unsloth's Trending Tricks

Theme 4: Silicon Sizzlers & Kernel Conundrums: ROCm on Windows, Tinygrad's Tussles

Theme 5: Trust Traps & Truth Trials: Deepfakes Deceive, Benchmarks Baffle, Vixra Vanquished


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Eleuther Discord


Cursor Community Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


GPU MODE Discord


HuggingFace Discord


aider (Paul Gauthier) Discord


LM Studio Discord


Modular (Mojo 🔥) Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


MCP (Glama) Discord


Latent Space Discord


Torchtune Discord


Yannick Kilcher Discord


tinygrad (George Hotz) Discord


Manus.im Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1263 messages🔥🔥🔥):

Perplexity AI Limits, Android vs iPhone, Comet browser release date, AI Model Ranking, Scheduled actions in Gemini


Perplexity AI ▷ #sharing (4 messages):

ArtisticAI, Michael Tait, Pakistan's diplomacy, Dark Tetrad, Cruelty


Perplexity AI ▷ #pplx-api (21 messages🔥):

Sonar deep research upgrades, Academic mode on all models, Richer Citations, Camera integration to voice chat, Formal reasoning for coding


LMArena ▷ #general (1282 messages🔥🔥🔥):

Model Generation, Google's Gemini 2.5 Pro, Mistral Le Chat API, Kingfall model, Livebench concerns


LMArena ▷ #announcements (1 message):

LMArena Test Garden, Early Access Feedback Program


Eleuther ▷ #announcements (1 message):

Common Pile v0.1, Openly Licensed LLMs, Comma v0.1-1T, Comma v0.1-2T, Ethical language model ecosystem


Eleuther ▷ #general (648 messages🔥🔥🔥):

LLMs trained in-context by inexpert humans, LLM Memory and Abuse, Synthetic Data for LLM Training, Common Pile dataset, Sycophancy in LLMs


Eleuther ▷ #research (45 messages🔥):

Attention Pre-Transformer, Schmidhuber's linear attention, vixra vs arxiv, Evolving LLMs Through Text-Based Self-Play, Point Cloud Completion


Eleuther ▷ #scaling-laws (1 message):

Funding for Non-LLM AI


Eleuther ▷ #interpretability-general (1 message):

MLP Weights, Vocabulary Embedding Space, Project Visualization


Eleuther ▷ #lm-thunderdome (2 messages):

Answer Extraction, Reasoning Models, lm_eval, Output Preferences, LLM as Judge


Eleuther ▷ #gpt-neox-dev (5 messages):

Rotary Percentage Configuration, Per-Layer Attention Specification


Cursor Community ▷ #general (339 messages🔥🔥):

Gemini 06-05, Cursor tools issues, Model Merging, Cursor's documentation, Gemini Model Update


Cursor Community ▷ #background-agents (16 messages🔥):

Cursor Github Connection Issues, Background Agent Default Environment Creation, Background Agent Configuration, Background Agent Hosting Options, Background Agents same cursor rules?


OpenAI ▷ #ai-discussions (236 messages🔥🔥):

Gemini vs ChatGPT, Gemini 2.5 pro, O3 Issues and hallucinations, Veo 3 Limitations, ARC-AGI-1 vs ARC-AGI-2


OpenAI ▷ #gpt-4-discussions (1 message):

Choosing the correct forum for AI/OpenAI questions, Software Developer Seeks Proper Channel for AI/OpenAI Expertise


OpenAI ▷ #prompt-engineering (45 messages🔥):

Y-Combinator podcast Prompting evaluation, Meta Prompting, Character and Audio consistency using Veo3, Tracking Prompt versions, ChatGPT memory capacity with PDFs


OpenAI ▷ #api-discussions (45 messages🔥):

Prompt engineering and evaluation mechanisms, Meta prompting, ChatGPT memory capacity and PDF usage, Sora prompt censorship bypass, File format preferences


Unsloth AI (Daniel Han) ▷ #general (187 messages🔥🔥):

TTS Benchmarks, Qwen3 Releases, Chrome autofill issues, Unsloth Notebooks trending, Licensed data for language modeling


Unsloth AI (Daniel Han) ▷ #off-topic (3 messages):

Speeding up LLM inference, Triton, Optimized kernels for sparse/quantized LLMs, Triton contribution, Android issue diagnosis


Unsloth AI (Daniel Han) ▷ #help (126 messages🔥🔥):

VS Code, Github Copilot Autocomplete, Unsloth Local Fine Tuning, Validation Dataset Issues, Qwen2.5-VL-3B Nan Loss


GPU MODE ▷ #general (2 messages):

Hopper GPU, ThunderKittens


GPU MODE ▷ #triton (5 messages):

Megakernel in Triton, Full-model kernel, Memory transfer bottlenecks, Triton vs CUDA Kernel Performance


GPU MODE ▷ #cuda (10 messages🔥):

GMEM Coalescing, L1 Caching, CUDA and Memory Optimization, Atomics in DL Kernels, GPU Physics


GPU MODE ▷ #torch (9 messages🔥):

torch.compile vs aitemplate, AOTInductor, tlparse graph, custom_op function, MoE expert routing in torch.compile


GPU MODE ▷ #cool-links (1 message):

real.optimus.prime: https://scalingintelligence.stanford.edu/blogs/tokasaurus/


GPU MODE ▷ #jobs (1 message):

SafeAD careers, CV roles, ML roles


GPU MODE ▷ #beginner (7 messages):

GPU access costs, Torch benchmarking, CUDA timing


GPU MODE ▷ #jax (1 message):

blueredblue: How does ffi_call work with pmap, will one kernel get launched per device?


GPU MODE ▷ #irl-meetup (1 message):

GTC Paris, CUDA C++ Workshop, Connect With the Experts


GPU MODE ▷ #rocm (2 messages):

pytorch+ROCm on Windows, Radeon GPUs, TheRock, strix halo, gfx1151


GPU MODE ▷ #liger-kernel (1 message):

as_ai: Will take a look, thanks for sharing!


GPU MODE ▷ #self-promotion (4 messages):

Fluxions Open Source Model, Job Inquiry, Efficient Matrix Transpose, GPU-heavy tech


GPU MODE ▷ #🍿 (101 messages🔥🔥):

Triton Kernel Generation, Scalable Environments, Synthetic Data, Kernel Optimization, Task Diversification


GPU MODE ▷ #thunderkittens (2 messages):

AMD Porting, Matmul Kernel


GPU MODE ▷ #submissions (15 messages🔥):

AMD MI300 performance, H100 grayscale benchmark, T4 prefixsum, AMD FP8 MM, A100 vectoradd


GPU MODE ▷ #tpu (2 messages):

SparseCores, TPUs, Transformer Training, Transformer Inference, Nvidia TensorCore


GPU MODE ▷ #factorio-learning-env (9 messages🔥):

FLE API, Hugging Face Agents course


GPU MODE ▷ #amd-competition (17 messages🔥):

AMD FP8, H100 Submission, Backward Pass, Solution Write-Ups


GPU MODE ▷ #cutlass (7 messages):

Cutlass Turing TensorOp GEMM Example, CuTe Layout Interpretation, Visualizing CuTe Physical Layouts


HuggingFace ▷ #general (146 messages🔥🔥):

Network Bandwidth for 30 Machines, LLM access to terminal/browser, DDR5 RAM limitations on AMD Zen 5, Hugging Face Hub Outage, OCR models


HuggingFace ▷ #today-im-learning (2 messages):

Fraud Detection in Finance, Resources for Learning Fraud Detection


HuggingFace ▷ #cool-finds (1 message):

0xcc6434: Morning


HuggingFace ▷ #i-made-this (6 messages):

ConvNeXt-Tiny model, audio deepfake detection, Gradio application, PDF parser, DeepFake detection company


HuggingFace ▷ #computer-vision (1 message):

Hugging Face Computer Vision Hangout, Pruna AI, Image Generation Speed


HuggingFace ▷ #gradio-announcements (1 message):

Hackathon Extension, Builder Community, Prize Pool


HuggingFace ▷ #agents-course (12 messages🔥):

Gemini Frameworks, Smolagents with Gemini, Monthly Certifications, GPT-4o Parsing Errors


aider (Paul Gauthier) ▷ #general (146 messages🔥🔥):

Gemini 2.5 Pro Evaluation, Kingfall benchmark results, Opus vs Gemini, Context handling in models


aider (Paul Gauthier) ▷ #questions-and-tips (12 messages🔥):

aider vs cursor, gemini stt, superwhisper, vllm server with aider, cpp/rust for embedded work


LM Studio ▷ #general (76 messages🔥🔥):

App Based Spam, Gemini Rate Limits, Open Thinker Model vs Qwen3-4B, LM Studio and OpenAI API, LM Studio RAG Embedding Model


LM Studio ▷ #hardware-discussion (82 messages🔥🔥):

LM Studio Ryzen AI NPU, Qwen3 Speed, Strix Halo Benchmarks, Llama 3.3 70B performance, Model Quantization


Modular (Mojo 🔥) ▷ #general (2 messages):

Modular, magic, pixi, Mojo upgrade, memory alignment


Modular (Mojo 🔥) ▷ #mojo (144 messages🔥🔥):

Mojo in Bioinformatics, immutable variables, terse syntax, LLMs / prompts for writing Mojo code, Intel Mac build


Nous Research AI ▷ #general (129 messages🔥🔥):

Tool Integrated Reasoning, Atropos environments for tool calling, LLM Data Copyright Issues, AllenAI's OLMo Models and Reproducibility, Training LLMs from Scratch


Nous Research AI ▷ #ask-about-llms (5 messages):

Voigt-Kampff test, Obsidian, XQuartz, Docker, Hermes


Nous Research AI ▷ #interesting-links (1 message):

wandabells: https://www.deeplearning.ai/courses/


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

OpenRouter, RSS Feed for models, API models


OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

insight_cheats: For the gooners - https://personality.gg


OpenRouter (Alex Atallah) ▷ #general (130 messages🔥🔥):

Gemini 2.5 Pro regression, Claude Max vs Gemini pricing, OpenAI logging practices, Gemini 2.5 flash lite, GPT-4.1 mini is good for many routine tasks


MCP (Glama) ▷ #general (76 messages🔥🔥):

MCP Inspector Fork, MCP Server on Cloudflare Workers, MCP Use Cases, MCP Clients, Real-time Codebase Indexing


MCP (Glama) ▷ #showcase (2 messages):

inked github, Slack MCP server


Latent Space ▷ #ai-general-chat (56 messages🔥🔥):

Claude Projects Content Increase, Qwen3-Embedding and Qwen3-Reranker Series, Netlify DB Serverless Postgres, Zapier AI Fluency Measurement, Cursor Funding Round


Torchtune ▷ #dev (47 messages🔥):

HFModelTokenizer, Axolotl loss curves, Reward Modeling RFC, Fused Optimizer Issues


Yannick Kilcher ▷ #general (33 messages🔥):

Training LLMs, Datasets for LLMs, Clustering emails with RAG, Meta's OPT-175B logbook, GPT sycophancy


Yannick Kilcher ▷ #paper-discussion (2 messages):

Vec2Vec Code Review, Translators/Transformers directory, Background review of implementation


Yannick Kilcher ▷ #ml-news (7 messages):

Qwen3 Embedding, Nemotron-H Reasoning Model, EleutherAI and Public Data LLMs, Cohere's Business Model, RAG Marketing


tinygrad (George Hotz) ▷ #general (1 message):

LLVM, loop splitting, ROCm, InductiveRangeCheckElimination


tinygrad (George Hotz) ▷ #learn-tinygrad (23 messages🔥):

tinygrad kernel optimization, hlb_cifar10 data shuffling, OpenCL kernel performance, GPU indexing kernels


Manus.im Discord ▷ #general (19 messages🔥):

Video Function, Credit Costs, Manish Model Update, Manus partnership with Claude, Egyptian users


LlamaIndex ▷ #blog (3 messages):

AI Agents, MCP vs A2A, Vector Databases


LlamaIndex ▷ #general (9 messages🔥):

files_via_content mode, AgentWorkflow orchestration, Multi-Agent setup


LlamaIndex ▷ #ai-discussion (1 message):

SchemaLLMPathExtractor, Graph database population


Nomic.ai (GPT4All) ▷ #general (5 messages):

VPS for API server, RAM Pricing, Mistral MOE, Deepseek MOE, Chinese CPU vendor


DSPy ▷ #general (3 messages):

Session Thanks, Blockchain Engineer Introduction, AI Agent Engineer Introduction


Cohere ▷ #💬-general (1 message):

stormortiz: here is an magic place


Cohere ▷ #🤝-introductions (2 messages):

Introductions, ML Audio Engineer


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 message):

radhakrishnan_20251: thanks for the update


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 message):

MCP Tools Authorization, Enterprise OAuth