Frozen AI News archive

not much happened today

**Tencent's Hunyuan-Turbos** has risen to #8 on the LMArena leaderboard, showing strong performance across major categories and significant improvement since February. The **Qwen3 model family**, especially the **Qwen3 235B-A22B (Reasoning)** model, is noted for its intelligence and efficient parameter usage. **OpenAI** introduced **HealthBench**, a new health evaluation benchmark developed with input from over **250 physicians**, where models like **o3**, **GPT-4.1 nano**, and **Grok 3** showed strong results. **ByteDance** released **Seed1.5-VL**, a vision-language model with a 532M-parameter vision encoder and a 20B active parameter MoE LLM, achieving state-of-the-art results on 38 public benchmarks. In vision-language, **Kling 2.0** leads image-to-video generation, and **Gemini 2.5 Pro** excels in video understanding with advanced multimodal capabilities. Meta's Vision-Language-Action framework and updates on VLMs for 2025 were also highlighted.

a quiet day.

AI News for 5/12/2025-5/13/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 4553 messages) for you. Estimated reading time saved (at 200wpm): 445 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Gergely Orosz has a worthwhile read on the ChatGPT Images launch, which Simon Willison has excerpted. The WizardLM team left MSR China to join Tencent and coincidentally launched Tencent Hunyuan-Turbos, a closed model that is now the top-ranked Chinese model on LMArena.

There are 20 full-conference Early Bird tickets left for the AI Engineer World's Fair, now T-minus 3 weeks to go, and its speaker, workshop, and event lists continue to firm up.


AI Twitter Recap

Language Models and Benchmarks

Vision Language Models

AI Engineering and Tooling

Model Release and Performance

HuggingFace and Inference

Career and Industry Trends

Meme/Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Qwen3 Model Release and Technical Details

2. Trends and Architecture in New MoE Models

3. Experimental LLM Use Cases and Demos

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Claude Code Recent Updates and User Experiences

2. HealthBench, AI Advances, and OpenAI Model Milestones

3. Workplace Transitions to AI Art and Stable Diffusion Hardware Builds


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Cutting-Edge Models and Performance Showdowns

Theme 2: Enhancing LLM Interactions and Local Deployment

Theme 3: GPU Programming and Acceleration Advances

Theme 4: Platform Quirks, API Changes, and User Experience Hiccups

Theme 5: AI Community Buzz: From Governance to Groundbreaking Tools


Discord: High level Discord summaries

Perplexity AI Discord


LM Studio Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Yannick Kilcher Discord


LMArena Discord


OpenAI Discord


GPU MODE Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


Eleuther Discord


HuggingFace Discord


Torchtune Discord


Notebook LM Discord


MCP (Glama) Discord


Latent Space Discord


DSPy Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Cohere Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1119 messages🔥🔥🔥):

Deep Research, MerlinAI, AI Studio, Sonar


Perplexity AI ▷ #sharing (1 message):

meijer5838: https://www.perplexity.ai/page/token-minimization-for-sustain-1Cbiopx3T3C5SWyrYTVvdw


Perplexity AI ▷ #pplx-api (9 messages🔥):

Polling for results, Sonar Pro vs Claude 3.5 Sonnet, API Access for Pro Users, Payment Plan for API Access


LM Studio ▷ #general (232 messages🔥🔥):

Connecting local LLMs to Cursor AI, CUDA support on Linux, Qwen3 Model, LM Studio API model influencing, GGUF quants


LM Studio ▷ #hardware-discussion (334 messages🔥🔥):

Intel ARC support in LM Studio, GPU/RAM usage monitoring on macOS, Netdata for Linux monitoring, RTX 5060 Ti benchmarks, ROCm vs. Vulkan


Cursor Community ▷ #general (434 messages🔥🔥🔥):

Cursor 0.50 Update Issues, Cursor API Key Exposure, Token count display within chats, Claude code guides, Background agents rollout


Unsloth AI (Daniel Han) ▷ #general (283 messages🔥🔥):

Unsloth Dynamic 2.0 GGUF quants, Llama-3.1-8B-Instruct, NousResearch DeepHermes-3, Qwen3 GRPO notebook, Base64 Image formatting


Unsloth AI (Daniel Han) ▷ #off-topic (14 messages🔥):

Kaggle Colab Upgrades, HealthBench Evaluation Benchmark, O3 Performance, GPT-4.1 Coding


Unsloth AI (Daniel Han) ▷ #help (103 messages🔥🔥):

Multiprocessing Disable, Coding LLM Assistance, vLLM vs Exl2 Batch Inference, Multi-GPU Support, Autoregressive TTS Inference


Unsloth AI (Daniel Han) ▷ #research (6 messages):

Meta FAIR updates, Sakana AI, Job Postings, arXiv Papers


Yannick Kilcher ▷ #general (304 messages🔥🔥):

Turing completeness of LLMs, Treaty between humanity and AI, RL-Diffusion Model Debate, Hamiltonian Neural Networks and Transformers


Yannick Kilcher ▷ #paper-discussion (27 messages🔥):

Physics of LLMs, Grade School Math benchmarks, GSM8K, Language Models Reasoning Skills


Yannick Kilcher ▷ #agents (1 message):

Sakana, maze examples, ARC


Yannick Kilcher ▷ #ml-news (3 messages):

AI Regulation Ban, Budget Reconciliation bill, State and Local Governments


LMArena ▷ #general (265 messages🔥🔥):

Deepseek V3 benchmark, o3 hallucination, Gemini 2.5, Grok 3.5, DrakeClaw


LMArena ▷ #announcements (1 message):

Discord Server Changes, Independent Scrolling Preview


OpenAI ▷ #ai-discussions (147 messages🔥🔥):

GPT-4o, Claude for Coding, AI Models for Coding, AI Industry Investment, Grok roasting


OpenAI ▷ #gpt-4-discussions (12 messages🔥):

GPT App Freezing, GPT Memory, GPT-4o


OpenAI ▷ #prompt-engineering (15 messages🔥):

Companion Mode, GPT for web app coding, Guardrails for HR data


OpenAI ▷ #api-discussions (15 messages🔥):

Companion Mode, PII guardrails, ChatGPT for coding, ChatGPT model selector


GPU MODE ▷ #general (13 messages🔥):

memory bound operations, optimizing LLM using SGLang, tensor compiler project, CUDA memory sharing


GPU MODE ▷ #cuda (38 messages🔥):

CUDA thread indexing difficulties, CUDA streams and device association, Shared memory allocation between kernels


GPU MODE ▷ #torch (5 messages):

at::Tag::needs_fixed_stride_order, CUDA streams API, H200


GPU MODE ▷ #jobs (1 message):

C-Gen AI, Senior Software Engineer, GPU cluster technology


GPU MODE ▷ #beginner (1 message):

guto2750: Hello, someone can help me, please! How can i put my cute code in python to run


GPU MODE ▷ #rocm (2 messages):

Memory Bandwidth Benchmarking, MI300X vs H100 vs H200, CU Driven Benchmarks


GPU MODE ▷ #self-promotion (1 message):

X post screenshot, Image analysis


GPU MODE ▷ #submissions (67 messages🔥🔥):

MI300, amd-fp8-mm leaderboard, amd-mixture-of-experts leaderboard


GPU MODE ▷ #hardware (1 message):

neonninjaastro_63946: wow thanks this was a great resource


GPU MODE ▷ #factorio-learning-env (5 messages):

Factorio Environment Costs, Collaboration Structure, Genetic Algorithm Blueprint, Dynamic Path-Finding Algorithm


GPU MODE ▷ #amd-competition (20 messages🔥):

Kernel Synchronicity, Measuring Kernel Execution Time, File Upload Errors, Ranked Run Timeouts


GPU MODE ▷ #cutlass (16 messages🔥):

Cutlass, Triton, torch.compile, CuTe DSL, CUTLASS 4.0 installation


GPU MODE ▷ #mojo (5 messages):

Mojo and PyTorch, Mojo as a language for writing custom ops, torch compile backend


Nous Research AI ▷ #announcements (2 messages):

RL Environments Hackathon, Atropos v0.2.0 Release, Axolotl Integration


Nous Research AI ▷ #general (133 messages🔥🔥):

Stripe AI foundation model for payments, Lower top up amount, Hackathon participants, Unsloth's Dynamic 2.0 GGUF Quant, Chain of Awareness Around the World


Nous Research AI ▷ #research-papers (2 messages):

Qwen3 vs Qwen2.5, Technical Report Analysis, Model Size Comparison


Nous Research AI ▷ #interesting-links (1 message):

Facebook BLT, Byte Latent Transformer


Nous Research AI ▷ #research-papers (2 messages):

Qwen3 vs Qwen2.5 performance, Qwen3 Technical Report analysis, Model Size Performance, Notable Observations


OpenRouter (Alex Atallah) ▷ #general (120 messages🔥🔥):

Chat Syncing, Corvid Comradeship, Gemini API on OpenRouter, DeepSeek API on OpenRouter, OpenRouter and Embeddings


Manus.im Discord ▷ #general (120 messages🔥🔥):

Manus Pro Subscription Experience, Fact Checks, Credits Disappearing After Cancelling Membership, Phone Verification, Daily Credit Usage


aider (Paul Gauthier) ▷ #general (66 messages🔥🔥):

Aider with CPU vs GPU, Aider as MCP tool in Claude, Aider and Context Caching, Tmux and Aider Navigation, Gemini Comments in Aider


aider (Paul Gauthier) ▷ #questions-and-tips (31 messages🔥):

AiderDesk Model Choices, Gemini Rate Limiting, yes-always Configuration Bug, Lean Context Management


Eleuther ▷ #general (38 messages🔥):

AI Governance, Compliance with AI, lm-eval-harness utility, AI parent legal hurdles, diffusion model prereqs


Eleuther ▷ #research (23 messages🔥):

Fusion Model Benchmarking, Multi-Agent RL, Memory Visualization, ML Topics Scope


Eleuther ▷ #interpretability-general (1 message):

Paper Review, Interpretability Research


Eleuther ▷ #lm-thunderdome (4 messages):

o3 optimization, multi-GPU lm-eval, accelerate launch


Eleuther ▷ #gpt-neox-dev (4 messages):

GPT-NeoX data shuffling, Lingua library, TorchTitan, Nanotron, Code Rot


HuggingFace ▷ #general (19 messages🔥):

ComfyUI users, GPU no longer supported, Inference Provider contact, System prompt limits, ML engineers for image processing


HuggingFace ▷ #today-im-learning (4 messages):

Knowledge Graphs with Agentic AI, Hugging Face GGUF models


HuggingFace ▷ #i-made-this (8 messages🔥):

Bytedance Seed Coder, LLM comparison website, libmtmd Android app, Voice AI assistant based on gemma 3, Rust chat templating


HuggingFace ▷ #computer-vision (1 message):

Three.js, .glb model, 2D image positioning, image detection, segmentation


HuggingFace ▷ #smol-course (3 messages):

Software Development Basics, LLM-Assisted Coding


HuggingFace ▷ #agents-course (25 messages🔥):

Chess API and FEN strings, Llama-3.2-3B-Instruct Errors, Hugging Face Space Stuck, Final Assignment Submission, LlamaIndex Section Difficulty


Torchtune ▷ #general (3 messages):

Finetuning libraries, Multi-GPU support, Unsloth, Fairseq2, Axolotl


Torchtune ▷ #dev (55 messages🔥🔥):

Llama3.1 tokenizer for 3.3 training, Kron and Muon optimizers in torchtune, HFModelTokenizer with Gemma chat template, ChatML template for Gemma


Notebook LM ▷ #use-cases (4 messages):

NotebookLM audio shortcomings, Invisible Sun TTRPG, NotebookLM for gaming content


Notebook LM ▷ #general (48 messages🔥):

NotebookLM invite delays, Audio language change issues, Folder system for note organization, NotebookLM use in education, iplusinteractif textbook integration


MCP (Glama) ▷ #general (39 messages🔥):

MCP Server conversion, OpenAPI to MCP, Claude Code MCP, Postgres MCP Server Connection Issues, Streamable HTTP MCP Servers


MCP (Glama) ▷ #showcase (6 messages):

MCP Integration, uniffi-rs for MCP, LLMs and Structured Inputs, magic_file MCP Tool, Local Goose Qwen3mcp Log Proxy


Latent Space ▷ #ai-general-chat (32 messages🔥):

Lilian Weng Chart, Gemini API Thinking Tokens, AI Technical Educators, Vertical SaaS for Restaurants, GPT-4 Launch Stories


DSPy ▷ #show-and-tell (1 message):

DSPy Blogpost, LLM Hacking, Bugcrowd


DSPy ▷ #general (16 messages🔥):

DSPy for Agentic Workflows, Data QA with DSPy, MIPRO vs Optuna, TypeScript equivalent to DSPy, DSPy module needing signatures


DSPy ▷ #examples (1 message):

Discord Message Links, Source code in Prompt


Modular (Mojo 🔥) ▷ #mojo (6 messages):

BigInt support, Convolution Puzzle Clarity


Modular (Mojo 🔥) ▷ #max (8 messages🔥):

Open Sourcing MAX Mojo APIs, MAX Graph Tutorials, Tensor Type Migration Code


LlamaIndex ▷ #announcements (1 message):

PapersChat, Deep Research Agent, Multilingual RAG, Invoice Reconciliation Agent, LlamaParse Updates


LlamaIndex ▷ #blog (2 messages):

LlamaIndex Memory API, AI Agents Memory Improvement, Short-term chat history, Long-term memory


LlamaIndex ▷ #general (3 messages):

google_genai integration, GoogleSearch, FunctionTool


tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

OpenCL implementation, tensor numel, device/backend, memory movement functions, view changes


Nomic.ai (GPT4All) ▷ #general (3 messages):

Creative Writing with oblix.ai, Local vs Cloud Model Orchestration, Edge Computing Savings


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 message):

Lambda Workshop, Nobel FutureTech Info Session