Frozen AI News archive

not much happened today

The **Qwen** team released quantized versions of its Qwen3 models, including the **14B**, **32B**, and **235B** parameter sizes, with Qwen3-235B showing promising coding capabilities. **Microsoft** launched **Phi-4-reasoning**, a **14B**-parameter model distilled from OpenAI's o3-mini that emphasizes supervised fine-tuning and reinforcement learning and outperforms larger models on some benchmarks. **Cohere's Command A** leads SQL performance on Bird Bench. **Google** introduced the **TRAJAN** eval for temporal consistency in video generation and updated the **Gemini** OpenAI compatibility layer. **Inception Labs** launched a diffusion LLM API claiming 5x speed improvements over autoregressive models. Community rankings show **OpenAI's o3** model debuting strongly in web app-building tasks. Other releases include **AllenAI's OLMo2 1B** and additional Phi 4 variants. *"Qwen3-235B shows promise for coding"* and *"Phi-4-reasoning tech report emphasizes SFT gains"* highlight key advancements.

a quiet weekend.

AI News for 5/1/2025-5/2/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels and 4793 messages) for you. Estimated reading time saved (at 200wpm): 473 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

You could read the second OpenAI sycophancy postmortem, or you could read about the new MCP Auth spec. But you really don't have to. Happy weekend.


AI Twitter Recap

Language Models, Benchmarks, and Evaluations

AI Agents and Tool Use

New Applications and Use Cases

Open Source and Community

Other Topics

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Qwen3 Model Deployment and Fine-tuning Updates

2. New Model and Benchmark Tools (Granite, SOLO, LLM GPU Calculator)

3. Local Llama Model Running Experiences & Memes

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Playing and Completing Pokémon via Gemini Benchmarks

2. Novel Personal and Emotional Experiences with AI Chatbots (Claude, ChatGPT)

3. Cutting Edge AI Model Releases and Testing (OpenAI GPT-4o and Gemini)


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mania: Performance, Quirks, and Quantization

Theme 2: Dev Tools Duel: Frameworks, APIs, and Debugging Dramas

Theme 3: GPU Grind: Hardware Heats Up, Kernels Compete

Theme 4: AI Ecosystem Evolution: Releases, Roles, and Ruckus

Theme 5: Community Contributions & Collaboration Corner


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


Nous Research AI Discord


LM Studio Discord


OpenAI Discord


GPU MODE Discord


HuggingFace Discord


Latent Space Discord


Yannic Kilcher Discord


Notebook LM Discord


Manus.im Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Eleuther Discord


LlamaIndex Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


tinygrad (George Hotz) Discord


The MLOps @Chipro, Codeium (Windsurf), Gorilla LLM (Berkeley Function Calling), and AI21 Labs (Jamba) Discords have no new messages. If these guilds have been quiet for too long, let us know and we will remove them.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 message):

Claude Sonnet Routing Issue, Internal Flag Misconfiguration, Aravind's Detailed Explanation


Perplexity AI ▷ #general (710 messages🔥🔥🔥):

Perplexity AI app, Image generation, Deepseek r2, GTA 6 delay, Grok 3 and Psychology


Perplexity AI ▷ #pplx-api (8 messages🔥):

Sonar API with LlamaIndex RAG project, Perplexity API Cookbook, Perplexity API purchase issues


Unsloth AI (Daniel Han) ▷ #general (314 messages🔥🔥):

Qwen3 Base Model Training Issue, OpenRouter API Integration, 3070 Server Setup, rsLoRA Unpredictability, GRPO Model Instruction Following


Unsloth AI (Daniel Han) ▷ #announcements (1 message):

Qwen3, Dynamic 2.0, Llama 4 + Meta, Phi-4 reasoning, DeepSeek


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

XDNA Drivers, Arch Linux, Ubuntu Live Disk


Unsloth AI (Daniel Han) ▷ #help (178 messages🔥🔥):

Custom Architectures Support, GGUF Export Issues, Qwen3 fine-tuning guide, Tokenizer issues, Training GRPO models


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

Qwen-3-14B, PEFT fine-tuning, Gemini reasoning outputs, Hugging Face datasets


Unsloth AI (Daniel Han) ▷ #research (2 messages):

GANs fine tune LLMs, Adversarial Training


aider (Paul Gauthier) ▷ #general (283 messages🔥🔥):

Claude MaxFeed with docs, trafilatura to crawl library and language documentation, commit message prompt, Git Commit Message Conventions, Lazygit based TUI


aider (Paul Gauthier) ▷ #questions-and-tips (69 messages🔥🔥):

Repomix and Aider Integration, AI Studio vs Aider Workflow, Tips for library updates with AI models, Context Generation from Specific Projects, Diff mode in Gemini 2.5


aider (Paul Gauthier) ▷ #links (1 message):

dex73r: https://x.com/wmhuo168/status/1918014248040484934


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

O3 model, OpenRouter Chatroom, BYOK access


OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

PDF Processing, Flathub Toledo1 App, Image Upload


OpenRouter (Alex Atallah) ▷ #general (294 messages🔥🔥):

Claude issues on OpenRouter, Aider leaderboard model performance, DeepSeek R1 issues, Streaming with usage information in Python, Gemini experimental limitations


Cursor Community ▷ #general (277 messages🔥🔥):

Cursor 3.7 vs 3.5, Realtime Cursor Usage Monitoring, C# Debugging in Cursor, o3 Model Issues, Cursor Ambassador Role


Nous Research AI ▷ #general (254 messages🔥🔥):

Gemini vs Sonnet Code Performance, Cursor code integration with Gemini, Claude debugs via screenshots, v0 design limitations, API update challenges


Nous Research AI ▷ #ask-about-llms (2 messages):

Nous Minos Classifier, vLLM Support


Nous Research AI ▷ #interesting-links (6 messages):

Nous Research, Decentralized AI, Sentient Auto Correction


LM Studio ▷ #general (207 messages🔥🔥):

LM Studio model downloading vs manual download, Hardware requirements for running LLMs, Quantization effect, Context cache, LM Studio API


LM Studio ▷ #hardware-discussion (52 messages🔥):

Laptop thickness preferences, Llama 70b Q4 performance, Qwen3-32b Q4_K_M token generation speed, Qwen3 30b MOE vs Non-MOE, Multi-GPU setup for finetuning


OpenAI ▷ #annnouncements (1 message):

GPT-4o Sycophancy, ChatGPT Update


OpenAI ▷ #ai-discussions (102 messages🔥🔥):

Gemini 2.5 Pro vs Google AI Studio, GPT-4o Context Length, Grok 3 for Roleplaying, Qwen Series Performance, AI Video Generation Tools


OpenAI ▷ #gpt-4-discussions (27 messages🔥):

o3 Search, API to list response_id, error in message stream, Reasoning Capabilities vs. Search Functionality in o3, Usage of o4-mini vs o3


OpenAI ▷ #prompt-engineering (18 messages🔥):

Sorting Mendeleev table, OpenAI function calling, API usage with free ChatGPT


OpenAI ▷ #api-discussions (18 messages🔥):

Sorting Mendeleev Table by Density, Multiple Function Calls with OpenAI, API Access with Free ChatGPT


GPU MODE ▷ #general (7 messages):

LoRA fine-tuning with FSDP, Saving model error, Qwen2.5-0.5B-Instruct, Deepspeed vs FSDP


GPU MODE ▷ #triton (6 messages):

AOT Triton, Kernel Packaging, Triton Autotuner, libdevice support for round


GPU MODE ▷ #cuda (8 messages🔥):

Nvidia drops cross-compilation on Mac, Cutlass Tutorials


GPU MODE ▷ #torch (2 messages):

Torch Compile, Dynamic Input Shapes, Quadratic Algorithm


GPU MODE ▷ #cool-links (2 messages):

HF transformers tensor packing, CUDA JIT compiling, cute-kernels library


GPU MODE ▷ #beginner (1 message):

lissim.: Where can i learn more about the topics pipeline and stages?


GPU MODE ▷ #rocm (2 messages):

NPS Bandwidth, Cache Bypassing


GPU MODE ▷ #liger-kernel (1 message):

benasd: Can anyone review this bug fix PR? https://github.com/linkedin/Liger-Kernel/pull/632


GPU MODE ▷ #metal (2 messages):

QR writing difficulty, Deep Research AI overestimation


GPU MODE ▷ #self-promotion (32 messages🔥):

Hopper Architecture Optimization, Matrix Transpose Kernel, TMA and Swizzling Patterns, H100 Bandwidth Variants, Memory Layouts and Performance


GPU MODE ▷ #🍿 (1 message):

Popcorn Project, Contribution Opportunities


GPU MODE ▷ #submissions (52 messages🔥):

amd-fp8-mm leaderboard, amd-mixture-of-experts leaderboard, histogram leaderboard, MI300, A100


GPU MODE ▷ #status (1 message):

Leaderboard Vulnerability, Timeout Issues, MoE problem, AMD


GPU MODE ▷ #hardware (4 messages):

CUDA 12.9, CC 10.3, CC 12.1, NVIDIA GPU Table, Blackwell Ultra


GPU MODE ▷ #amd-competition (18 messages🔥):

Triton Autotune, Composable Kernel Compilation, Discord Cluster Manager


GPU MODE ▷ #mojo (5 messages):

Mojo Kernels, GPU Module, Modular Special Repo


HuggingFace ▷ #general (33 messages🔥):

GPU Quota Recharge, HF Usage Limit, Educational Resources on HF, Gradio Server Deployment on Production, Agent Course Error


HuggingFace ▷ #today-im-learning (1 message):

cakiki: <@1185985139340222495> please don't cross-post


HuggingFace ▷ #i-made-this (2 messages):

PdfItDown, TTS Arena


HuggingFace ▷ #agents-course (100 messages🔥🔥):

Gemini API Attribute Errors, Phoenix UI Issues, Gemini vs GPT-4o, Langgraph Migration, Inference API Payment Required


Latent Space ▷ #ai-general-chat (84 messages🔥🔥):

Forward Deployed Engineer, webpage to markdown, AI Events, MCP Authorization Specification, Xcode x Anthropic


Latent Space ▷ #ai-in-action-club (49 messages🔥):

A2A vs MCP, Discord Stream Issues, Google A2A Usage, A2A better than MCP


Yannic Kilcher ▷ #general (117 messages🔥🔥):

promptfoo.dev, self-supervised learning, AI edits its last message, Bayesian networks vs doctors, American sign language model


Yannic Kilcher ▷ #paper-discussion (10 messages🔥):

Perception Encoder (PE), Vision-Language Learning, AI-generated summaries, New Paper Discussion


Yannic Kilcher ▷ #ml-news (4 messages):

Tariffs on Electronics, AI Leaderboard Bias, Google AI Chatbot Ads


Notebook LM ▷ #announcements (2 messages):

NotebookLM App Waitlist, User Experience Research Program


Notebook LM ▷ #use-cases (10 messages🔥):

Long-form podcasts with Notebook LM, Podcast Speaker Diarization issues, Interactive Mode is INSANE, Gemini 2.5 fix, Podcast length control


Notebook LM ▷ #general (75 messages🔥🔥):

NotebookLM prior knowledge, Discover Sources button, Gemini Advanced vs ChatGPT, Translate audio to transcript, App Store pre-order


Manus.im Discord ▷ #general (84 messages🔥🔥):

Manus invitation codes, Time zones politics, AI applications for healthcare, Biological computing with human brain cells, File manager in Manus


Modular (Mojo 🔥) ▷ #general (1 message):

GPU Mode livestream, Mojo Release Cadence


Modular (Mojo 🔥) ▷ #mojo (70 messages🔥🔥):

C++ package managers, C++ ecosystem, Fedora 42, Mojo FFI, Global Mutable Data


MCP (Glama) ▷ #general (51 messages🔥):

Claude Integrations, Remote MCPs, Revenue Share for App Creators, Atlassian's Hosted Remote Server, AzureOpenAI with playwright-mcp


MCP (Glama) ▷ #showcase (5 messages):

Model Enhancement Servers, MCP Hosting, MCPJam, Sequential Thinking Servers, Visual Reasoning Servers


Eleuther ▷ #general (3 messages):

LLM Hallucinations, Jailbreak Methods, Adversarial Robustness, Activation Probing


Eleuther ▷ #research (8 messages🔥):

Weight Decay as Forgetting, Compacted Latent Space, Differential Memory Matrix, LR and WD Coupling


Eleuther ▷ #interpretability-general (18 messages🔥):

Attention Maps in Diffusion Transformers, RoPE in Transformer Layers


Eleuther ▷ #lm-thunderdome (26 messages🔥):

Gemma 3 27B issues, Qwen models, GSM8k task in lm-evaluation-harness, ICML workshop submission


LlamaIndex ▷ #blog (2 messages):

LlamaIndex vs Claude 3.7, AI SDRs


LlamaIndex ▷ #general (24 messages🔥):

LLMs Non-Determinism, Error Handling, Fuzzy Matching, Chat Store Issue, RAG Accuracy


Cohere ▷ #💬-general (10 messages🔥):

Chat UI Missing Functionality, Embed-4 Embeddings Extraction, Internal Server Error, Email Support


Cohere ▷ #🔌-api-discussions (1 message):

Cohere Embed V4, Cohere Embed Jobs


Cohere ▷ #🤝-introductions (4 messages):

Data Warehousing, ETL, Databricks, Informatica Cloud, PyTorch


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 message):

Submission Guidelines, Entrepreneurship Track, Research Track


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (7 messages):

Labs release, Assignment Deadlines


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (5 messages):

MOOC lectures, AgentX hackathon, Dawn Song's Keynote


Nomic.ai (GPT4All) ▷ #general (6 messages):

24GB GPUs on eBay, Jinja chat template, VRAM vs RAM, PDF Upload Issues


DSPy ▷ #general (4 messages):

DSPy intro on YouTube, vLLM for OCR tasks, dspy.ai landing page, NeurIPS deadline, GenseeAI survey


tinygrad (George Hotz) ▷ #general (2 messages):

Windows Support in Tinygrad