Frozen AI News archive

Granola launches team notes, while Notion launches meeting transcription

**GPT-4.1** is now available in **ChatGPT** for Plus, Pro, and Team users, focusing on coding and instruction following, with **GPT-4.1 mini** replacing **GPT-4o mini**. **Anthropic** is expected to release new **Claude** models, including **Claude Opus** and **Claude Sonnet**, and some criticism about hallucinations in OpenAI's **o3** was noted. **Alibaba** shared the **Qwen3 Technical Report**, and ByteDance's **Seed1.5-VL** posted strong benchmark results. **Meta FAIR** announced new models and datasets but faced criticism over **Llama 4**. **AM-Thinking-v1** launched on **Hugging Face** as a 32B-scale reasoning model. **Granola** raised $43M in a Series B and launched **Granola 2.0** with a Notion-like UI. The AI ecosystem shows rapid iteration and cloning of ideas, which puts the emphasis on execution and distribution.


Whisper is all you need.

AI News for 5/13/2025-5/14/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 4313 messages) for you. Estimated reading time saved (at 200wpm): 428 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We try to keep coverage to model- and code-specific news that we're pretty sure engineers will someday use at work, but occasionally smaller product launches are interesting fodder for commentary on the broader AI landscape, especially if the launches involve highly regarded work products like Notion or Granola.

There's an ongoing joke in biology that everything evolves into a crab. The same is happening in AI wrapper land: being recognized as valuable doesn't stop wrappers from being easy to clone. Bolt inspires Figma Make, Claude Code inspires OpenAI Codex, Deep Research inspires Deep Research inspires Research inspires DeepSearch, and on and on. Ideas are worth nothing; may the best distribution + execution win.

Granola used the occasion of its $43M Series B (at a $250M valuation) to launch "Granola 2.0", its collaborative version with a surprisingly... Notiony UI.

This comes a day after Notion's Ivan Zhao launched... an interesting Granola-lite feature.


AI Twitter Recap

Language Models and Releases

Agent Development and Tooling

AI Infrastructure and Tools

AI and Research Concepts

Industry, Business, and Economic Impacts

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

1. Benchmarking AMD Strix Halo and Qwen3 Models for Local LLM Inference

2. MAESTRO Local-First AI Research App Release and Benchmarks

3. BitNet R1 Ternary Model Finetune and Community Tools

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AlphaEvolve and DeepMind Breakthroughs in Coding and Science AI

2. Anthropic Claude Sonnet/Opus Model Release Anticipation and OpenAI Model Rollout

3. ChatGPT as New Internet Interface and Its Societal Impact


AI Discord Recap

A summary of Summaries of Summaries by gpt-4.1-2025-04-14

1. Model Benchmark Showdowns and Coding Performance

2. Distributed and Decentralized Training/Inference

3. Hardware and Performance Optimizations

4. Prompt Engineering, Tokenization, and Memory Mishaps


Discord: High level Discord summaries

Perplexity AI Discord


LM Studio Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


Cursor Community Discord


Yannic Kilcher Discord


OpenRouter (Alex Atallah) Discord


Manus.im Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


Eleuther Discord


Nous Research AI Discord


Notebook LM Discord


Latent Space Discord


HuggingFace Discord


MCP (Glama) Discord


Torchtune Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (748 messages🔥🔥🔥):

Android app custom model selection, Deep Research release date, Merlin AI pricing and web search quality, Perplexity AI Sonar vs GPT models, AI Studio for multimodal utility


Perplexity AI ▷ #sharing (2 messages):

Token Minimization, Sustain


Perplexity AI ▷ #pplx-api (12 messages🔥):

Sonar Model Benchmarks, Perplexity Pro API Access, New Developer Relations Resident, Sharepoint integration


LM Studio ▷ #general (176 messages🔥🔥):

Embedding Modules Issue, Benign Log Spam, LM Studio JIT, Building Models from Scratch, LM Studio Autoload Issues


LM Studio ▷ #hardware-discussion (450 messages🔥🔥🔥):

Gigabyte RTX 5060 Ti, PCIe 5.0 Benefits, qwen3-14b-q4km performance, Dual GPUs, ROCm on Linux vs Windows


LMArena ▷ #general (530 messages🔥🔥🔥):

DeepSeek R2 release, New Gemma models, Claude Neptune / 3.8 leaks, GPT-4.1 vs GPT-4o, O3 Pro release delays


LMArena ▷ #announcements (1 message):

Server Updates, Forum Category, Roles Creation, Moderation Improvements, Future Events


Unsloth AI (Daniel Han) ▷ #general (379 messages🔥🔥):

File:// usage, Qwen3 inference speed, Llama 3.2 Vision fine-tuning, mergekit and frankenmerge, Qwen3 GRPO notebook


Unsloth AI (Daniel Han) ▷ #off-topic (49 messages🔥):

O3 evaluation, GPT-4.1 coding, Qwen models, NEFTune


Unsloth AI (Daniel Han) ▷ #help (83 messages🔥🔥):

Vocabulary Size, Chat Templates and Base Models, Unsloth Performance Issues, GGUF compatibility, GRPO and Qwen3


Unsloth AI (Daniel Han) ▷ #research (5 messages):

Med-PaLM 2, QLoRA memory, ModernBERT context length


OpenAI ▷ #annnouncements (2 messages):

Safety Evaluations Hub, GPT-4.1, GPT-4.1 Mini


OpenAI ▷ #ai-discussions (151 messages🔥🔥):

Sentient AI conversation, ChatGPT models for coding, O3 model intelligence, ChatGPT Enterprise plan, AI-generated images on Instagram


OpenAI ▷ #gpt-4-discussions (12 messages🔥):

GPT-4o for web app coding, Structured outputs for Azure OpenAI assistants, Node ID errors, Chat delays on PC vs. mobile, Flagged chats due to long output


OpenAI ▷ #prompt-engineering (70 messages🔥🔥):

GPTs for coding, PII data guardrails, AI for universe simulation, Mimicking writing style with AI, Ollama vs Windsurf


OpenAI ▷ #api-discussions (70 messages🔥🔥):

ChatGPT for Web App Development, GPT-4o Coding Assistance, HR Data Guardrails and PII, Mimicking Writing Style, Prompt Engineering and Agentic Frameworks


Cursor Community ▷ #general (271 messages🔥🔥):

GPU power in decentralized systems, Cursor pricing vs API pricing, Claude Max in Cursor, Multi-repo projects in Cursor, Cursor's Git changes sync issues


Yannic Kilcher ▷ #general (197 messages🔥🔥):

Patents in AI, RL-Diffusion, Generator Paradigm, Evolutionary Algorithms, Hamiltonian Neural Networks and Transformers


Yannic Kilcher ▷ #paper-discussion (23 messages🔥):

Grade School Math Benchmarks, ML systems rabbit hole, Data Loading and Preprocessing, LLMs are like humans, Model formulates a plan


Yannic Kilcher ▷ #ml-news (12 messages🔥):

AI Regulation Ban, AlphaEvolve, Budget Reconciliation Bill


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

New Chatbot Platform, Customization and Models, Image Generation in Chat


OpenRouter (Alex Atallah) ▷ #general (177 messages🔥🔥):

OpenAI Reasoning Models Naming, Free Google Models, Gemini Rate Limits, Claude on OpenRouter vs Native, Corvid Befriending


Manus.im Discord ▷ #general (165 messages🔥🔥):

Manus credits not refreshing, best use cases for Manus, Manus invitation codes, Manus refunds, Gemini Developer API's Function Calling feature


GPU MODE ▷ #general (2 messages):

torch.compile performance, layernorm vs rmsnorm


GPU MODE ▷ #cuda (1 message):

CUDA Shared Buffers, PyTorch Tensors, RAPIDSAI/rmm Library


GPU MODE ▷ #torch (2 messages):

PyTorch nightly, at::Tag, needs_exact_strides, C++ code, torch.compile


GPU MODE ▷ #beginner (5 messages):

Arithmetic Intensity of Kernels, TMA Utilization Metrics, Triton Performance Debugging, Nsight Compute for Kernel Debugging


GPU MODE ▷ #self-promotion (10 messages🔥):

Weight Pruning, PTX Instructions for Matrix Load/Store, CohereAI Talk Recording


GPU MODE ▷ #🍿 (1 message):

c.3.p.1: This looks potentially interesting: https://arxiv.org/abs/2504.09246


GPU MODE ▷ #submissions (47 messages🔥):

AMD MI300, AMD fp8-mm, VectorAdd, Leaderboard Submissions


GPU MODE ▷ #status (1 message):

Competition delayed, Ironing out details, Problem #3


GPU MODE ▷ #factorio-learning-env (15 messages🔥):

Factorio Genetic Algorithm, Cutting Down Tokens, Nearest buildable tool


GPU MODE ▷ #amd-competition (23 messages🔥):

Reference Kernel Times, Application Timeout Errors, fp8 gemm VGPR usage, Leaderboard Submission Issues, HIP Kernel .s File Access


GPU MODE ▷ #cutlass (25 messages🔥):

CUTLASS 4.0 Release, CuTe DSL for Python, MLIR Compiler, PTX Dumping, Custom Kernel Performance


GPU MODE ▷ #mojo (2 messages):

Mojo PyTorch backend, Autograd Implementation, Micrograd, PyTorch internals


aider (Paul Gauthier) ▷ #general (49 messages🔥):

Gemini 2.5 Pro, Model Performance, Common Lisp, AI Studio, Repomap


aider (Paul Gauthier) ▷ #questions-and-tips (48 messages🔥):

Gemini rate limits, Aider upgrades, Aider models, Aider configuration, Aider file navigation issues


Eleuther ▷ #general (22 messages🔥):

lm-eval-harness dataset download, R1-distill models prompt format, Regulatory bias standards and LLMs, Open Science Conference call for papers, ODSC vs OSC conference confusion


Eleuther ▷ #research (57 messages🔥🔥):

Model of Mind AI, Falsifiable Hypothesis, Sparse Gradients, Qwen 3, Skywork Model


Eleuther ▷ #lm-thunderdome (7 messages):

Multi-GPU lm-eval, vLLM Tensor Parallel


Nous Research AI ▷ #announcements (2 messages):

Atropos v0.2.0 Release, Psyche Network Launch, Decentralized AI Training, Large Language Model Training, Open Source AI Development


Nous Research AI ▷ #general (78 messages🔥🔥):

Frontier Models, smolvlm-realtime-webcam, 3 GPUs, latex2sympy2_extended math_verify, Atropos


Nous Research AI ▷ #ask-about-llms (1 message):

princepolka: Is 05-06 worse at instruction-following than the previous 2.5 Pro?


Nous Research AI ▷ #research-papers (1 message):

LLMs in multi-turn conversations, LLM performance degradation, Lost in Conversation paper, Premature Solution Generation by LLMs, LLM Recovery from Conversational Errors


Nous Research AI ▷ #interesting-links (2 messages):

Finetuning to 1.58 Bits, Cody S Tweet


Nous Research AI ▷ #research-papers (1 message):

LLMs in Multi-Turn Conversations, Lost In Conversation paper, LLM Unreliability


Notebook LM ▷ #announcements (1 message):

User Experience studies, Multilingual Audio Overviews, NotebookLM Feedback


Notebook LM ▷ #use-cases (19 messages🔥):

Invisible Sun TTRPG, Shareability Factor, Google Product Discontinuation, NotebookLM and OneNote Sync, Podcast Feature ToS


Notebook LM ▷ #general (32 messages🔥):

Podcast Length, Audio Upload and Transcription, Account Restrictions on PDF Uploads, Adding Information to System Prompt, Early Access Installation Issues


Latent Space ▷ #ai-general-chat (41 messages🔥):

GPT-4 Launch, ChatGPT Scaling, AI Founder in Residence, AI in Ohio Courts, AlphaEvolve


Latent Space ▷ #ai-announcements (3 messages):

Tom Yeh, Llama 1/2/3/4, LLM Paper Club


HuggingFace ▷ #general (15 messages🔥):

Qwen Model Distillation, MiniCPM-V-2_6, Perceptron Visualizers, Local Stable Diffusion Hosting, Langfuse Deployment with Smolagents


HuggingFace ▷ #today-im-learning (4 messages):

Assistance Offered, Hugging Face Transformers, EleutherAI Suggestion, Diffusion Course from MIT


HuggingFace ▷ #i-made-this (7 messages):

pdf2tex vs 12GB ram, PDF format criticism, Markdown output suggestion, Civitai censorship


HuggingFace ▷ #reading-group (3 messages):

Simulation-Based Inference, AI Reading Group session


HuggingFace ▷ #NLP (3 messages):

Emotion detection limitations, Transformers tokenizer context length


HuggingFace ▷ #smol-course (2 messages):

Agent blocked sites, Smolagents framework


HuggingFace ▷ #agents-course (10 messages🔥):

HF Inference Provider Credits, HF SPACE_ID and SPACE_HOST ENV vars, Unit 1 code execution, InferenceClient Model Selection, Llama models text_generation


MCP (Glama) ▷ #general (40 messages🔥):

Typescript vs Authpython Lag, Debugging MCP servers on Smithery, Scalable MCP with Streamable HTTP, User Confirmation for AI Agent MCP Tools, Revolutionary idea for MCP Security


MCP (Glama) ▷ #showcase (3 messages):

Yarr MCP Servers, Tiny Agents Remote MCP Support, LLM-provider-agnostic, MCP enabled Chat Client


Torchtune ▷ #general (5 messages):

Custom Torchtune Models with vLLM, Synchronous GRPO recipe with vLLM


Torchtune ▷ #dev (37 messages🔥):

HFModelTokenizer vs GemmaTokenizer, Gemma PromptTemplate, Tokenizer configurations, Masking assistant tokens


Modular (Mojo 🔥) ▷ #mojo (25 messages🔥):

Variant bug with SIMD, register_passable types, Mojo in Google Colab


tinygrad (George Hotz) ▷ #general (15 messages🔥):

WebGPU bug, BEAM parameter, tinybox-ui, high performance blake3 implementation


tinygrad (George Hotz) ▷ #learn-tinygrad (1 message):

cookiecrumbs3808: Or offloaded to CPU, I guess.


LlamaIndex ▷ #blog (3 messages):

LlamaIndex Memory component, LlamaExtract citation implementation


LlamaIndex ▷ #general (6 messages):

LlamaIndex Memory Component, Memory Session Management, Database Integration for Memory, Serialization vs. Database for Context, Memory vs Redis


Cohere ▷ #💬-general (3 messages):

Generation Parameters, Use cases for Cohere, Cohere vs ChatGPT and Anthropic


Cohere ▷ #🔌-api-discussions (5 messages):

Cohere API Calls, Cohere Billing, Cohere Trial Key


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Course certificate requirements, Medium article or X post for certificate