Frozen AI News archive

not much happened today

**OpenAI** plans to evolve **ChatGPT** into a **super-assistant** by 2025 with models like **o3** and **o4** enabling agentic tasks and supporting a billion users. Recent multimodal and reasoning model releases include ByteDance's **BAGEL-7B**, Google's **MedGemma**, and NVIDIA's **ACEReason-Nemotron-14B**. The **Sudoku-Bench Leaderboard** highlights ongoing challenges in AI creative reasoning. In software development, OpenAI's **Codex** aids code generation and debugging, while Gemini's **Context URL tool** enhances prompt context. **AgenticSeek** offers a local, privacy-focused alternative for autonomous agents. Ethical concerns are raised about AGI development priorities and Anthropic's alignment with human values. Technical discussions emphasize emergence in AI and training challenges, with humor addressing misconceptions about **Gemini 3.0** and async programming in C. A novel synthetic speech training method enables instruction tuning of LLMs without real speech data, advancing low-resource language support.

Canonical issue URL

a quiet day

AI News for 5/23/2025-5/26/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (217 channels, and 11775 messages) for you. Estimated reading time saved (at 200wpm): 1148 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

zzzzz


AI Twitter Recap

Advancements in AI Models and Technologies

AI in Software Development and Tools

Ethics and AI Governance

Technical Challenges and Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Synthetic Speech Model Launches and Benchmarks

2. Local LLM Deployment Hardware & Tooling

3. Novel LLM Security Applications

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Google Veo 3, VACE, and Wan: Next-Gen AI Video Generation & Tools

2. Coding Benchmarks, Model Comparisons, and Real-World Claude 4/O4/Gemini Usage

3. Chatbot & LLM Quirks: Model Identity, AI Outsourcing, and App Behaviors


AI Discord Recap

A summary of Summaries of Summaries by gpt-4.1-2025-04-14

1. AI Hardware, Models, and Benchmarking Buzz

2. RL, Reasoning, and Prompt Innovations

3. AI Agents, Security, and Voice Tech Vibes

4. Hardware, Kernel, and Ecosystem Engineering

5. Open-Source Launches and Ecosystem Upgrades


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenAI Discord


LM Studio Discord


Cursor Community Discord


aider (Paul Gauthier) Discord


Yannick Kilcher Discord


Manus.im Discord Discord


HuggingFace Discord


GPU MODE Discord


MCP (Glama) Discord


Nous Research AI Discord


Notebook LM Discord


Latent Space Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


DSPy Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Pro Perks, Academic Homepage, Finance Dashboard Revamp, Audio & Video Search, Space Templates


Perplexity AI ▷ #general (1002 messages🔥🔥🔥):

Sam Altman's New Hardware Venture, Comet Beta Access, Gemini vs Mistral Models, Perplexity AI's Support, Gemini Pro Free Trial


Perplexity AI ▷ #sharing (5 messages):

US City Changes, Weapon Ownership, AI Model Ranking, AI Song Release


Perplexity AI ▷ #pplx-api (15 messages🔥):

API credits, Python client, Perplexity API pricing, Sonar API usage, Image generation API


Unsloth AI (Daniel Han) ▷ #general (1034 messages🔥🔥🔥):

Ikea vs Herman Miller Chairs, Desk Job Health Hazards, AI and Gym Balance, DeepSeek v3 Release, Multiple GPU Support ETA


Unsloth AI (Daniel Han) ▷ #off-topic (28 messages🔥):

Training run habits, Multimodal RAG tools, Harvard AI course, Reddit AI companies list, Falcon-H1 Benchmarking


Unsloth AI (Daniel Han) ▷ #help (601 messages🔥🔥🔥):

Qwen3 fine-tuning, Synthetic data and GPUs, Unsloth installation issues, RAG vs Fine-tuning, LoRA and VLLM


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

RAG fine-tuning, Open source web analytics, DIY Analytics GitHub Repo


Unsloth AI (Daniel Han) ▷ #research (11 messages🔥):

Deepseek-v3, PEER expert layers, Memory hierarchy aware expert streaming, Cross encoders vs colbert, Multi-GPU support


LMArena ▷ #general (1225 messages🔥🔥🔥):

Gemini 2 Flash, Model Merging, Open Empathic, Grok 3.5 Release, Claude 4


LMArena ▷ #announcements (2 messages):

Style Control, Independent Scrolling


OpenAI ▷ #ai-discussions (812 messages🔥🔥🔥):

Google Veo 3, Gemini models, Codex, Claude, Image Generators


OpenAI ▷ #gpt-4-discussions (22 messages🔥):

UI changes on platform, GPT-4.1 hallucinations, O3 vs 4o explanation quality, O3 Pro release delay


OpenAI ▷ #prompt-engineering (42 messages🔥):

GPT Markdown, XML, JSON Understanding, Multi-Agent Systems with GPTs, Prompt Refinement for 3D Mockups, Learning Prompt Engineering with ChatGPT


OpenAI ▷ #api-discussions (42 messages🔥):

GPT understanding of Markdown, XML, JSON, Multi-agent system using GPTs, Prompt Engineering Learning Resources, 3D mockup prompt refinement, Negative instruction phrasing


LM Studio ▷ #general (424 messages🔥🔥🔥):

Devstral LLM, Qwen Coder models, Mistral 7B V0.3 prompt template fix, whiterabbitnepo model, OpenAI Whisper on LM Studio


LM Studio ▷ #hardware-discussion (553 messages🔥🔥🔥):

5080 worth it?, AMD vs CUDA, 3090 alternatives, Multi GPU setup, Lunar Lake laptops


Cursor Community ▷ #general (715 messages🔥🔥🔥):

Claude 3.7 vs 4.0, Cursor pricing and profitability, search_replace Tool broken, Gemini-2.5-pro max, language channels


Cursor Community ▷ #background-agents (36 messages🔥):

Background Agent Setup Issues, Privacy Mode Delay, Environment Variables for Background Agents, Branching with Background Agents, Background Agent Errors


Cursor Community ▷ #announcements (1 messages):

Language-specific channels


aider (Paul Gauthier) ▷ #general (674 messages🔥🔥🔥):

Claude 4 Sonnet vs Gemini, Gemini Pro Performance Degradation, OpenRouter LLM access, Aider's benchmark, DeepSeek V3 Rumors


aider (Paul Gauthier) ▷ #questions-and-tips (41 messages🔥):

Aider's /code vs Cursor Agent Mode, /architect vs /ask + /code, Globally disable thinking tokens, Lisp parens balancing tricks, Benchmark Flags in Architect Mode


aider (Paul Gauthier) ▷ #links (3 messages):

Mistral AI, Devstral, Model Testing


Yannick Kilcher ▷ #general (508 messages🔥🔥🔥):

OpenEvolve, Sonnet 4, residual streams, Neural Turing Machine, OpenRouter


Yannick Kilcher ▷ #paper-discussion (3 messages):

No Paper Friday, Weekend anticipation


Yannick Kilcher ▷ #agents (2 messages):

Probabilistic Integral Circuits, Probabilistic Circuits


Yannick Kilcher ▷ #ml-news (11 messages🔥):

Character tokenization, Windows OS, GANs


Manus.im Discord ▷ #general (492 messages🔥🔥🔥):

Manus AI Agent, Claude 4.0, Manus Customer Service, Video inventory with Manus


HuggingFace ▷ #general (284 messages🔥🔥):

Qwen 0.6b Model Usage, GPU Recommendations, Token Issues, Synthetic Data Kit, Real-Time Audio Transcription


HuggingFace ▷ #today-im-learning (2 messages):

Attention Mechanism, Query, Key, Value vectors


HuggingFace ▷ #cool-finds (2 messages):

EasyShield Anti-Spoofing AI Model, Claude AI Referral Program


HuggingFace ▷ #i-made-this (8 messages🔥):

Flast Taglines, Button Color Schemes, Native Cross-Platform AI Chat App, Agentle Project, SweEval Dataset


HuggingFace ▷ #reading-group (5 messages):

Cross-posting, Channel Topics, Weekly Reading Group


HuggingFace ▷ #NLP (9 messages🔥):

Source Available Models, Fine-tuning data size, HF Transformers Contributions


HuggingFace ▷ #smol-course (3 messages):

Attachment handling in app.py, LLM tool calling issues, Qwen LLM


HuggingFace ▷ #agents-course (56 messages🔥🔥):

HF Spaces app.py deployment, HF Token setup and permissions, Smolagents Notebook issues, Submitting Agent course assigments to leaderboard, Final GAIA code and file attachments


GPU MODE ▷ #general (6 messages):

CI providers without self-hosted GPUs, Lambda Labs GH200 efficiency, Modal GH200, Reasoning article feedback, ModernBERT inference latency


GPU MODE ▷ #triton (58 messages🔥🔥):

Interleaved PIDs and Coalesced Loads in Triton, ML Compilation and Optimization, Triton Compiled Hook Registration, Double Buffering in Triton Kernels, A100 vs 4090 LDMatrix Behavior


GPU MODE ▷ #cuda (30 messages🔥):

FlashAttention-2 Implementation, cuSOLVER Optimization on Hopper/Blackwell, Top-K Algorithm Parallel Implementation, RTX 6000 Pro Analysis and FP4/FP6 Performance, Triton Data Packing


GPU MODE ▷ #torch (2 messages):

Training support


GPU MODE ▷ #announcements (1 messages):

Disaggregated LLM inference, Jensen's keynote at GTC


GPU MODE ▷ #algorithms (8 messages🔥):

Markov Chain Monte Carlo, Tritonhacks 2025


GPU MODE ▷ #cool-links (1 messages):

FP4 Training, LLMs


GPU MODE ▷ #beginner (8 messages🔥):

Swizzling shared memory, PyTorch Eager Execution, KernelBench usage, CUDA Architecture Compatibility, Triton AMD Kernel Optimization


GPU MODE ▷ #pmpp-book (1 messages):

Chapter 5, Thread Sanity Check, Matrix Multiplication


GPU MODE ▷ #jax (1 messages):

LLM Training Memory Issues, Backprop Memory Explosion, Sharding Strategies for Matmul, Embedding Layer Optimization


GPU MODE ▷ #torchao (2 messages):

QAT Loss Curves, WeightWithDynamicFloat8CastTensor issues


GPU MODE ▷ #off-topic (17 messages🔥):

Mick Gordon, Doom Eternal soundtrack, New Balance Patch difficulty, Boston Frontier Labs, Noodle recipe


GPU MODE ▷ #irl-meetup (1 messages):

Account Deletion Request, Email Registration Error


GPU MODE ▷ #rocm (3 messages):

torch._grouped_mm, ROCm Support, CDNA3


GPU MODE ▷ #lecture-qa (2 messages):

Triton demo, linear bandwidth result, BLOCK_SIZE, correctness issue


GPU MODE ▷ #self-promotion (2 messages):

Resume builder, Mojo programming language, Vector reduction on GPU, CUDA alternative


GPU MODE ▷ #🍿 (6 messages):

Hackathon for synthetic data, Kernelbook Opt-Out, RL-style training, KernelLLM to generate triton kernels


GPU MODE ▷ #edge (1 messages):

Real-time speech translation, Google Meet


GPU MODE ▷ #reasoning-gym (2 messages):

X post, willccbb


GPU MODE ▷ #submissions (121 messages🔥🔥):

MI300 Leaderboard Updates, amd-mixture-of-experts performance on MI300, amd-mla-decode performance on MI300, amd-fp8-mm performance on MI300, grayscale performance on T4


GPU MODE ▷ #status (2 messages):

MLA Bugs, MLA Tolerance


GPU MODE ▷ #tpu (2 messages):

maxtextXLA, TPUs, Torch XLA


GPU MODE ▷ #factorio-learning-env (3 messages):

Claude 4, factorio-learning-environment


GPU MODE ▷ #amd-competition (51 messages🔥):

RoPE implementation issues, Composable Kernel (CK) integration, HIP kernel error, Leaderboard command failure, MLA decode tolerance adjustment


GPU MODE ▷ #cutlass (19 messages🔥):

CUTLASS Blackwell MLA support in FlashInfer, TmaTiler argument for cpasync.make_tma_tile_atom, CuTe with nanoGPT and larger matrix sizes on H100, compile cutlass with default config, torch.compile


GPU MODE ▷ #mojo (7 messages):

Usefulness of Blog Posts, Rules about Self Promotion


MCP (Glama) ▷ #general (211 messages🔥🔥):

VSCode MCP client issues, MCP 'roots' explanation, Tool calling for MCP servers, A2A vs MCP, MCP Prompts and Resources


MCP (Glama) ▷ #showcase (5 messages):

MCP Directory, MCP Buddy, MCP App Store, Google Analytics MCP, mcp-ui-bridge library


Nous Research AI ▷ #general (157 messages🔥🔥):

Hermes steers models, Alignment protocol talk, Bitcoin ordinals for agents, RL for math and tool calling


Nous Research AI ▷ #ask-about-llms (16 messages🔥):

Gemma 3n, PLE Implementation, Neural Networks, Linear Projection


Nous Research AI ▷ #research-papers (3 messages):

One-Shot RLVR, Absolute Zero Reasoner (AZR), RL Fine-tuning Effects on LLMs, Post-saturation generalization


Nous Research AI ▷ #interesting-links (8 messages🔥):

OpenEvolve, Veo3 open source, ECCV papers, Vibe coding with Rick Rubin, Microsoft Azure tutorials


Nous Research AI ▷ #research-papers (3 messages):

Reinforcement Learning for Reasoning, Absolute Zero Learning, LLMs as Greedy Agents


Notebook LM ▷ #use-cases (41 messages🔥):

NotebookLM use for podcasts, Custom GPTs for enterprise, TTS Quota Limits, Generating long audiobooks, NotebookLM Cast of Characters feature


Notebook LM ▷ #general (139 messages🔥🔥):

Audio Length Limits, Mobile App Issues, Data Format Recommendations, Podcast quality concerns, Mindmap Feature Requests


Latent Space ▷ #ai-general-chat (66 messages🔥🔥):

ChatGPT error handling, PicoCreator AI Agent, Yapper AI lip-sync tool, Langdock enterprise ChatGPT wrapper, AI coding tools limitations


Latent Space ▷ #ai-in-action-club (65 messages🔥🔥):

MCPI CLI, Cursor tools, Discord audio issues, Discord vs Zoom, Google Meet


Eleuther ▷ #general (64 messages🔥🔥):

Serverless Architecture Paper, John Carmack presentation, ML performance optimization as consulting business, Open source fine-tunable models for live voice mode, AI alignment research at Eleuther


Eleuther ▷ #research (31 messages🔥):

AI Safety Initiative, FP8 for Optimizers, Quantized Training


Eleuther ▷ #interpretability-general (28 messages🔥):

NNsight vs TransformerLens, Kuramoto oscillators, Activation Manifold, Mechanistic Router Interpretability Hackathon


Modular (Mojo 🔥) ▷ #mojo (75 messages🔥🔥):

Rust vs Haskell for compile time execution, Mojo's Bool wrapping requirement, Compile Mojo to RISCV_32, Mojo FFI instability and OpenGL, Mojo from Python


DSPy ▷ #show-and-tell (1 messages):

Self improving Vibe Coding Template, DSPy Tooling, New Models


DSPy ▷ #general (49 messages🔥):

Gemma 2 9B Optimization on vLLM, Text-to-SQL SOTA, Connecting ERP Systems to LLMs, DSPy Tool Integration, DSPy Multi-Module Optimization


LlamaIndex ▷ #announcements (1 messages):

OpenAI Responses API in LlamaIndex, LlamaParse for Financial Applications Event, LlamaIndex Agent Memory Livestream, LlamaIndex Monorepo Overhaul


LlamaIndex ▷ #blog (3 messages):

OpenAI Responses API in LlamaIndex, LlamaIndex at AI Engineer World Fair, LlamaParse and AnthropicAI Sonnet 4.0


LlamaIndex ▷ #general (37 messages🔥):

RAG for legal documents, Reason-ModernColBERT compatibility with LlamaIndex, Llama Cloud Portal UI issues, LlamaIndex and Unsloth's retrieval augmented finetuning cookbook, MCP Server in LlamaIndex


Nomic.ai (GPT4All) ▷ #general (41 messages🔥):

GPT4All issues loading models, Granite 3.2 model recommendation, Offline library model recommendations, Text embedding LM for sentence synthesis, GPT4All future and open source contributions


Torchtune ▷ #general (13 messages🔥):

Deepwiki, Qwen2.5 Vocab Size, LORA finetuning


Torchtune ▷ #dev (2 messages):

Gemma 3n, Apple mobile AI


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (12 messages🔥):

Technical Blog vs Social Media Content, Gemini API Key Issues, AgentX Free Tier Access


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

melleny.pu_38442: Hello, everyone, may I ask is there any group targets to AI for Science?


Cohere ▷ #💬-general (7 messages):

Command-A website filtering, Tool Call Clarification, Link pasting, Command-A Language mix-ups


Cohere ▷ #🔌-api-discussions (1 messages):

michael: yep. you can call the API using normal HTTP requests


Cohere ▷ #🤝-introductions (5 messages):

Agentic AI, Creative Problem Solving, Blockchain Solutions, Emerging Tech


tinygrad (George Hotz) ▷ #general (11 messages🔥):

AMD_LLVM backend differences, ROCm 6.4.0 support, Tenstorrent updates, MLPerf CI and SDXL search speed, mselect contiguous removal


MLOps @Chipro ▷ #general-ml (4 messages):

PyTorch First Impressions, JAX vs NumPy, Keras and Scikit-learn Comparison