Frozen AI News archive

Gemini's AlphaEvolve agent uses Gemini 2.0 to find new Math and cuts Gemini cost 1% — without RL

**Deepmind's AlphaEvolve**, a 2025 update to AlphaTensor and FunSearch, is a Gemini-powered **coding agent for algorithm discovery** that designs faster matrix multiplication algorithms, solves open math problems, and improves data center and AI training efficiency. It achieves a **23% faster kernel speedup** in Gemini training and surpasses state-of-the-art on 20% of applied problems, including improvements on the Minimum Overlap Problem and Kissing number problem. Unlike Deep-RL, it optimizes code pieces rather than model weights. Meanwhile, **OpenAI** released **GPT-4.1** in ChatGPT, specializing in coding and instruction following, with a faster alternative **GPT-4.1 mini** replacing GPT-4o mini for all users. OpenAI also launched the Safety Evaluations Hub and the OpenAI to Z Challenge using o3/o4 mini and GPT-4.1 models to discover archaeological sites. *"Maybe midtrain + good search is all you need for AI for scientific innovation"* - Jason Wei.

Canonical issue URL

Agent Harnesses are all you need.

AI News for 5/15/2025-5/16/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 3819 messages) for you. Estimated reading time saved (at 200wpm): 341 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Deepmind's new AlphaEvolve, 2025's update of AlphaTensor and FunSearch, is hard to grok, as it summarizes a year of results across a vast swath of math and LLM training applications, AND is not publicly available to try, but GDM succinctly puts it as "a Gemini-powered coding agent for algorithm discovery... able to:

It is described as an agent rather than a model due to the mutiple components in a loop:

It's very Googley to understate their results, so one has to turn to the Twitterverse to get the highlights, which are much better:

Inquiring minds can watch the MLST interview about it:

https://www.youtube.com/watch?v=vC9nAosXrJw


AI Twitter Recap

GPT-4.1 and OpenAI Model Releases

Google's AlphaEvolve and Gemini

Open Source Models, Training, and Frameworks

Reasoning and Agentic Systems

AI Implementation, Tooling, and Infrastructure

AI Analysis and Evaluation

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

1. Text-to-Speech Model Training and Tools in Unsloth

2. New Features and Data Handling in llama.cpp

3. LLM Multi-Turn Conversation Challenges and Benchmarks

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

TO BE COMPLETED


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mania - New Releases and Capabilities Spark Fierce Debates

Theme 2: Engineering AI - Optimizing Performance and Refining Development Tools

Theme 3: Platform Quirks & User Workarounds - Navigating the AI Landscape

Theme 4: The Bustling AI Ecosystem - Collaboration, Learning, and Open Source Triumphs

Theme 5: AI's Wild Side - Controversies, Accidental Leaks, and Industry Shake-ups


Discord: High level Discord summaries

Perplexity AI Discord


Manus.im Discord Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Cursor Community Discord


LM Studio Discord


Yannick Kilcher Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Notebook LM Discord


Nous Research AI Discord


GPU MODE Discord


Latent Space Discord


Eleuther Discord


MCP (Glama) Discord


Modular (Mojo 🔥) Discord


Cohere Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


LlamaIndex Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


AI21 Labs (Jamba) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (869 messages🔥🔥🔥):

Perplexity Pro role, App answers cannot be read on the web version, Research function down, Deep Research broken, Deepsearch rate limits


Perplexity AI ▷ #sharing (1 messages):

23andMe files for Chapter 11


Perplexity AI ▷ #pplx-api (6 messages):

Sonar API, Perplexity hackathon Credits, sonar model


Manus.im Discord ▷ #general (472 messages🔥🔥🔥):

Manus Vibe Coding Livestream, Johnny's credit farming, Femboys, invite links gone, Credits Usage


LMArena ▷ #general (401 messages🔥🔥):

Gemini 2.5 Pro Reasoning Time, Elon's Grok 3.5 Release Delay, LLMs Steering Attention, LMArena Model Funding, O3 Pro on Arena


Unsloth AI (Daniel Han) ▷ #general (208 messages🔥🔥):

Quantized versions (GGUFs, QNL), Multi-GPU Finetuning, SLM vs LLM, Qwen3 model for translation, H200 Temp


Unsloth AI (Daniel Han) ▷ #off-topic (10 messages🔥):

Fine-tuning AI models, Jarvis-like AI clones, Continuous Thought Machine on Flappy Bird


Unsloth AI (Daniel Han) ▷ #help (120 messages🔥🔥):

Qwen3 DPO Training, Orpheus-3B Fine-tuning Issues, Mistral 7b VRAM Usage, Epoch Display Bug, BLIP2 and Transformers


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

WebGPU Vision-LLM app, Geminized Qwen3 MoE


Unsloth AI (Daniel Han) ▷ #research (5 messages):

Intellect-2, Solo Author, Mechanistic-Interpretable Ethics


aider (Paul Gauthier) ▷ #general (193 messages🔥🔥):

Grok's Alignment Issues, Gemini 2.5 Pro Removal, Free API Credits, Aider Token Usage, Consolidate Command for Aider


aider (Paul Gauthier) ▷ #questions-and-tips (99 messages🔥🔥):

Commit Prompt, Rate limited, Black box for code, aider config, home directory


aider (Paul Gauthier) ▷ #links (1 messages):

p0lyg0n: https://github.com/pig-dot-dev/muscle-mem


OpenAI ▷ #annnouncements (1 messages):

OpenAI to Z Challenge, Archaeological Sites, Amazon, GPT-4.1


OpenAI ▷ #ai-discussions (195 messages🔥🔥):

GPT-4.1 Mini Smarts, GPT-5 Release, Gemini 2.5 Pro, OpenAI's Open-Source Model, Context Window Expansion


OpenAI ▷ #gpt-4-discussions (21 messages🔥):

GPT-4.1 vs GPT-4o, Fine-tuning Datasets for Story Generation, Mathematics in GPT-4.1


OpenAI ▷ #prompt-engineering (2 messages):

Research GPT Feedback, Multilingual Capabilities


OpenAI ▷ #api-discussions (2 messages):

GPT feedback, English language issues, Korean language model performance


Cursor Community ▷ #general (192 messages🔥🔥):

Cursor Pro vs. Free, Client Version Details, Claude 3.5 Sonnet, Gemini Pro Preview, Agent Rules Neglected


LM Studio ▷ #general (58 messages🔥🔥):

Expanding left sidebar, Llama issues with fantasy prompts, Token loss and punctuation issues, LM Studio API Vision Endpoint, Reka Flash Presets


LM Studio ▷ #hardware-discussion (128 messages🔥🔥):

VRAM Importance vs DRAM, Qwen Models, KV Cache, 7900 XTX, 5060 Ti


Yannick Kilcher ▷ #general (155 messages🔥🔥):

AlphaEvolve Analysis, LLM vs. System Role, Gemini 2.5 Pro, LiquidAI Skepticism, Hybrid AI Approaches


Yannick Kilcher ▷ #paper-discussion (3 messages):

Sakana AI, AI Scientist Paper, Language Models, Reasoning Mistakes, Error Correction


Yannick Kilcher ▷ #ml-news (19 messages🔥):

Stable Audio Open Small, MythoMax-L2-13B Samsung Release, Meta researchers leaving


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Chatroom shortcut, Model Icons, Quick Chat


OpenRouter (Alex Atallah) ▷ #general (105 messages🔥🔥):

DeepSeek v3 MoE, Corvids cat food and bird food, Proxy for OpenAI, AlphaEvolve, Qwen3 /no_think bug


HuggingFace ▷ #general (67 messages🔥🔥):

Fine Tuning Llama on SageMaker, LibreChat privacy concerns, GraphQL schema code completion, Strcoder2 model distillation for Python, Emotion classification model accuracy


HuggingFace ▷ #today-im-learning (2 messages):

LangGraph


HuggingFace ▷ #i-made-this (6 messages):

Realistic Text-To-Speech, WebGPU Vision-LLM, AsianMOM, SmolVLM, Federated Learning AI


HuggingFace ▷ #reading-group (1 messages):

cleonorris: It is monthly, but this is actually our last one before the summer!


HuggingFace ▷ #NLP (6 messages):

DistilRoberta vs Roberta, Emotion Detection accuracy, GLoVE Paper, RobertaForTokenClassification extension, BERTopic


HuggingFace ▷ #smol-course (4 messages):

Qwen, AI Agent course


HuggingFace ▷ #agents-course (10 messages🔥):

Agent Template Errors, Course Completion, Final Unit Library, Certification Deadline


Notebook LM ▷ #use-cases (16 messages🔥):

Audiobook format, Ducky Bedtime Stories, Focus at Work, Pomodoro Timer, YouTube Music integration


Notebook LM ▷ #general (76 messages🔥🔥):

App Availability in Pakistan, Mobile App Program Limitations, Audio Generation Limits, Tabular Data with NLM, Scammy Links Warning


Nous Research AI ▷ #announcements (1 messages):

Solana Foundation, Decentralized AI, Psyche


Nous Research AI ▷ #general (85 messages🔥🔥):

Psyche Training, Meta AR, Smart Glasses, Grok crashing


GPU MODE ▷ #triton (1 messages):

TritonBench, AMD GPU Errors, Memory Access Fault


GPU MODE ▷ #cuda (3 messages):

cudaIpcMemHandle_t Serialization, Single GPU Multiprocess Communication


GPU MODE ▷ #torch (12 messages🔥):

Mapping Fused Operations, Pipeline Parallelism with torch.autograd.graph.saved_tensors_hooks, Custom CUDA Graphs and Caching Issues in vLLM V1, GEMM Codegen Performance vs. Native aten Implementation, torch.compile modes benchmark


GPU MODE ▷ #cool-links (1 messages):

real.optimus.prime: From DeepSeek: https://arxiv.org/abs/2505.09343


GPU MODE ▷ #beginner (6 messages):

CUDA SASS negation, CCCL/libcu++ vector types, CUDA compilation flags


GPU MODE ▷ #youtube-recordings (1 messages):

GitHub Repository, Lecture Scripts


GPU MODE ▷ #intel (3 messages):

Tensor Processing Unit (TPU)


GPU MODE ▷ #submissions (40 messages🔥):

MI300, amd-fp8-mm, amd-mixture-of-experts


GPU MODE ▷ #cutlass (3 messages):

Cutlass, fp8, bf16, narrow precision dtypes


GPU MODE ▷ #mojo (1 messages):

clattner: Y'all might find this techtalk interesting: https://www.youtube.com/watch?v=Invd_dxC2RU


Latent Space ▷ #ai-general-chat (50 messages🔥):

OpenMemory MCP, Grok issues, Microsoft fires TypeScript dude, Agentic tooling, FUNapis


Eleuther ▷ #general (39 messages🔥):

Knowledge graphs for papers, Cloud GPU/HW providers, MLPerf training benchmark, Data stalls in DNN training, Audio modality preprocessing


Eleuther ▷ #interpretability-general (3 messages):

BlackboxNLP, Interpretability, Causal Variable Localization, MIB Benchmark


Eleuther ▷ #lm-thunderdome (2 messages):

MCQ Evaluations, MMLU Issues, Model Outputs


Eleuther ▷ #multimodal-general (2 messages):

LLaVAGuard, SafeDiffuser, Multimodal Models


MCP (Glama) ▷ #general (33 messages🔥):

MCP Client-Server Call Flow, Chainlit Query Parameters, Jinko MCP for Hotel Sales, Smithery Server and Claude Desktop, Understanding MCP Resources


MCP (Glama) ▷ #showcase (8 messages🔥):

LLM Agent to MCP Server Connection, MCP for AI Agents Selling Hotels, MCP Democratizes Apache Kafka Usage, macos-automator-mcp for Autonomous Debugging, AI and MCP Language Barriers


Modular (Mojo 🔥) ▷ #general (5 messages):

Documentation updates, stdlib modifications


Modular (Mojo 🔥) ▷ #mojo (9 messages🔥):

Pointer declaration in Mojo structs, Mojo generics over origin, Mojo Lifetimes, Karpathy's micrograd porting to Mojo, Jeff's talks at Modular


Modular (Mojo 🔥) ▷ #max (11 messages🔥):

MAX Installation Issues, LoRA Trainer Difficulties, Mojo Weak Tensor Support, MAX and PyTorch Hybrid Approach, LLM Hallucinations with Modular's Platform


Cohere ▷ #💬-general (3 messages):

Cohere, Sponsorship, Grants, Nonprofit


Cohere ▷ #🔌-api-discussions (6 messages):

Cohere Classify API, Rate Limit Increase


Cohere ▷ #💡-projects (3 messages):

SiliconFlow, Gemma 3 4b 4bit, Llama 3.2 3B 4bit


Cohere ▷ #🤝-introductions (3 messages):

Web AI Engineer introduction, Full stack developer AI fan Introduction, Gabriel 20 years development experience


DSPy ▷ #general (7 messages):

Gemini Models Response Schema, Structured Outputs in DSPy, Pydantic models


Nomic.ai (GPT4All) ▷ #general (5 messages):

GPT4All's demise, Nomic's future direction, Jan.ai and LM Studio as alternatives


LlamaIndex ▷ #blog (2 messages):

Event Driven Agent Workflows, Multi-Agent Docs Assistant, Tig AI Coding Agent


LlamaIndex ▷ #general (2 messages):

PDF content extraction with LlamaIndex, Vibe Coding partnership opportunities


Torchtune ▷ #general (4 messages):

Custom Torchtune Network on vLLM, Unregistered Model with vLLM, Custom Model Implementation in vLLM


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (3 messages):

Lambda Workshop, Nobel FutureTech Info Session, Agentic AI, Inference API


tinygrad (George Hotz) ▷ #general (1 messages):

topk GPU, masked_select GPU, randperm_generator GPU, index_put_impl, index_tensor


MLOps @Chipro ▷ #events (1 messages):

Agentic Enrichment, LLM Data Access, Featureform


Codeium (Windsurf) ▷ #announcements (1 messages):

SWE-1, Software Engineering Models, Flow Awareness Approach, Windsurf Tab Experience, Cascade Optimization


AI21 Labs (Jamba) ▷ #general-chat (1 messages):

AI21 Labs, Maestro, AI Tinkerers Meetups