Frozen AI News archive

Gemini 2.5 Flash completes the total domination of the Pareto Frontier

**Gemini 2.5 Flash** is introduced with a new "thinking budget" feature offering more control compared to Anthropic and OpenAI models, marking a significant update in the Gemini series. **OpenAI** launched **o3** and **o4-mini** models, emphasizing advanced tool use capabilities and multimodal understanding, with **o3** dominating several leaderboards but receiving mixed benchmark reviews. The importance of tool use in AI research and development is highlighted, with **OpenAI Codex CLI** announced as a lightweight open-source coding agent. The news reflects ongoing trends in AI model releases, benchmarking, and tool integration.

Canonical issue URL

AI News for 4/16/2025-4/17/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (212 channels, and 11414 messages) for you. Estimated reading time saved (at 200wpm): 852 minutes. You can now tag @smol_ai for AINews discussions!

It's fitting that as LMArena becomes a startup, Gemini puts out what is likely to be the last major lab endorsement of chat arena elos for their announcement of Gemini 2.5 Flash:

image.png

With pricing for 2.5 Flash seemingly chosen to be exactly on the line between 2.0 Flash and 2.5 Pro, it seems that the predictiveness of the Price-Elo chart since it debuted on this newsletter last year has reached its pinnacle usefulness, after being quoted by Jeff and Demis.

Gemini 2.5 Flash introduces a new "thinking budget" that offers a bit more control over the Anthropic and OpenAI equivalents, though it is debatable whether THIS level of control is that useful (vs "low/medium/high"):

image.png

The HN Comments reflect the big "Google wakes up" trend we reported on 5 months ago:

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Model Releases and Capabilities (o3, o4-mini, Gemini 2.5 Flash, etc.)

AI Applications and Tools

Frameworks and Infrastructure

Economic and Geopolitical Analysis

Hiring and Community

Meta-Commentary and Opinions

Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Novel LLM Model Launches and Benchmarks (BLT, Local, Mind-Blown Updates)

2. Open-Source LLM Ecosystem: Local Use and Licensing (Llama 2, Gemma, JetBrains)

3. AI Industry News: DeepSeek, Wikipedia-Kaggle Dataset, Qwen 3 Hype

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI o3 and o4-mini Model Benchmarks and User Experiences

2. Recent Video Generation Model Launches and Guides (FramePack, Wan2.1, LTXVideo)

3. Innovative and Specialized Image/Character Generation Model Releases


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. Latest LLM Models: Hits, Misses, and Hallucinations

Theme 2. AI Development Tooling and Frameworks

Theme 3. Optimizing AI Hardware Performance

Theme 4. AI Model Safety, Data, and Societal Impact

Theme 5. Industry Watch: Bans, Acquisitions, and Business Shifts


PART 1: High level Discord summaries

Perplexity AI Discord


LMArena Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


OpenAI Discord


Cursor Community Discord


Yannick Kilcher Discord


Manus.im Discord Discord


Eleuther Discord


HuggingFace Discord


GPU MODE Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


Nous Research AI Discord


Notebook LM Discord


MCP (Glama) Discord


Torchtune Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Cohere Discord


Codeium (Windsurf) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Perplexity AI ▷ #announcements (1 messages):

Telegram Bot, WhatsApp Bot


Perplexity AI ▷ #general (952 messages🔥🔥🔥):

Ticketing Bot for Complaints, Attaching Images for Referral Link Help, Neovim Configuration Showcase, Claude 3 vs Gemini 2.5 Performance, Perplexity Voice Mode


Perplexity AI ▷ #sharing (3 messages):

tariffs, Trump, EU, China


Perplexity AI ▷ #pplx-api (3 messages):

Job search posts, PplxDevs Tweet, Move it to June


LMArena ▷ #general (1257 messages🔥🔥🔥):

Gemini 2.5 Pro vs Flash, O3 vs O4 Mini, thinking budget parameters, LLMs help studying?, OpenAI cost efficiency


LMArena ▷ #announcements (2 messages):

LMArena Company Formation, Beta Launch, Feedback Response


aider (Paul Gauthier) ▷ #general (832 messages🔥🔥🔥):

code2prompt, Aider's new command, Gemini 2.5 vs O3/O4, DeepSeek R2


aider (Paul Gauthier) ▷ #questions-and-tips (23 messages🔥):

Ask Mode Persistence, O4-mini Error Fix, Copy-Context Usage, Cloud Aider Instances, Architect & Edit Format Split


aider (Paul Gauthier) ▷ #links (2 messages):

New Model Analysis, O'Reilly AI Event


OpenRouter (Alex Atallah) ▷ #announcements (122 messages🔥🔥):

Terms & Privacy Policy Update, Free Model Limits, Gemini 2.5 Flash Model


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

LLM Cost Simulator, Vibe-coded LLM Chat Application


OpenRouter (Alex Atallah) ▷ #general (636 messages🔥🔥🔥):

OpenAI Codex not working with OpenRouter, BYOK, DeepSeek R3 and R4, OpenAI verification, API usage limits


OpenAI ▷ #ai-discussions (476 messages🔥🔥🔥):

Gemini 2.5 Pro vs o3/o4 models, o4 models hallucinations, GPT-4.1 Nano, Gemini 2.5 Flash


OpenAI ▷ #gpt-4-discussions (19 messages🔥):

o4-mini vs o3-mini for knowledge, GPT-4.5 speed, Custom GPT Instructions ignored, o4-mini usage limits, PDF Uploads for Study


OpenAI ▷ #prompt-engineering (9 messages🔥):

Image Generation, Contextual Memory on GPTPlus, Multi-Modular System


OpenAI ▷ #api-discussions (9 messages🔥):

Image generation prompts, r/chatgpt subreddit images, Textual prompts on GPT Plus accounts, Multi-modular system


Cursor Community ▷ #general (432 messages🔥🔥🔥):

Refund issues with Cursor subscription, DeepCoder 14B, GPT 4.1 Pricing, Cursor Terminal Hanging Fixes, Zed AI IDE


Yannick Kilcher ▷ #general (332 messages🔥🔥):

Brain connectivity, Responses API, Liquid State Machines, Meta-Simulation


Yannick Kilcher ▷ #paper-discussion (10 messages🔥):

Ultrascale Playbook Review, GPU layouts for large models, InternVL3 paper discussion


Yannick Kilcher ▷ #ml-news (8 messages🔥):

IBM Granite, Trump Administration Deepseek Ban, Brain Matter Music, Infantocracy, Blt weights


Manus.im Discord ▷ #general (347 messages🔥🔥):

Banning Discussion, Claude UI update, Game development with Manus, AI tools, Game Engines


Eleuther ▷ #general (310 messages🔥🔥):

AI-generated content, Human authentication, AI-influenced postings, Stochastic environments, LoRA-like styling for pretraining


Eleuther ▷ #research (11 messages🔥):

Quantization Effects on LLMs, Composable Interventions Paper, Muon for Output Layers, Empirical Performance of Muon


Eleuther ▷ #lm-thunderdome (1 messages):

lm-evaluation-harness PR


HuggingFace ▷ #general (99 messages🔥🔥):

sudolang, LayoutLMv3 vs Donut, Agents course deadline, Illustrious Models for anime, nVidia vs AMD


HuggingFace ▷ #today-im-learning (4 messages):

Chunking structured files into embeddings, Nomic embed text model, Python scripts and virtual environments, Hugging Face usage, Mistral-7b model


HuggingFace ▷ #cool-finds (6 messages):

Cable Management, Nuclear Energy Stagnation, Portable Microreactors, Vogtle Reactor Units, China's Energy Production


HuggingFace ▷ #i-made-this (14 messages🔥):

Tokenizer without text corpus, AI hallucinations, TRNG for AI training, Agent integrate with local ollama model, oarc-crawlers


HuggingFace ▷ #reading-group (5 messages):

Reading Group Session, YouTube Recordings


HuggingFace ▷ #computer-vision (5 messages):

Lightweight Multimodal Models, Model Memory Usage, InterVL2_5-1B-MPO, gemma-3-4b-it, InternVL3 Paper


HuggingFace ▷ #smol-course (9 messages🔥):

Agent Course Certification, Inference Credits, PromptTemplate format


HuggingFace ▷ #agents-course (40 messages🔥):

Ollama library model usage, Course assignment confusion, Agents Course 503 error, Course completion deadline


GPU MODE ▷ #general (6 messages):

OpenCL, SYCL, lu.ma/vibecode


GPU MODE ▷ #triton (6 messages):

fp16 matrix multiplication, triton autotune, TTIR optimization, kernel overhead


GPU MODE ▷ #cuda (14 messages🔥):

cuda::pipeline usage, H200 FP4 support, PyTorch float4 on 5090


GPU MODE ▷ #torch (4 messages):

AOTInductor, torch.compile, OpenXLA, libtorch C++


GPU MODE ▷ #beginner (3 messages):

CUDA learning resources, PyTorch on 5090, GPU puzzle repo


GPU MODE ▷ #torchao (2 messages):

``


GPU MODE ▷ #off-topic (9 messages🔥):

Slurm, HPC, Deployment, Admin Guides, Quickstart Admin Guide


GPU MODE ▷ #rocm (2 messages):

AMD challenge, compute resources, kernel submission, discord-cluster-manager, datamonsters


GPU MODE ▷ #self-promotion (1 messages):

x.com post by @mobicham


GPU MODE ▷ #general (10 messages🔥):

popcorn register, CLI submission errors, Discord/Github registration


GPU MODE ▷ #submissions (57 messages🔥🔥):

AMD FP8 MM Leaderboard, MI300 Performance, Matmul Benchmarking, AMD Identity Leaderboard


GPU MODE ▷ #status (3 messages):

CLI Tool, HIP code submission


GPU MODE ▷ #feature-requests-and-bugs (3 messages):

New CLI Release, Submission Fixes


GPU MODE ▷ #amd-competition (36 messages🔥):

MI300 usage statistics, Debugging Kernels, FP8 numerical precision finetuning, Team Registration, Torch Header improvements


GPU MODE ▷ #cutlass (2 messages):

Mx Cast Kernel, Cutlass Performance Bottleneck, CuTensorMap API, TMA usage with Cutlass


LM Studio ▷ #general (71 messages🔥🔥):

NVMe vs SATA SSD, Image models in LM Studio, RAG capable models in LM Studio, Granite Model Use-Cases, 5090 GPU and LM Studio


LM Studio ▷ #hardware-discussion (71 messages🔥🔥):

FP4 support in PyTorch and vLLM, AVX requirement for LM Studio, 5060Ti 16GB, GPU upgrade from RTX 3060 12GB


Unsloth AI (Daniel Han) ▷ #general (52 messages🔥):

Llama 4 timeline, Multi-GPU support, Custom tokens finetuning, Qwen 2.5 finetuning, Chat template importance


Unsloth AI (Daniel Han) ▷ #off-topic (19 messages🔥):

MetaAI's offensive output, Iggy's fake output streaming, Phishing website's emailjs key, MediBeng-Whisper-Tiny model


Unsloth AI (Daniel Han) ▷ #help (21 messages🔥):

OOM Issue, Tool Calls, Llama 4, LoRA hot swap, Multi-GPU delayed


Unsloth AI (Daniel Han) ▷ #showcase (8 messages🔥):

PolyThink, AI Hallucinations, Multi-Model AI System


Unsloth AI (Daniel Han) ▷ #research (14 messages🔥):

Untrained Deep Neural Networks, Mistral AI integration differences, Memory Latency Aware (MLA), GRPO trainer for reasoning, Qwen 2.5 3B model


Nous Research AI ▷ #general (58 messages🔥🔥):

GPT4o as Completion Model, Huawei Leading Globally, BitNet b1.58 2B 4T, Discord's Future


Nous Research AI ▷ #ask-about-llms (18 messages🔥):

o4mini, Gemini 2.5 pro, MCP servers, agent functionality of copilot, Tools in a coding environment


Nous Research AI ▷ #research-papers (1 messages):

BitNet b1.58 2B4T, 1-bit LLM, Hugging Face


Nous Research AI ▷ #research-papers (1 messages):

BitNet b1.58 2B4T, Native 1-bit LLM, Hugging Face model release, Computational Efficiency, Memory Footprint Reduction


Notebook LM ▷ #use-cases (8 messages🔥):

Gemini Pro, Deep Research, Accounting Month End Process, Vacation Itinerary, Google Maps


Notebook LM ▷ #general (45 messages🔥):

Webby Awards Voting, NotebookLM Mindmap details, NotebookLM Enterprise SSO Setup, NotebookLM Plus Admin Controls, NotebookLM uses RAG


MCP (Glama) ▷ #general (51 messages🔥):

Obsidian MCP Server, Cloudflare Workers SSE API Key, LLM Tool Understanding, MCP Personalities, MCP server time


MCP (Glama) ▷ #showcase (1 messages):

HeyGen API, MCP Server Release, Video Creation Platform


Torchtune ▷ #general (23 messages🔥):

GRPO Recipe Todos, PPO tasks, Single GPU GRPO Recipe, Reward Modeling RFC


Torchtune ▷ #papers (1 messages):

Titans Talk


LlamaIndex ▷ #blog (1 messages):

A2A Agents, Agent Communication, LlamaIndex support for A2A


LlamaIndex ▷ #general (22 messages🔥):

CondenseQuestionChatEngine Tool Support, Anthropic Bedrock Prompt Caching, Anthropic support with LlamaIndex


Nomic.ai (GPT4All) ▷ #general (11 messages🔥):

GPT4All Future, IBM Granite 3.3 for RAG, LinkedIn inquiry status


Modular (Mojo 🔥) ▷ #general (1 messages):

Modular Meetup, Mojo & MAX, GPU performance


Modular (Mojo 🔥) ▷ #mojo (6 messages):

MLIR in Mojo, Mojo Dict Pointer Behavior, FP languages optimizations


Modular (Mojo 🔥) ▷ #max (1 messages):

Orphan cleanup mechanism, Partitioned Disk, max repo


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

Lean auto-formalizer, Formal verification of programs, AI proof generation, Informal proofs, Computer code


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 messages):

CIRIS Covenant 1.0-beta Release, Open-Source AI Alignment Framework, Adaptive-Coherence AI Alignment


MLOps @Chipro ▷ #general-ml (2 messages):

Ensembling forecasting models, Final Year Project Ideas


Cohere ▷ #「💬」general (1 messages):

AI Model Development, Staged Training Process


Codeium (Windsurf) ▷ #announcements (1 messages):

New Discussion Channel, Windsurf Jetbrains Changelog





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}