Frozen AI News archive

Grok 3 & 3-mini now API Available

**Grok 3** API is now available, including a smaller version called Grok 3 mini, which offers competitive pricing and full reasoning traces. **OpenAI** released a practical guide for building AI agents, while **LlamaIndex** supports the Agent2Agent protocol for multi-agent communication. **Codex CLI** is gaining traction with new features and competition from **Aider** and **Claude Code**. **GoogleDeepMind** launched **Gemini 2.5 Flash**, a hybrid reasoning model topping the Chatbot Arena leaderboard. **OpenAI**'s o3 and o4-mini models show emergent behaviors from large-scale reinforcement learning. **EpochAIResearch** updated its methodology, removing **Maverick** from high FLOP models as **Llama 4 Maverick** training compute drops. **GoodfireAI** announced a $50M Series A for its Ember neural programming platform. **Mechanize** was founded to build virtual work environments and automation benchmarks. **GoogleDeepMind**'s Quantisation Aware Training for Gemma 3 models reduces model size significantly, with open source checkpoints available.

Canonical issue URL

AI News for 4/17/2025-4/18/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (211 channels, and 8290 messages) for you. Estimated reading time saved (at 200wpm): 650 minutes. You can now tag @smol_ai for AINews discussions!

Grok 3 (our coverage here) has been out for a couple of months, but wasn't API available. Now it is, with a bonus baby brother!

image.png

At 50 cents per output mtok, Grok 3 mini claims to be competitive with much larger frontier models, while displaying full reasoning traces:

image.png

You can get started here: https://docs.x.ai/docs/overview


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Agent Tooling, Frameworks, and Design

Model Updates, Releases, and Performance

Companies and Funding

Efficiency and Infrastructure

New AI Techniques

Broader Implications

AI and the China/U.S. Tech War

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. Google Gemma 3 QAT Quantization and Ecosystem Launches

2. Novel LLM Benchmarks: VideoGameBench & Real-time CSM 1B

3. Local-first AI Tools, Visualization, and Community Projects

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI o3 and GPT-4o User Experiences and Capabilities

2. New LLM and AI Model Benchmarks and Releases

3. AI Industry Infrastructure and Pricing Updates


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. New Models Enter the Ring: Gemini, Grok, and Dayhush Stir Competition

Theme 2. Framework Deep Dives: Mojo Pointers and tinygrad Bugs

Theme 3. Practical AI Tooling: From Financial Parsing to Agent Chats

Theme 4. AI Framework Integration & Features: LlamaIndex, Perplexity, Modular

Theme 5. Decoding AI "Thinking": Architecture, Reasoning, and Primer Power


PART 1: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


DSPy Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Perplexity AI ▷ #announcements (2 messages):

Telegram Bot, GPT 4.1 Release, Perplexity Travel, Trending Discover Feed, NBA Playoffs


Perplexity AI ▷ #general (1186 messages🔥🔥🔥):

LAN parties, Retirement at 30, Perplexity AI vs GPT-4, Mistral OCR, Gemini Live


Perplexity AI ▷ #sharing (4 messages):

Google Lawsuit, Gemma 3 QAT models, K2-18b exoplanet


Perplexity AI ▷ #pplx-api (3 messages):

Image Uploads, Date Range Filter, New Pricing Scheme, First Hackathon with Devpost


LMArena ▷ #general (1070 messages🔥🔥🔥):

Dayhush vs Nightwhisper, Gemini 2.5 Pro vs Grok 3 Mini, Polymarket & LM Arena, AlphaZero, AlphaGo, New models and model releases


LMArena ▷ #announcements (1 messages):

Beta feedback, Dark/Light mode toggle, Copy/paste images, Leaderboard polish


Modular (Mojo 🔥) ▷ #general (1 messages):

Modular Meetup, Mojo & MAX, GPU Optimization, In-Person Event, Virtual Attendance


Modular (Mojo 🔥) ▷ #mojo (17 messages🔥):

MLIR arith dialect in Mojo, Dict value pointer copies and moves, Variadic set element moves, Ratio type representation, Parallelization performance


LlamaIndex ▷ #blog (4 messages):

Agent2Agent Protocol, Gemini 2.5 flash, LlamaExtract, Google Cloud Next 2025


LlamaIndex ▷ #general (10 messages🔥):

LlamaIndex A2A Deployment, LlamaIndex Version Change


Cohere ▷ #「💬」general (7 messages):

AI Model Staged Training, Command A Reasoning, FP8 limitations, SEO Backlinks


Cohere ▷ #「💡」projects (1 messages):

Python package release, NLP pipeline tooling, Earnings call report parsing, Data variation challenges, Regex learning


Nomic.ai (GPT4All) ▷ #general (4 messages):

GPT4All status on LinkedIn, Speech recognition mode for GPT4All, MCP server configuration


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

Pattern Matcher Bug, Rockchip 3588 Crash with Beam Search


DSPy ▷ #show-and-tell (1 messages):

Reasoning Models, Primer on Reasoning Models






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}