Frozen AI News archive

lots of small launches

**GPT-4o Advanced Voice Preview** is now available for free ChatGPT users with enhanced daily limits for Plus and Pro users. **Claude 3.7 Sonnet** has achieved the top rank in WebDev Arena with improved token efficiency. **DeepSeek-R1** with 671B parameters benefits from the **Together Inference** platform optimizing NVIDIA Blackwell GPU usage, alongside the open-source **DeepGEMM** CUDA library delivering up to 2.7x speedups on Hopper GPUs. **Perplexity** launched a new Voice Mode and a **Deep Research API**. The upcoming **Grok 3 API** will support a 1M token context window. Several companies including **Elicit**, **Amazon**, **Anthropic**, **Cloudflare**, **FLORA**, **ElevenLabs**, and **Inception Labs** announced new funding rounds, product launches, and model releases.

AI News for 2/25/2025-2/26/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (221 channels, and 7040 messages) for you. Estimated reading time saved (at 200wpm): 725 minutes. You can now tag @smol_ai for AINews discussions!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

AI Model Updates & Releases, focusing on new models, features, and versions

AI Tools, Libraries, and Datasets, covering frameworks, code, and resources

Research, Analysis, and Benchmarks, covering evaluations, performance, and insights

Industry and Company Announcements, covering partnerships, funding, and events

Opinions and Discussions, covering broader AI perspectives and commentary


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepGEMM Offers Efficient FP8 General Matrix Multiplications

Theme 2. Nvidia Gaming GPUs with Increased VRAM Enter Chinese Cloud Market

Theme 3. DeepSeek API Platform Introduces Off-Peak Discounts

Theme 4. TinyR1-32B Outperforms Official R1 Distills

Theme 5. Perplexity's Plan to Fork Chrome for AI Browsing

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Claude 3.7 Disruption in AI Development and Personal Assistance


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. AI IDE Showdown: Cursor Flexes, Windsurf Waffles

Theme 2. Claude 3.7: Leaks, Lies, and Load Balancing

Theme 3. DeepSeek's Deep Dive: Price Cuts and Performance Peaks

Theme 4. Open Source LLM Dev: High School Hustle & Hardware Hangups

Theme 5. Perplexity's Push & OpenAI's API Expansions


PART 1: High level Discord summaries

Cursor IDE Discord


Codeium (Windsurf) Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


Perplexity AI Discord


Latent Space Discord


Cohere Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


Nomic.ai (GPT4All) Discord


MCP (Glama) Discord


Torchtune Discord


Stability.ai (Stable Diffusion) Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (812 messages🔥🔥🔥):

Augment Code AI, Zed Editor vs Cursor, MCP servers, Cursor's chat summary, Claude-code

Links mentioned:


Codeium (Windsurf) ▷ #discussion (30 messages🔥):

JetBrains AI Assistant, Claude 3.7 Sonnet, Codeium Extension, Augment Code, Codeium Emacs support

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (756 messages🔥🔥🔥):

Claude 3.7 Credit Consumption, Windsurf vs. Cursor, Windsurf Stability and Errors, Codeium Support, Windsurf Editor UX Issues

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (690 messages🔥🔥🔥):

Qwen Max release, olmOCR Model, Frameworks new desktop, bitsandbytes library, DeepSeek's GRPO

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (45 messages🔥):

Claude 3.7, RLOO and PPO Implementation with Unsloth, GRPO vs RLOO, TRL library editing


Unsloth AI (Daniel Han) ▷ #help (66 messages🔥🔥):

Tokenizer Issues with DeepSeek Models, GGUF conversion Problems with Llama 3.2, Infinite generation and chaotic output with Qwen2.5, LlamaForCausalLM Error, Unsloth Enterprise Pricing

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (11 messages🔥):

Paddler Load Balancer, SlamKit Speech Language Model Training, 4090 GPU server Self-Hosting

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

structured output method, token mask, constrained generation, open-source community support


OpenAI ▷ #annnouncements (2 messages):

ChatGPT Plus Deep Research, GPT-4o Mini Preview


OpenAI ▷ #ai-discussions (664 messages🔥🔥🔥):

GPT-4.5 rumors, Alexa+ launch, DeepSeek R1, Claude Pro limits, OpenAI vs. competitors

Links mentioned:


OpenAI ▷ #gpt-4-discussions (10 messages🔥):

Darker side of AI, GPT Moderation Rules, GPT Replication of Deep Research, Conscious AI


OpenAI ▷ #prompt-engineering (25 messages🔥):

o3-mini-high for coding, Programming Disassembler with ChatGPT, Prompt Engineering for Learning


OpenAI ▷ #api-discussions (25 messages🔥):

o3-mini-high coding issues, Prompt Engineering for Beginners, ChatGPT as a Disassembler, LLMs for Algebra and Calculus, Creative Outputs from LLMs


aider (Paul Gauthier) ▷ #general (673 messages🔥🔥🔥):

Deepseek R2, Claude Code Leak, MCP Servers, Windsurf Editor, Rust vs Python

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (81 messages🔥🔥):

Aider with Claude Sonnet for free, Gemini 1.5 Pro vs GPT-3.5 for code editing, Groq’s Llama 3 70B for free, Avante uses Groq's Llama-3.3-70b-versatile for applying diffs, Sonnet 3.7 ridiculously overkeen

Links mentioned:


aider (Paul Gauthier) ▷ #links (2 messages):

R1 verification, Microsoft Trace framework, ax-llm/ax GitHub repository

Link mentioned: GitHub - ax-llm/ax: The "official" unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Sonnet 3.7 Switchover, Cross-Model Reasoning Standard, Reasoning Parameter

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (241 messages🔥🔥):

Reasoning Tokens, Prompt Caching, DeepSeek API pricing, Claude 3.7, OpenRouter API Keys

Links mentioned:


Perplexity AI ▷ #announcements (1 message):

Perplexity Voice Mode, iOS App Update


Perplexity AI ▷ #general (166 messages🔥🔥):

Context window size, Comet Browser launch, Voice mode functionality, Coding with perplexity, Claude 3.7 Sonnet hallucinations

Links mentioned:


Perplexity AI ▷ #sharing (8 messages🔥):

Ruby Script Generation via Perplexity, Anthropic's Pokemon AI Benchmarks, Trump Fires Military Leaders news item, Meta's 200 Billion AI Compute Investment


Perplexity AI ▷ #pplx-api (6 messages):

Perplexity Deep Research API, Developer Meetup SF, Sonar Deep Research in Playground, Uploading files through API

Link mentioned: Tweet from Aravind Srinivas (@AravSrinivas): As a cricket nerd, throwing Perplexity Deep Research API on all of cricket data and stats and getting into rabbit holes would be fun. Who has a good stats repository here? Anyone who wants to build th...


Latent Space ▷ #ai-general-chat (76 messages🔥🔥):

Assistants API file search, Claude Plays Pokémon, Claude Sonnet Web vs API, OpenAI Deep Research, Raycast AI Extensions

Links mentioned:


Latent Space ▷ #ai-announcements (2 messages):

LLM Paper Club, Raycast AI

Links mentioned:


Cohere ▷ #discussions (56 messages🔥🔥):

Local LLM training code, Cohere models in OpenAI SDK, Open Source vs Paid Code, OpenAI SDK Integration

Links mentioned:


Cohere ▷ #announcements (1 message):

Compatibility API, OpenAI SDK, Cohere Models

Link mentioned: Using Cohere models via the OpenAI SDK — Cohere: The document serves as a guide for Cohere's Compatibility API, which allows developers to seamlessly use Cohere's models using OpenAI's SDK.


Cohere ▷ #api-discussions (7 messages):

Cohere API blocking VPS, Token counting changes, Cohere API availability


Eleuther ▷ #general (3 messages):

HuggingFace deprecation, RAG tool


Eleuther ▷ #research (18 messages🔥):

KV Cache Compression, Activation Steering, Deepseek DeepGEMM Kernel, Data Mixing Optimization

Link mentioned: MixMin: Finding Data Mixtures via Convex Minimization: Modern machine learning pipelines are increasingly combining and mixing data from diverse and disparate sources, e.g., pre-training large language models. Yet, finding the optimal data mixture is a ch...


Eleuther ▷ #scaling-laws (2 messages):

Bigger Models, Ensembling, Flops


Eleuther ▷ #interpretability-general (5 messages):

SAEs, Weight tying in SAEs, Orthogonal features in SAEs


Eleuther ▷ #lm-thunderdome (27 messages🔥):

lm-evaluation-harness setup in a notebook, Local LLM API endpoints running via TRT, GPQA implementation

Links mentioned:


Eleuther ▷ #gpt-neox-dev (3 messages):

GQA in NeoX, Llama models export issues

Link mentioned: fix a GQA issue (#1314) by tiandeyu-cs · Pull Request #1315 · EleutherAI/gpt-neox: fix a GQA issue (#1314) do not create a fake head dim and split the 'mixed_x_layer' into QKV layers directly.


Modular (Mojo 🔥) ▷ #general (5 messages):

Modular MAX and Mojo repo changes, Mojo's standalone language status

Link mentioned: Upcoming changes to our GitHub repositories: Tomorrow (February 27), we’re streamlining our GitHub repositories! The max repo is merging into the mojo repo, bringing everything under one roof. A new subdirectory will house the Mojo standard libr...


Modular (Mojo 🔥) ▷ #mojo (44 messages🔥):

EmberJSON, Mojo auto-parallelization, algorithm package isn't open source, speedup using list.get_unsafe, smart iterators in Mojo

Links mentioned:


Yannick Kilcher ▷ #general (35 messages🔥):

Alignment tradeoff, DTMF, Google Experiments, Apple speech-to-text Trump issue, Claude 3.7

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (5 messages):

LIMO, Speculative Decoding, ipfs_accelerate_py

Links mentioned:


Yannick Kilcher ▷ #ml-news (2 messages):

ChatGPT plugins, Mystery model


Nomic.ai (GPT4All) ▷ #general (40 messages🔥):

CSV indexing, ModernBert models, Nomic Embed Text V2 Deployment, GPT4ALL roadmap, File splitting

Links mentioned:


MCP (Glama) ▷ #general (23 messages🔥):

Claude Code line numbers, Model Context Protocol (MCP), MCP Server Implementation, SSE server

Links mentioned:


MCP (Glama) ▷ #showcase (3 messages):

FastMCP, Typescript, Custom Authentication

Links mentioned:


Torchtune ▷ #dev (19 messages🔥):

StatefulDataLoader, single device recipes, truncation and skipping

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (15 messages🔥):

VRAM Efficiency, AI HackXelerator, Scammer Alert, Regional prompting

Links mentioned:


tinygrad (George Hotz) ▷ #general (5 messages):

good PRs for new people, TestSpeed.test_sum Performance issues, arange GROUP optimization, BEAM search adjustments

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (7 messages):

UOp Signatures, safetensors computation graphs, TestLinearizerFailures


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Agent Memory, Feedback Mechanism for Agents


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 message):

hritabanghosh: https://discord.gg/ETxqXCfh




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}