Frozen AI News archive

Anthropic's $61.5B Series E

**Anthropic** raised a **$3.5 billion Series E funding round** at a **$61.5 billion valuation**, signaling strong financial backing for the **Claude** AI model. **GPT-4.5** achieved **#1 rank across all categories** on the LMArena leaderboard, excelling in multi-turn conversations, coding, math, creative writing, and style control. **DeepSeek R1** tied with GPT-4.5 for top performance on hard prompts with style control. Discussions highlighted comparisons between **GPT-4.5** and **Claude 3.7 Sonnet** in coding and workflow applications. The importance of the **LMSYS benchmark** was emphasized, though some questioned the relevance of benchmarks versus user acquisition. Additionally, **Perplexity AI** partnered with **Deutsche Telekom** to integrate the **Perplexity Assistant** into a new AI phone.

Canonical issue URL

AI News for 3/3/2025-3/4/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (221 channels, and 4084 messages) for you. Estimated reading time saved (at 200wpm): 481 minutes. You can now tag @smol_ai for AINews discussions!

Their brief blogpost here. It's not technical news, but it's still only every other week that a frontier lab raises money, and more money for Claude is only good news for AI Engineers.

Meanwhile, GPT 4.5 rated #1 across the board on LMArena. For posterity, here is where the current rankings lie under style control. Claude has a ways to go yet to reclaim frontier status.

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Model Performance & Benchmarks, Comparisons and Evaluations

Industry News, Funding, and Partnerships

Tools, Frameworks, and Coding Workflows

Research and Papers

AI in Business & Applications

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Atom of Thoughts Enhancing Smaller Models

Theme 2. Klee Open-Sourced for Local LLM Use with Zero Data Collection

Theme 3. Split Brain 'DeepSeek-R1-Distill-Qwen' and 'Llama' Fusion Architecture

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

TO BE COMPLETED


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. IDE Wars: Cursor Stumbles, Windsurf Surfs On, and Plugin Pains Persist

Theme 2. Claude 3.7: Speed Bumps and Credit Crunch, But Still Impresses

Theme 3. AI Models: New Releases, Performance Quirks, and Ethical Quandaries

Theme 4. Hardware Hustles: Tilelang Triumphs, AMD's Ascent, and SRAM Secrets

Theme 5. Agent Innovations and Frustrations: Travel Planning AI, Smol Agent Quiz Fails, and MCP Multi-Agent Visions


PART 1: High level Discord summaries

Cursor IDE Discord


Codeium (Windsurf) Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


HuggingFace Discord


aider (Paul Gauthier) Discord


GPU MODE Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Nous Research AI Discord


Interconnects (Nathan Lambert) Discord


Yannick Kilcher Discord


Notebook LM Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


MCP (Glama) Discord


DSPy Discord


LlamaIndex Discord


Latent Space Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (745 messages🔥🔥🔥):

Cursor IDE, MCP, Landing Page Design, Model Performance, Repo Prompt

Links mentioned:


Codeium (Windsurf) ▷ #announcements (1 messages):

Windows ARM support, Windsurf Next, Ubuntu 24.04, Claude 3.7 Sonnet, MCP Tools

Links mentioned:


Codeium (Windsurf) ▷ #discussion (37 messages🔥):

Codeium Pro issues and snoozing, Supercomplete availability in Codeium, Visual Studio Codeium extension versions, JetBrains extension issues

Link mentioned: GitHub - Exafunction/CodeiumVisualStudio: Visual Studio extension for Codeium: Visual Studio extension for Codeium. Contribute to Exafunction/CodeiumVisualStudio development by creating an account on GitHub.


Codeium (Windsurf) ▷ #windsurf (432 messages🔥🔥🔥):

Codeium customer support, Premium flow action credits, Windsurf's new update, Claude 3.7, Multiple selection using CTRL+D

Links mentioned:


OpenAI ▷ #annnouncements (2 messages):

Sora onboarding session, Sora prompt crafting


OpenAI ▷ #ai-discussions (423 messages🔥🔥🔥):

Mirror sites with pro accounts, GPT-4.5 Image Recognition, GPT prioritization of Pro vs Plus, Switching from ChatGPT to Grok, Gemini Free vs Pro Features

Links mentioned:


OpenAI ▷ #gpt-4-discussions (30 messages🔥):

GPT Model Selection, Projects vs GPTs, Claude 3.7 vs ChatGPT, Context window size comparison, Clearing Chatlogs and Uploaded Data


OpenAI ▷ #prompt-engineering (2 messages):

Dall-E image generation, Synthetic plants, Image prompting strategies


OpenAI ▷ #api-discussions (2 messages):

Dall-E image generation, Synthetic plants growing organs for transplant


Unsloth AI (Daniel Han) ▷ #general (252 messages🔥🔥):

Llama zipping WAVs confusion, GRPO training steps, 4-bit model saving issues, Unsloth team size, Continued pretraining of Unsloth

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (10 messages🔥):

Github Issues, inline_asm, GRPO, VLLM, Online Training

Link mentioned: stickbreaking-attention/stickbreaking_attention/sb_varlen/softplus.py at main · shawntan/stickbreaking-attention: Stick-breaking attention. Contribute to shawntan/stickbreaking-attention development by creating an account on GitHub.


Unsloth AI (Daniel Han) ▷ #help (54 messages🔥):

GRPO Training, Qwen2.5-14B-instruct fine-tuning, DeepSeek-R1-Distill-Llama-8B error, Mistral embedding model, GCC compiler issue

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (94 messages🔥🔥):

GRPO Reward Functions, Distilling Models for Agent Skeletons, SWE-bench Performance, String Replacement for Code Editing, Model Tool-Calling and Composition

Links mentioned:


Perplexity AI ▷ #general (378 messages🔥🔥):

Perplexity AI Bugs, Claude 3 Opus vs Sonnet, Deepseek Propaganda?, Perplexity AI Business Fellowship, GPT-4.5 Quality Concerns

Links mentioned:


Perplexity AI ▷ #sharing (7 messages):

Shareable threads, Perplexity AI integrations, Ingredient breakdowns


Perplexity AI ▷ #pplx-api (3 messages):

Open Source Claude-Code, Perplexity API Limitations, Obsidian Web Clipper Issue

Link mentioned: Tweet from Aravind Srinivas (@AravSrinivas): If anyone wants to build an open source Claude-Code with some editor integrations and extensions, Perplexity would be happy to provide free API credits. Please DM @GregFeingold and @AarashHeydari


HuggingFace ▷ #general (118 messages🔥🔥):

RL fundamentals, DeepMind RL Course, Automated video generation, OpenAI image generation alternatives, Fine-tuning Phi-3 for Multi-modality

Links mentioned:


HuggingFace ▷ #today-im-learning (1 messages):

VLMs, Cracking VLMs

Link mentioned: VLM Notes: VLM Notes TODO : Understand how preprocessor handles images : More work into explaining vision_encoder Resources : https://github.com/merveenoyan/smol-vision Smolvlm and idefics Moondream especia...


HuggingFace ▷ #cool-finds (7 messages):

Dataset Viewer Errors, FastRTC


HuggingFace ▷ #i-made-this (7 messages):

AI Story Studio, MoD ControlNet Tile Upscaler, VAE comparison, Remote VAE from HF, Cross-device browser-based scratchpad

Links mentioned:


HuggingFace ▷ #core-announcements (1 messages):

Remote VAE Decode endpoints, Hybrid Inference, SD v1, SD XL and Flux

Link mentioned: Hybrid Inference: no description found


HuggingFace ▷ #computer-vision (3 messages):

Audio to Video matching, ViT resources, ViT and Global Average Pooling


HuggingFace ▷ #NLP (2 messages):

Web scraping with Python, Running Phi-4 as real-time API


HuggingFace ▷ #gradio-announcements (1 messages):

Gradio, Groovy, Python to Javascript

Link mentioned: Client Side Functions: A Step-by-Step Gradio Tutorial


HuggingFace ▷ #smol-course (5 messages):

Smol Agents Quiz, NLP Reasoning Course, ClaudePlaysPokemon replication with smolagents

Links mentioned:


HuggingFace ▷ #agents-course (87 messages🔥🔥):

Introductions, Lambda Go Labs, CodeAgent LLM Size, Quiz Grader Issues, Inference Credits Exhaustion

Links mentioned:


HuggingFace ▷ #open-r1 (2 messages):

Replicant model training, R1 reasoning dataset for coding tasks


aider (Paul Gauthier) ▷ #general (121 messages🔥🔥):

Aider Leaderboard, Claude Code, Grok vs O3 Mini, anon-kode, python + uv

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (85 messages🔥🔥):

Gemini 2.0 Pro Model issues, Aider + RAG/Vector embeddings, Aider editing with git diff, Aider with OpenRouter models and edit modes, Aider Architect mode

Link mentioned: GitHub - lutzleonhardt/copilot-proxy: Copilot Proxy is a Visual Studio Code extension that exposes the VS Code Language Model API via an Express server. This experimental extension is intended solely for research and prototyping purposes and should not be used in production environments.: Copilot Proxy is a Visual Studio Code extension that exposes the VS Code Language Model API via an Express server. This experimental extension is intended solely for research and prototyping purpos...


GPU MODE ▷ #general (1 messages):

Vision Models, Attention based ViTs, MLP-Mixer

Link mentioned: MLP-Mixer: An all-MLP Architecture for Vision: Convolutional Neural Networks (CNNs) are the go-to model for computer vision. Recently, attention-based networks, such as the Vision Transformer, have also become popular. In this paper we show that w...


GPU MODE ▷ #triton (54 messages🔥):

SRAM vs Cache Confusion, Triton Scalar Constants Data Type, CUDA backend hyper-parameters, Triton Autotuning Resources, Triton BLAS Implementations

Link mentioned: Block Scaled Matrix Multiplication — Triton documentation: no description found


GPU MODE ▷ #cuda (25 messages🔥):

FP8 GEMM in CUTLASS, Determine Architecture for NVCC, Flash Attention Indexing

Links mentioned:


GPU MODE ▷ #torch (17 messages🔥):

FSDP2 OffloadPolicy, register_post_accumuate_grad_hook, load_inline CUDA kernels, reduce and not scatter, optimizer scaling


GPU MODE ▷ #algorithms (5 messages):

fa3, absmax quantization, hada transform


GPU MODE ▷ #jobs (1 messages):

Internship Opportunity, Low-Level Programming, LLM Inference, Mobile and PC Platforms

Link mentioned: GitHub - githubpradeep/llm_np_cp: running llama gemma on cupy and numpy: running llama gemma on cupy and numpy. Contribute to githubpradeep/llm_np_cp development by creating an account on GitHub.


GPU MODE ▷ #beginner (5 messages):

Triton tensor creation, ROCm support for RX 7800 XT, NVIDIA GPU alternatives


GPU MODE ▷ #self-promotion (8 messages🔥):

Tilelang Kernel, Deepseek flashmla, MLA leaderboard, Bitnet group

Link mentioned: tilelang/examples/deepseek_mla at main · tile-ai/tilelang: Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels - tile-ai/tilelang


GPU MODE ▷ #reasoning-gym (11 messages🔥):

Chain of Draft PR, Throttling errors

Link mentioned: Chain of Draft: Thinking Faster by Writing Less: Large Language Models (LLMs) have demonstrated remarkable performance in solving complex reasoning tasks through mechanisms like Chain-of-Thought (CoT) prompting, which emphasizes verbose, step-by-ste...


GPU MODE ▷ #gpu模式 (4 messages):

Tilelang, MLA, FlashMLA, Python


GPU MODE ▷ #general (1 messages):

prefixsum submission, H100 submission


GPU MODE ▷ #submissions (17 messages🔥):

Leaderboard Submissions, Leaderboard Name Mismatches, Successful Submissions, GPU Usage


GPU MODE ▷ #ppc (3 messages):

AVX512, FMA instruction, Performance Improvement


GPU MODE ▷ #feature-requests-and-bugs (10 messages🔥):

L4 & T4 Timeout, AMD MI300s, Beta Launch


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Travel Reels, AI agents, Trip Planning

Link mentioned: ThatSpot Guide: no description found


OpenRouter (Alex Atallah) ▷ #general (126 messages🔥🔥):

Google Flash 2.0 Error, Claude 3.7 Sonnet Rate Limits, OpenRouter API Key with VS Studio/RooCode, BYOK azure models in openrouter, Accessing Links in Chat Models

Links mentioned:


LM Studio ▷ #announcements (1 messages):

LM Studio SDK, Python, TypeScript, Agent API, MIT License

Links mentioned:


LM Studio ▷ #general (100 messages🔥🔥):

Context Length Error, Model Architecture unsupported by Llama.cpp, LM Studio CLI Commands, LM Studio SDKs, LM Studio Downgrading

Links mentioned:


LM Studio ▷ #hardware-discussion (21 messages🔥):

AMD and Intel vs CUDA, Vulkan vs CUDA, AMD GPU market share, Nvidia 5090 specs


Nous Research AI ▷ #general (93 messages🔥🔥):

Low-rank space reasoning, Nous API, CUDA Kernels, Hermes 3 erotic fiction, Ollama usability

Links mentioned:


Nous Research AI ▷ #research-papers (8 messages🔥):

Logic-RL, Rule-Based Reinforcement Learning, General World Models, Worldsim

Links mentioned:


Nous Research AI ▷ #interesting-links (1 messages):

``

Link mentioned: San: no description found


Nous Research AI ▷ #research-papers (8 messages🔥):

Rule-Based Reinforcement Learning (RL), DeepSeek-R1, Logic-RL, Worldsim, General World Models (GWM) by RunwayML

Links mentioned:


Nous Research AI ▷ #reasoning-tasks (1 messages):

Qwen2.5-Math-1.5B, longcot examples, dataset structuring, setting up the GRPOTrainer


Interconnects (Nathan Lambert) ▷ #news (28 messages🔥):

Unitree Open Source, Gemma 3 Release, GPT-4.5 tops Arena leaderboard, Post-Training Interpretation, Anthropic $3.5B Funding

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (8 messages🔥):

BobbyBroccoli videos, Deep Learning History, Shun-Ichi Amari

Link mentioned: Tweet from loss (derogatory) (@untitled01ipynb): gm this app is free


Interconnects (Nathan Lambert) ▷ #random (34 messages🔥):

Grok3 Pricing, LLM Summarization Ethics, Anon-Kode GitHub, Taiwan Security

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

Anthropic Funding, AI development, International Expansion

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rl (1 messages):

Med-RLVR


Interconnects (Nathan Lambert) ▷ #reads (11 messages🔥):

Post-Training Methodologies for LLMs, In-House Data Labeling for SOTA Models, Human Data vs Synthetic Data, Disentangling Post-training Performance from Data

Links mentioned:


Interconnects (Nathan Lambert) ▷ #policy (3 messages):

TSMC $100B investment in U.S. chip factories

Link mentioned: Tweet from Anissa Gardizy (@anissagardizy8): new: The CEO of TSMC is heading to the White House today to talk about a $100B investment in U.S. chip factories https://www.theinformation.com/briefings/trump-tsmc-to-announce-100-billion-chip-factor...


Yannick Kilcher ▷ #general (44 messages🔥):

Bachelor's degree project ideas in VLMs, Automating IRL jobs, Finding interesting problems to solve, Literature review article in AI, invite link to discord server

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (7 messages):

Joscha Bach, Presentation Time Slot


Yannick Kilcher ▷ #ml-news (3 messages):

Elsagate 3.0

Link mentioned: Elsagate 3.0 Is Worse Than we Thought.: THIS VIDEO IS NOT FOR CHILDREN. VIEWER DISCRETION IS ADVISEDGet a FREE sample pack from Five, just pay shipping (must be 21+): https://bit.ly/FreeFiveRaymund...


Notebook LM ▷ #use-cases (16 messages🔥):

Financial statement analysis in NotebookLM, Podcast length, Notebook Combination, Blog Outline, Podcast customization


Notebook LM ▷ #general (33 messages🔥):

Dynamically updated sources, Google Docs integration, Podcast timelines, Copying and pasting index numbers, Bulk deleting sources

Link mentioned: no title found: no description found


Stability.ai (Stable Diffusion) ▷ #general-chat (40 messages🔥):

IP Adapter, Reactor Faceswap, ControlNet, Reforge AMDGPU support, Zluda


Eleuther ▷ #general (13 messages🔥):

Finding good problems to solve, EleutherAI affiliation projects, RWKV models, 4D gaussian splatting


Eleuther ▷ #research (15 messages🔥):

Reasoning Model, GRPO based Agent, LLAMA 3.2 3B, Recurrent LLM reasoning, Atom of Thoughts (AoT)

Link mentioned: Atom of Thoughts for Markov LLM Test-Time Scaling: Large Language Models (LLMs) achieve superior performance through training-time scaling, and test-time scaling further enhances their capabilities by conducting effective reasoning during inference. H...


Eleuther ▷ #lm-thunderdome (10 messages🔥):

trust_remote_code in lm-evaluation-harness, dataset_kwargs override, dataset loading errors, data_dir specification

Links mentioned:


MCP (Glama) ▷ #general (36 messages🔥):

Terraform Registry MCP issues, MCP Multi-Agent Systems, fast-agent GitHub repo, Claude desktop FastMCP errors, MCP server claiming problems

Links mentioned:


MCP (Glama) ▷ #showcase (2 messages):

MCPHub.nvim, Graphlit MCP Server, Neovim Plugin, Model Context Protocol

Links mentioned:


DSPy ▷ #general (30 messages🔥):

Ash Framework, instructor_ex, Async Support in DSPy, LangProBe Benchmark, Minions Feature Benchmarks

Links mentioned:


LlamaIndex ▷ #blog (2 messages):

Workflow-based travel planner, LlamaParse updates, AnthropicAI Claude Sonnet 3.7, Google Gemini 2.0 Flash


LlamaIndex ▷ #general (19 messages🔥):

AgentWorkflow context vs chat history, MCP Support, PII redaction with LLMs, Anthropic DeltaStream

Link mentioned: llama_index/llama-index-integrations/tools/llama-index-tools-mcp/examples/mcp.ipynb at main · run-llama/llama_index: LlamaIndex is the leading framework for building LLM-powered agents over your data. - run-llama/llama_index


LlamaIndex ▷ #ai-discussion (1 messages):

Windsurf Checkpoints


Latent Space ▷ #ai-general-chat (13 messages🔥):

AI replacing programmers, Senior Engineers vs Junior Engineers, Anthropic Fundraising, Stagehand and Browserbase, Claude Code vs Cursor

Links mentioned:


tinygrad (George Hotz) ▷ #general (10 messages🔥):

tinygrad formalist project, Ops.CAT speed bounty, RDNA2/RX6000 usable with tinygrad, Intel Arc A770 usable with tinygrad

Link mentioned: Tweet from the tiny corp (@tinygrad): What is tinygrad?tinygrad is a formalist project. It attempts to capture the full gamut of software 2.0 in a non leaky abstraction. The methods on Tensor class create a directed graph of immutable RIS...


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

Charles Sutton, Coding Agents, AI for Vulnerability Detection

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (4 messages):

Discord Admin Spam Account Removal, Quiz Posting Schedule


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (4 messages):

Audio issues during lectures


Cohere ▷ #api-discussions (9 messages🔥):

Embed Images, 504 Errors


Modular (Mojo 🔥) ▷ #general (5 messages):

Renaming ownedtoown, Community meeting, AWS GenAI Loft event

Link mentioned: modular/max: The MAX Platform (includes Mojo). Contribute to modular/max development by creating an account on GitHub.


Modular (Mojo 🔥) ▷ #mojo (1 messages):

SIMD DType, Construction Checks, Globals vs Parameters

Link mentioned: max/mojo/stdlib/src/builtin/simd.mojo at main · modular/max: The MAX Platform (includes Mojo). Contribute to modular/max development by creating an account on GitHub.


Torchtune ▷ #general (1 messages):

Step-based checkpointing


Torchtune ▷ #dev (3 messages):

Profiler traces, Tensorboard, PyTorch memory visualizer tool, Perfetto


Nomic.ai (GPT4All) ▷ #general (3 messages):

Ollama vs GPT4All, Catalan Language support for GPT4All, GPT4All v3.10.0 Vulnerability




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}