Frozen AI News archive

not much happened today

**Weights and Biases** announced a **$1.7 billion acquisition by CoreWeave** ahead of CoreWeave's IPO. **CohereForAI** released the **Aya Vision models (8B and 32B parameters)** supporting **23 languages**, outperforming larger models like **Llama-3.2 90B Vision** and **Molmo 72B**. **Microsoft** introduced **Phi-4-Mini (3.8B parameters)** and **Phi-4-Multimodal models**, excelling in math, coding, and multimodal benchmarks. **CogView4**, a **6B parameter text-to-image model** with **2048x2048 resolution** and Apache 2.0 license, was released. **Alibaba** launched **Wan 2.1**, an open-source video generation model with **720p output** and **16 fps generation**. **Google** announced new AI features for Pixel devices including **Scam Detection** and **Gemini integrations**. **LlamaCloud** reached **General Availability** and raised **$19M Series A funding**, serving over **100 Fortune 500 companies**. **Weaviate** launched the **Query Agent**, the first of three Weaviate Agents.

AI News for 3/4/2025-3/5/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (227 channels and 2895 messages) for you. Estimated reading time saved (at 200 wpm): 327 minutes. You can now tag @smol_ai for AINews discussions!

Congrats to Weights and Biases on their $1.7b acquisition by the soon-to-IPO CoreWeave.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Model Releases and Updates

Company and Product Announcements

Research and Papers

Tools and Frameworks

Performance and Benchmarks

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Qwen 32b Coder instruct improvements drive agent capabilities

Theme 2. NVIDIA GeForce RTX 4090 with 96GB VRAM for AI Workloads

Theme 3. DiffRhythm: Fast Song Generation with Diffusion Models

Theme 4. C4AI Aya Vision vs Qwen2.5 72B Model

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. CogView4 Release: Open-Source Text-to-Image Breakthrough

Theme 2. GPT as Therapy: A New Mental Health Resource

Theme 3. Sonnet 3.7 Criticized for Overengineering and Complexity

Theme 4. Meta's AI Mind-Reading Breakthrough: 80% Accuracy in Focus


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. Big Model Moves and Fine-Tuning Feats

Theme 2. Tooling Hype: Agents, ReAct, and RAG

Theme 3. Performance Woes and HPC Triumphs

Theme 4. Business Stirs: Billion-Dollar Deals and Subscription Gripes

Theme 5. Specialized Applications: Agents, Ethics, and Stock Market Insights


PART 1: High level Discord summaries

Cursor IDE Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


Codeium (Windsurf) Discord


aider (Paul Gauthier) Discord


LM Studio Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


Interconnects (Nathan Lambert) Discord


Nous Research AI Discord


Notebook LM Discord


Cohere Discord


Stability.ai (Stable Diffusion) Discord


MCP (Glama) Discord


Torchtune Discord


LlamaIndex Discord


Eleuther Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (608 messages🔥🔥🔥):

Cursor's Voice Output, Claude 3.7 Performance Issues, Windsurf as an Alternative, MCP Tools with o3-mini, Repo Prompt vs Grok 3 Web


Unsloth AI (Daniel Han) ▷ #general (204 messages🔥🔥):

Deepseek Unsloth with MIG instance GPU, 4bit merging error with 24b Mistral model, Unsloth and Phi-4-mini-instruct compatibility, Running models using GGUF format and Koboldcpp, Weekly Google Meet for Unsloth tips and best practices


Unsloth AI (Daniel Han) ▷ #off-topic (45 messages🔥):

Triton Community, Immutable Linux distros, New AI jobs for programmers


Unsloth AI (Daniel Han) ▷ #help (50 messages🔥):

Mistral BOS tokens, Torch 2.4 and Python 3.12, GRPO on Llama3-8b, Fine-tuning BERT, Export to Ollama on Windows

Link mentioned: All Our Models | Unsloth Documentation: no description found


Unsloth AI (Daniel Han) ▷ #showcase (33 messages🔥):

Llama 3.1 8B Fine-tuning, GPT4o-mini for Synthetic Data Generation, Training Data Generation Tricks, KoloLLM on Ollama, Fine-tuning vs. RAG


Unsloth AI (Daniel Han) ▷ #research (21 messages🔥):

GRPO Method Comparison, DeepSeek r1 Release, Bespoke Curator for Synthetic Data, Triton GPU Kernels, Unsloth Video Sessions

Links mentioned:

Everyone in the LLM space knows that name now, and for good reason; after a little over a year of their team quietly iterating on architecture, ...

GRPO Judge Experiments: Findings & Empirical Observations: Many GRPO reproductions for LLM reinforcement learning available online lack useful intuitions or recommendations regarding hyperparameters and reward shapi...

Scribe | Create Step-by-Step Guides — Fast.: Scribe documents your processes for you. Build visual guides with text, links and screenshots instantly.


Perplexity AI ▷ #general (314 messages🔥🔥):

Android vs iOS features, Perplexity Pro pricing issues, Broken UI, Augment Code, Deep Research in Perplexity AI


Perplexity AI ▷ #sharing (5 messages):

Explain Gen, Apple Air Product Teased, TSMC Investment Plans, Investment Thesis on, Reasoning LM SOTA and Future


Perplexity AI ▷ #pplx-api (9 messages🔥):

Perplexity AI Pricing, Obsidian Web Clipper integration, Perplexity API JSON output issues

Link mentioned: Integration of Perplexity AI's LLMs into Obsidian Web Clipper · Issue #376 · obsidianmd/obsidian-clipper: Integrating Perplexity AI's Large Language Models (LLMs) into the Obsidian Web Clipper would be highly beneficial. Contrary to the other models currently usable in Obsidian Web Clipper, Perplexity...


Codeium (Windsurf) ▷ #discussion (2 messages):

WPF Extension Issues, Visual Studio, Xcode Extension Error


Codeium (Windsurf) ▷ #windsurf (230 messages🔥🔥):

Windsurf font size adjustment, Flex credits pricing justification, Windsurf performance issues, Flow Credits Consumption with Claude 3.7, Team-based Cascade/Windsurf implementation


aider (Paul Gauthier) ▷ #general (153 messages🔥🔥):

Claude API understanding, Groq pricing and performance, Aider and git practice, Sonnet 3.7 tuning, Aider with Zed


aider (Paul Gauthier) ▷ #questions-and-tips (69 messages🔥🔥):

Less smart models can be used by Architect, Aider update API key, Custom model ctxlen cost, Ericsson CodeChecker added in project, Set global settings to Aider


aider (Paul Gauthier) ▷ #links (2 messages):

Karpathy inspires voice AI, Claude 3.7 Sonnet, Claude Code Agent

Link mentioned: GPT-4.5 FLOP? Claude 3.7 Sonnet STARTER PACK. What is Claude Code REALLY?: 🔥 GPT-4.5 maybe the biggest FLOP. MEANWHILE Claude 3.7 Sonnet and Claude Code just CHANGED THE GAME! 🤯Anthropic's Claude Code maybe the greatest AI Agent T...


LM Studio ▷ #general (67 messages🔥🔥):

LM Studio CLI, Reporting LM Studio Vulnerabilities, LM Studio and Home Assistant, GPT-4o error loading, Model Settings


LM Studio ▷ #hardware-discussion (50 messages🔥):

4090 with 48GB VRAM, Pristine Datasets, NPU Support in LM Studio, Intel ARC iGPU with LM Studio, Model offloading to CPU/RAM


HuggingFace ▷ #general (39 messages🔥):

Anthropic raises $3.5 billion, Hugging Face LLM as a local microservice, Qwen2.5-Coder models, Bengali Text Romanization, Whisper Japanese models


HuggingFace ▷ #today-im-learning (1 message):

ericmaubr: Today I saw my first video on RLHF. https://www.youtube.com/watch?v=2MBJOuVq380


HuggingFace ▷ #i-made-this (10 messages🔥):

AWQ conversion of InternVL 2.5, Ukrainian Text-to-Speech, Bias in LLMs, Multi-Agent Approach to AGI, Working Embedders


HuggingFace ▷ #computer-vision (4 messages):

ViT CLS token, DETR from scratch, GAP vs CLS, DETR inference

Link mentioned: GitHub - dimiz51/DetectionTransformer-DETR-PyTorch: This project is an implementation of the Detection Transformer (DETR) for state-of-the-art object detection using the well-known Transformer architecture. Using this project you can easily fine-tune and test DETR on your own dataset following the included notebook guide.


HuggingFace ▷ #NLP (5 messages):

Text Extraction with SpaCy, Hosting Phi-4 with Ollama and FastAPI, Accelerate CLI memory estimation

Link mentioned: Model memory estimator: no description found


HuggingFace ▷ #agents-course (50 messages🔥):

Gradio App Internals, LlamaIndex Notebook Error, SmolAgents vs SmolTools, Payment Required Issue, Deep Reinforcement Learning Resources


OpenRouter (Alex Atallah) ▷ #announcements (1 message):

BYOK Errors, API Key Issues


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Price Estimates, Total Price Combinations


OpenRouter (Alex Atallah) ▷ #general (105 messages🔥🔥):

OpenRouter Provider Routing, Strongest AI For bookmark processing, Flash 2.0 vs GPT-4o-mini, Inception AI Diffusion Models, Anthropic 502 Overload Errors


Modular (Mojo 🔥) ▷ #general (43 messages🔥):

Mojo vs C++ vs Rust similarities, Mojo's design perspective drawbacks, Python Superset in Mojo, Concurrency and Sum Types in Mojo, Python's API vs Rust's API consistency


Modular (Mojo 🔥) ▷ #mojo (15 messages🔥):

Commandline argument conversion to int in Mojo, Mojo is operator for identity checks, Tensor addition operation removal, Mojo code build error: missing libtinfo


Yannick Kilcher ▷ #general (45 messages🔥):

AI Timelines, Differential Transformers, Numerical Stability in Softmax, Bilevel Optimization, Grokking Phenomenon


Yannick Kilcher ▷ #paper-discussion (6 messages):

DDPM, CLIP, MLST, GNN, Graph Attention Networks (GATs)

Link mentioned: Graph Attention Networks: no description found


Interconnects (Nathan Lambert) ▷ #news (25 messages🔥):

NextGenAI Funding, Thinking Machines hires PyTorch engineer, Amazon Nova reasoning model, OpenAI Credit Plans, CoreWeave to Acquire Weights & Biases


Interconnects (Nathan Lambert) ▷ #ml-questions (1 message):

saturatedfat: inspect the tex if its on arxiv


Interconnects (Nathan Lambert) ▷ #ml-drama (1 message):

catboy_slimmer: usually people who behave like schmid are present but somewhat less prominent


Interconnects (Nathan Lambert) ▷ #random (11 messages🔥):

CogView4-6B, Anthropic vs OpenAI models, Aya Vision, LLM Summarization ethics, Meeting transcription summarization


Interconnects (Nathan Lambert) ▷ #memes (3 messages):

Claude Code, GitHub Issues, System Bricked

Link mentioned: Command bricked system · Issue #168 · anthropics/claude-code: Ubuntu 24.02 server To automate claude updates it advises to run sudo chown -R $USER:$(id -gn) /usr && sudo chmod -R u+w /usr This bricked the sudo system to no longer work and had to attach a...


Interconnects (Nathan Lambert) ▷ #rl (2 messages):

MSR Health Futures, image based multi-model work, NLP folks in healthcare


Interconnects (Nathan Lambert) ▷ #reads (5 messages):

Qwen vs Llama Self Improvement, ARC-AGI, Inference Paradigms


Interconnects (Nathan Lambert) ▷ #posts (1 message):

saturatedfat: its nice ! been a minute


Nous Research AI ▷ #general (31 messages🔥):

LCPP parallel decoding, Granite 3.1 3B for tool calling, Speculative decoding with Llama 3.2 1B, Grokking in LLMs, Streaming with tool-calling


Nous Research AI ▷ #research-papers (1 message):

WorldSim blog or paper, Future of WorldSim


Nous Research AI ▷ #interesting-links (2 messages):

Agentic Memory, Zettelkasten, anon-kode, Claude Code


Nous Research AI ▷ #research-papers (1 message):

WorldSim blog post, WorldSim paper


Notebook LM ▷ #use-cases (9 messages🔥):

Audio transcription, NotebookLM use cases, Linear integrated circuits, Medical school, Font updates


Notebook LM ▷ #general (26 messages🔥):

NotebookLM API, NotebookLM audio overview TTS, Google Doc sync, Podcast generation copyright, Store prompts in Chat-Bot


Cohere ▷ #「💬」general (33 messages🔥):

Cohere Website Design, Aya Vision Models, Cohere Sales Team, Leveling Bots


Cohere ▷ #「🤝」introductions (1 message):

Introductions, Community Guidelines, AI Projects, Favorite Tech Tools


Cohere ▷ #【🟢】status-updates (1 message):

competent: # SOON


Stability.ai (Stable Diffusion) ▷ #general-chat (27 messages🔥):

Automatic1111 vs Linux, Pykaso AI Website, AMD card, Regional Prompter, Stable Diffusion


MCP (Glama) ▷ #general (26 messages🔥):

MCP server claim issues, Twitter API access and costs, MCP and cursor for visual assets, Installing sequential thinking MCP server, Tool usage in roo and cursor


Torchtune ▷ #general (9 messages🔥):

Torchtune Checkpointing, Step Based Checkpoint, Attention Masking vs Label Masking, Custom Special Tokens JSON


Torchtune ▷ #dev (14 messages🔥):

MPS/Metal support for AO, Manual tests for MPS, Saving completed tunes

Link mentioned: torchchat/docs/quantization.md at main · pytorch/torchchat: Run PyTorch LLMs locally on servers, desktop and mobile - pytorch/torchchat


LlamaIndex ▷ #blog (2 messages):

LlamaCloud GA, HuggingFace course, LLM-powered agents, LlamaIndex toolkit

Link mentioned: Introduction to LlamaIndex - Hugging Face Agents Course: no description found


LlamaIndex ▷ #general (16 messages🔥):

DeepSeek API issue, LLM.txt documentation files, Unintentional long output


LlamaIndex ▷ #ai-discussion (2 messages):

Windsurf Checkpoints, LlamaIndex vs Landing AI


Eleuther ▷ #general (2 messages):

Indian Reasoning, Foundational Models, Law, ChatGPT training limitations, Fine-tuning


Eleuther ▷ #research (1 message):

Modded-nanogpt Adam-Matching Scaling, Kimi paper scaling multipliers, QKVO Matrices

Link mentioned: modded-nanogpt/records/101024_Muon/eb5659d0-fb6a-49e5-a311-f1f89412f726.txt at master · KellerJordan/modded-nanogpt: NanoGPT (124M) in 3 minutes. Contribute to KellerJordan/modded-nanogpt development by creating an account on GitHub.


Eleuther ▷ #lm-thunderdome (11 messages🔥):

lm-evaluation-harness, dataset_kwargs, datasets.load_dataset, Huggingface load_datasets, Llama 3 eval recipe


DSPy ▷ #general (12 messages🔥):

ReAct Agents, OctoTools, Agentic Reward Modeling, MinionsLM, dspygen


Nomic.ai (GPT4All) ▷ #general (6 messages):

GPT4All vs Ollama, RAG support in GPT4All, Model comparison: tiny model vs Llama3-8b with LocalDocs


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Server maintenance, Sponsors zone activity


MLOps @Chipro ▷ #events (1 message):

AI Stock Market Agent, AI in Finance, AI Investment Buddy, AI Real-World Success

Link mentioned: Build your AI Stock Market Agent · Zoom · Luma: Ever wondered how AI could help you make smarter investment decisions? Join us for an exciting, beginner-friendly workshop that combines the world of…




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share it with a friend! Thanks in advance!

{% endif %}