Frozen AI News archive

not much happened today

**OpenAI** plans to release its first open-weight language model since **GPT-2** in the coming months, signaling a move towards more open AI development. **DeepSeek** launched its open-source **R1 model** earlier this year, challenging perceptions of China's AI progress. **Gemma 3** has achieved function calling capabilities and ranks on the **Berkeley Function-Calling Leaderboard**, while **GemmaCoder3-12b** improves code reasoning performance on **LiveCodeBench**. **Alibaba_Qwen's Qwen2.5-Omni** introduces a novel Thinker-Talker system and **TMRoPE** for multimodal input understanding. The **TogetherCompute** team achieved **140 TPS** on a 671B parameter model, outperforming **Azure** and **DeepSeek API** on **Nvidia GPUs**. **OpenAI** also expanded **ChatGPT** features with image generation for all free users and a new voice release. **Runway Gen-4** enhances animation for miniature dioramas, and **LangChain** launched a chat-based generative UI agent. Commercial deployment of **Figure 03 humanoid robots** at **BMW** highlights advances in autonomy and manufacturing scaling. New tools include **OpenAI's realtime transcription API** with **WebRTC** support and **Amazon's Nova Act AI browser agent**.

Canonical issue URL

AI News for 3/31/2025-4/1/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (230 channels, and 7148 messages) for you. Estimated reading time saved (at 200wpm): 719 minutes. You can now tag @smol_ai for AINews discussions!

people were mostly smart enough not to launch things on april fools'.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Open Source Models and Releases

Model Performance and Benchmarks

AI Product and Tool Releases & Updates

AI Research and Studies

Hugging Face and Gradio

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. LLM Mathematical Reasoning Limitations

2. DeepMind Research Publication Strategy

3. New Tools and Features for Local LLM Users

4. Novel LLM Research Concepts

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

1. GPT-4o Image Generation Capabilities

2. Claude vs Gemini Competition Heats Up

3. Video Generation Breakthroughs

4. AI Development Tools and Innovations

5. Pixel Art and Retro Graphics AI


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1: OpenAI's Open-Weight Model Sparks Excitement

Theme 2: New AI Models Under the Microscope

Theme 3: Users Vent Over AI Tool Troubles

Theme 4: Open-Source Contributions and Technical Innovations Shine

Theme 5: AI Makes Strides in Law and Healthcare


PART 1: High level Discord summaries

Manus.im Discord Discord


LMArena Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


OpenAI Discord


LM Studio Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


GPU MODE Discord


Latent Space Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


Nous Research AI Discord


Yannick Kilcher Discord


MCP (Glama) Discord


Notebook LM Discord


Torchtune Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


MLOps @Chipro Discord


AI21 Labs (Jamba) Discord


Codeium (Windsurf) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Manus.im Discord ▷ #showcase (1 messages):

Amazing case


Manus.im Discord ▷ #general (753 messages🔥🔥🔥):

Manus credits, Credit system, Pricing Structure, Token-Based System

Links mentioned:


LMArena ▷ #general (977 messages🔥🔥🔥):

Meta Model Safety Downgrades, Decoding 'venom' Prompts, Gemini 2.5 Pro's 'Aliveness', New LMArena models

Links mentioned:


LMArena ▷ #announcements (1 messages):

Alpha Arena updates, Copy Code feature, Image generation, Bug reports

Links mentioned:


Cursor Community ▷ #general (867 messages🔥🔥🔥):

Gemini 2.5 Pro Reasoning, Trial Abuse and Account Flagging, Roo Code Alternatives, Model Context Protocol, AI-Generated KFC Ad

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (256 messages🔥🔥):

Blackwell Support, VLM Training, GRPO Usage, Gemini 2.5 Pro, Training with Unsloth

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (48 messages🔥):

Lightweight Pretraining Techniques, Bonsai pretraining, BitNet training Costs, Qwen Model Rebenched, Exllama2 vs vLLM Inference

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (248 messages🔥🔥):

Orpheus Dataset Issues, Model Evaluation Problems, Gemma 3 Inference Samples, Fine-tuning with PDFs, Vision Fine-tuning with Gemma 3

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (23 messages🔥):

Model Evaluation, Coding benchmarks, Long context benchmarks, Math benchmarks, Gemma 3 vs small LMs


Perplexity AI ▷ #announcements (2 messages):

Discord improvements, Simplified onboarding, Feedback consolidation, Pro channel access


Perplexity AI ▷ #general (544 messages🔥🔥🔥):

Space Instructions limitations, Image generation discontinued?, Apple Intelligence in the EU, Samsung AI vs Apple Intelligence, GPT Omni shortcomings

Links mentioned:


Perplexity AI ▷ #sharing (10 messages🔥):

Code Tracing in Python, AI Accuracy in Reading, API Research


Perplexity AI ▷ #pplx-api (5 messages):

Sonar API Access, Tier 2 Credits, JSON Formatting with Pydantic


OpenAI ▷ #annnouncements (1 messages):

ChatGPT's new voice Monday, voice mode, voice picker


OpenAI ▷ #ai-discussions (314 messages🔥🔥):

Fake ChatGPT Apps, Gemini 2.5 Pro Rate Limits, Image Generation with Ghibli style, ElevenLabs Voice Model, AI and Creative Industries

Links mentioned:


OpenAI ▷ #gpt-4-discussions (24 messages🔥):

Image generation rate limits, copilot experiences, ChatGPT instructions, 4o abilities, future of image model


OpenAI ▷ #prompt-engineering (9 messages🔥):

Custom Instructions in 'About Me' Box, Memory-Stored Prompts, Personalization in Model Responses, Model Pattern Recognition and Formatting


OpenAI ▷ #api-discussions (9 messages🔥):

Custom instructions in 'about me' box, Memory-stored prompts, Model Guessing vs Training, FORMAT_RESET for rigid patterns


LM Studio ▷ #general (198 messages🔥🔥):

eGPU with LM Studio, Gemini 2.5 Pro Evaluation, Gemma 3 27B Performance, Local LLM recommendations, Copilot hurts developer experience

Links mentioned:


LM Studio ▷ #hardware-discussion (63 messages🔥🔥):

Nvidia Drivers instability after 10-12 hours of usage, M4 Max vs 5090 Speed Comparison, Mac vs Nvidia GPUs for LLM, Tenstorrent Wormhole performance on Discord, Context Overflow and Shared Memory impact on LLM speed

Link mentioned: M3 Ultra vs RTX 5090 | The Final Battle: M3 Ultra Mac Studio vs AI beast with NVIDIA RTX 5090Efficient. Productive. Organized. | Baseus Spacemate Series(MAC)11-in-1 Docking StationBuy on Amazon.US: ...


aider (Paul Gauthier) ▷ #general (230 messages🔥🔥):

Gemini 2.5 Pro experiences and limitations, RateLimitError automation strategies, Dot Command Revolution, F#, Video analysis

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (30 messages🔥):

Temperature for coding, Stopping benchmarks, Aider with subdirectories, Aider local config, Model Summarization fails

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (13 messages🔥):

Organizations leave Beta, Web search results in Chatroom, Cerebras on OpenRouter, PDF support for OpenRouter API

Link mentioned: Tweet from OpenRouter (@OpenRouterAI): Today we're taking Organizations out of beta.With Organizations, teams have complete control over data policies and consolidated billing, adding peace of mind across dozens of model providers.Key ...


OpenRouter (Alex Atallah) ▷ #general (98 messages🔥🔥):

Aider OpenRouter Copilot, Gemini Flash 2 Context, Usage Downloads, Enterprise Level Rate Limits, GPT4o Image Generation

Links mentioned:


Eleuther ▷ #general (43 messages🔥):

Cosine Annealing LR, Mini-batch vs Batch, Gradient Accumulation, Stanford CS 25 Transformers Course, Category theory

Links mentioned:


Eleuther ▷ #research (21 messages🔥):

ACL Rebuttals, Deep Sets for Triangle Area, Comparing Language Model Embeddings, Relative Representations, Convergence of Representations in AI

Links mentioned:


Eleuther ▷ #scaling-laws (4 messages):

Learning Rate Impact, Scaling Efficiency, Model Oomph


Eleuther ▷ #interpretability-general (5 messages):

Neuronpedia Open Source, Delphi auto-interp server update, Actionable Interpretability Workshop at ICML 2025, Neuronpedia Datasets

Links mentioned:


Eleuther ▷ #lm-thunderdome (28 messages🔥):

Debugger updates, SmolLM Evaluation Issues, Open LLM Leaderboard Normalization, Subtask Aggregation PR

Links mentioned:


Eleuther ▷ #gpt-neox-dev (5 messages):

GPT-NeoX Pre-training on NVIDIA DGX Cloud, SLURM cluster restrictions, torchrun, DeepSpeed Launch modes

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (70 messages🔥🔥):

CodeScientist, OpenAI open language model, Meta's smart glasses, Multi-subject RLVR

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (24 messages🔥):

Pydantic Evals, Grok solves math, Gemini vs GPT 4.5, MidJourney v6, GPT-4o translation

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rl (4 messages):

KL Penalty in RL, Base Models vs Instruct Models, Reasoning and Reinforcement Learning

Link mentioned: Recent reasoning research: GRPO tweaks, base model RL, and data curation: The papers I endorse as worth reading among a cresting wave of reasoning research.


Interconnects (Nathan Lambert) ▷ #reads (7 messages):

OpenAI returning, Long timelines to advanced AI

Link mentioned: "Long" timelines to advanced AI have gotten crazy short: The prospect of reaching human-level AI in the 2030s should be jarring


GPU MODE ▷ #general (46 messages🔥):

CUDA occupancy, GPU parallel processing, A100 thread limit, GRPO training with Qwen


GPU MODE ▷ #triton (2 messages):

Disable autotune, Triton kernel


GPU MODE ▷ #cuda (2 messages):

Request for PMPP book PDF, PMPP book


GPU MODE ▷ #torch (5 messages):

FlexAttention, Arbitrary Sequence Lengths, PyTorch 2.6, Tensor Subclass Use Case, Memory savings

Link mentioned: Graph break on Tensor._make_subclass · Issue #150265 · pytorch/pytorch: 🐛 Describe the bug I am having the following problem from torch import nn import torch torch_compile_options = { "epilogue_fusion" : True, "max_autotune" : True, "shape_paddi...


GPU MODE ▷ #cool-links (1 messages):

marksaroufim: https://arxiv.org/abs/2503.20313


GPU MODE ▷ #jobs (1 messages):

MLX, Apple hiring, ML systems

Link mentioned: AIML - Software Engineer for MLX, MLR - Jobs - Careers at Apple: Apply for a AIML - Software Engineer for MLX, MLR job at Apple. Read about the role and find out if it’s right for you.


GPU MODE ▷ #beginner (2 messages):

CUDA Program Execution, GPU Volumetric Data Processing


GPU MODE ▷ #off-topic (2 messages):

Egg noodles with chicken and vegetables, Image Analysis with YouTube

Link mentioned: - YouTube: no description found


GPU MODE ▷ #irl-meetup (3 messages):

NYC Meetups, Community Meetup


GPU MODE ▷ #self-promotion (1 messages):

Megatron Tensor Parallelism, Fused/Parallel CE Loss

Link mentioned: Tweet from Daniel Vega-Myhre (@vega_myhre): For any ML folks who want to deepen their understanding of ML scalability & performance techniques, I wrote an illustrated deep-dive into Megatron-style tensor parallelism: https://danielvegamyhre.git...


GPU MODE ▷ #🍿 (1 messages):

AlphaGeometry, LLM for kernel optimization


GPU MODE ▷ #reasoning-gym (9 messages🔥):

OpenAI Open-Weight Reasoning Models, PR Review Requests, Arc AGI PR, Collisions PR, CodeIO Dataset Merged


GPU MODE ▷ #general (3 messages):

.py scripts vs .cu files, active python leaderboards


GPU MODE ▷ #submissions (17 messages🔥):

vectorsum, conv2d, vectoradd, matmul, grayscale


Latent Space ▷ #ai-general-chat (75 messages🔥🔥):

Cursor's Funding Round, Etched's New Transformer ASIC, OpenAI's New Open-Weight Language Model, OpenDeepSearch (ODS), Sophont: Open Multimodal Foundation Models for Healthcare

Links mentioned:


HuggingFace ▷ #general (42 messages🔥):

DeepSeek R1, xAI Acquires X, Hyperparameter tuning LLMs, SFTTrainer hanging, stable_baselines3 CPU faster than GPU

Links mentioned:


HuggingFace ▷ #today-im-learning (3 messages):

Agents Course Unit 2.1, Run Jupyter Lab Locally, RL Course Frozen Lake issue

Link mentioned: What are LLMs? - Hugging Face Agents Course: no description found


HuggingFace ▷ #cool-finds (3 messages):

OpenHands LM, Autonomous Agents, Nature article on data access

Links mentioned:


HuggingFace ▷ #i-made-this (1 messages):

tonic_1: very cool


HuggingFace ▷ #computer-vision (1 messages):

YOLO vertical object detection, CNN vertical object detection, Instance segmentation fragments


HuggingFace ▷ #gradio-announcements (2 messages):

Gradio Milestone, Million monthly active developers


HuggingFace ▷ #agents-course (16 messages🔥):

OpenAIServerModel with Ollama, Langraph OpenAI API model alternatives, Release of Unit 3


HuggingFace ▷ #open-r1 (1 messages):

Liger Kernel, GPU Memory Occupation, Speed vs. Memory Trade-off


Modular (Mojo 🔥) ▷ #general (9 messages🔥):

MAX 25.2 livestream, Chris lightning talk, GTC Chris video

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (59 messages🔥🔥):

Compiler Bug, Enums, Flex Attention, Float to String Algorithm, FlashAttention-2 in Mojo

Links mentioned:


Nous Research AI ▷ #general (40 messages🔥):

OpenAI API, Midjourney New Research, Sam Altman open-weight language model, Psyche p2p, Anthropic Insights on LLMs

Links mentioned:


Nous Research AI ▷ #ask-about-llms (8 messages🔥):

DeepHermes Reasoning, Structured Output with Langchain, DeepHermes AI, Tool Calling with Reasoning


Nous Research AI ▷ #research-papers (2 messages):

Project Loong Release, Synthetic Data Generation

Link mentioned: Tweet from CAMEL-AI.org (@CamelAIOrg): Introducing Project Loong 🐉Blog: https://camel-ai.org/blogs/project-loong-synthetic-data-at-scale-through-verifiers…• Our structured approach to generating and validating synthetic data for enhanced ...


Nous Research AI ▷ #interesting-links (10 messages🔥):

Nous Research Portal Git Repo, X Link Removal, Contributing to Nous Research, Google's Style Guide

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

Project Loong, Synthetic Data Generation, Model Performance Enhancement

Link mentioned: Tweet from CAMEL-AI.org (@CamelAIOrg): Introducing Project Loong 🐉Blog: https://camel-ai.org/blogs/project-loong-synthetic-data-at-scale-through-verifiers…• Our structured approach to generating and validating synthetic data for enhanced ...


Yannick Kilcher ▷ #general (35 messages🔥):

Graph Learning Evolution, AI/ML Job Impact, RLHF Alignment and Nerfed Models, Gemini 2.5 Pro Math Abilities, Dream Journaling App

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (5 messages):

RLHF, Reward Hacking, Response Diversity, Reasoning Task Verifiers, Generative Reward Model

Link mentioned: Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback: Reinforcement Learning from Human Feedback (RLHF) is crucial for aligning large language models with human preferences. While recent research has focused on algorithmic improvements, the importance of...


Yannick Kilcher ▷ #ml-news (20 messages🔥):

AI Dog Chasing Tail, AI Model Feedback, Runway Relevance, OpenAI Model Release Speculation, GPT-3.5 vs Thinking Models

Link mentioned: Tweet from Salma (@Salmaaboukarr): I'm blown away!😱 This KFC concept ad is 100% AI generated!My friend David Blagojevic (he's not on X) created this ad concept for KFC and it's incredible! Tools used: Runway, Pika, Kling...


MCP (Glama) ▷ #general (38 messages🔥):

MCP RBAC Implementation, Docker alternatives, MCP server for webapp, VirusTotal Integration, MCP for make.com or n8n cloud

Links mentioned:


MCP (Glama) ▷ #showcase (13 messages🔥):

ActivePieces drops MCP support, MCP Autotest Tool, MCP Weekly Newsletter, Playwrite MCP server with Smithery, MCP synchronous limitations

Links mentioned:


Notebook LM ▷ #announcements (1 messages):

Webby Awards, Voting, NotebookLM nominations

Link mentioned: Vote for the best of the internet: I just voted in The Webby People's Voice Awards and checked my voter registration.


Notebook LM ▷ #use-cases (9 messages🔥):

Google Tasks integration with NotebookLM, Archiving notebooks in NotebookLM, Sharing sources on different notes in NotebookLM


Notebook LM ▷ #general (39 messages🔥):

Timestamped sections on the todo list, NotebookLM to Gemini 2.5 Pro, Conversation ending early, Limit the total number of words, not the number of sources?, Maths notation in NLM is very hard to read


Torchtune ▷ #general (11 messages🔥):

Torchtune office hours, Discord timezone handling

Link mentioned: Brain Brain Meme GIF - Brain Brain meme Big brain - Discover & Share GIFs: Click to view the GIF


Torchtune ▷ #dev (16 messages🔥):

PR #2441 Review, Regression Testing for PR #2477, Qwen Model Upload, S3 Bucket Hookup Issues, PR #2510

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (15 messages🔥):

ImageDtype and IMAGE env, tinygrad BEAM Performance, Mobile GPUs and ImageDType, arange() optimization

Links mentioned:


LlamaIndex ▷ #blog (1 messages):

LLM Agents for Technical Documentation, Structured Extraction from Complex Documents


LlamaIndex ▷ #general (6 messages):

ReAct Agents, Local Models via Ollama, OpenAI Rate Limit Errors, Embedding Models, Query Engines

Link mentioned: Agentic-Chat-RAG/agent_utils.py at jake-dev · JakeFurtaw/Agentic-Chat-RAG: Uses a Gradio interface to stream coding related responses from local models. Can be used in Chat Mode or Agent Mode. - JakeFurtaw/Agentic-Chat-RAG


Nomic.ai (GPT4All) ▷ #general (7 messages):

Official Translations, Llama3 8B instruct model, .bin vs .gguf

Link mentioned: Home: GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. - nomic-ai/gpt4all


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (4 messages):

Quizzes, Completion based


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

LLM Agents Cookbook, Llama 3

Link mentioned: Llama3 Cookbook - LlamaIndex: no description found


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 messages):

DeepSeek-R1, Reinforcement Learning, Chains-of-Thought, Project Loong

Link mentioned: 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers: Project Loong is a collaborative effort lead by CAMEL-AI to explore Long CoTs data generation through verifiers at scale.


Cohere ▷ #「💬」general (3 messages):

Command A issues, Rem dream journaling app

Links mentioned:


Cohere ▷ #「🤝」introductions (2 messages):

Introductions, Community growth, User interests, Networking


MLOps @Chipro ▷ #events (1 messages):

AI in Legislation, Legalese Decoder, SVCAF's AI4Legislation competition

Links mentioned:


MLOps @Chipro ▷ #general-ml (1 messages):

smartinez.ai: I think you can ask Joe


AI21 Labs (Jamba) ▷ #general-chat (2 messages):

Language Use


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Sounds, Auditory UX, Windsurf Next Beta

Links mentioned:


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

io_uring.h, v0 openfunctions dataset, v1 dataset


{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}