Frozen AI News archive

not much happened today

**GPT-4o** was praised for its improved coding, instruction following, and freedom, becoming the leading non-reasoning coding model, surpassing **DeepSeek V3** and **Claude 3.7 Sonnet** in coding benchmarks, though it still lags behind reasoning models like **o3-mini**. Concerns about policy compliance in image generation were noted, with efforts to improve adherence. **Gemini 2.5 Pro** was highlighted for its advanced audio and video understanding, long context capabilities, and integration with platforms like **Cursor AI** and **Windsurf AI**. AI infrastructure developments include a partnership between **Together AI** and **Hypertec Group** to deliver large-scale GPU clusters, and **CoreWeave's IPO** was celebrated for advancing AI infrastructure. GPU and TPU usage is expected to increase significantly. *"GPT-4o's transparency and background generation feature"* and *"Gemini 2.5 Pro scored above 50% on Simple-Bench AI Explanation"* were key highlights.

Canonical issue URL

AI News for 3/27/2025-3/28/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (230 channels, and 13422 messages) for you. Estimated reading time saved (at 200wpm): 1217 minutes. You can now tag @smol_ai for AINews discussions!

We soft-launched the 2025 State of AI Engineering survey today. Fill it out to enter our $1000 Amazon gift card raffle and have your voice heard in the State of AI Eng!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Here's a summary of the tweets, organized by topic:

GPT-4o Model Performance and Features

Gemini 2.5 Pro Model Performance and Capabilities

AI Infrastructure and Compute

AI Engineering and Development

Company and Product Announcements

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Reverse Engineering GPT-4o: Architectural Insights and Speculations

Theme 2. MegaTTS3's Voice Cloning: Skepticism and Security Concerns

Theme 3. Qwen-2.5-72b: Leading the Open-Source OCR Revolution

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

our pipelines are down...


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. GPT-4o Dominates Leaderboards and Sparks Debate

Theme 2. DeepSeek V3 and Qwen2.5-Omni Emerge as Strong Contenders

Theme 3. Infrastructure Woes and User Frustrations Plague AI Platforms

Theme 4. Tools and Techniques for Enhanced AI Development Emerge

Theme 5. Ethical Considerations and AI Safety Remain Central


PART 1: High level Discord summaries

Manus.im Discord


Perplexity AI Discord


Cursor Community Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


MCP (Glama) Discord


aider (Paul Gauthier) Discord


Latent Space Discord


LM Studio Discord


Eleuther Discord


GPU MODE Discord


Yannick Kilcher Discord


Interconnects (Nathan Lambert) Discord


Torchtune Discord


Nous Research AI Discord


HuggingFace Discord


Notebook LM Discord


LlamaIndex Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Codeium (Windsurf) Discord


Modular (Mojo 🔥) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Manus.im Discord ▷ #general (627 messages🔥🔥🔥):

Manus new Credit system Feedback, Alternative Energy for Manus GPU Farm, Cheaper AI Models like DeepSeek and Qwen, Manus AI assistance for Exams, Manus UI Love

Links mentioned:


Perplexity AI ▷ #general (1219 messages🔥🔥🔥):

Perplexity AI outages, DeepSeek AI, Claude AI, User Frustrations, T-Mobile Promo

Links mentioned:


Perplexity AI ▷ #sharing (10 messages🔥):

Shareable threads, Super Prompt, LLM Research


Perplexity AI ▷ #pplx-api (7 messages):

API Parameter Error Handling, LlamaIndex RAG context with Perplexity Sonar, Deep Research Parity API vs Web


Cursor Community ▷ #general (1251 messages🔥🔥🔥):

Gemini 2.5 Pro Pricing, Cursor infrastructure, Humanoid robots?, Codebase tag removed from cursor

Links mentioned:


LMArena ▷ #general (906 messages🔥🔥🔥):

O1 Pro drop, GPT-4o latest Benchmarks, DeepSeek V3, Meta Llama, AI Safety

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (580 messages🔥🔥🔥):

ElevenLabs Scribe V1 for audio event classification, OlmOCR loading in Unsloth, Fine-tuning LLMs for board games, Gemma 3 notebook quirks, Qwen Omni Hacking

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (1 message):


Unsloth AI (Daniel Han) ▷ #help (68 messages🔥🔥):

Training Loss Interpretation, Gemma & Task Difficulty, Dataset Size & Overfitting, LM Studio Models, HF Upload & vLLM

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

Orpheus-TTS, Voice Model Finetuning, UnslothAI

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

Dynamic Quantization, DeepSeek-R1, ACDiT

Links mentioned:


OpenAI ▷ #ai-discussions (305 messages🔥🔥):

Gemini 2.5 Pro vs GPT-4o, Google AI Studio, Perplexity for News and Current Events, Claude vs GPT for Reasoning, AI Transcription Tools


OpenAI ▷ #gpt-4-discussions (8 messages🔥):

Image generator, GPT-4.5 Error, GPT models for summarization, AI voice chatbot


OpenAI ▷ #prompt-engineering (83 messages🔥🔥):

Yu-Gi-Oh! card art prompting, Microsoft PromptWizard, ChatGPT prompting methods, Hierarchical communication with markdown, AI prompt engineering


OpenAI ▷ #api-discussions (83 messages🔥🔥):

Yu-Gi-Oh! card art prompting, Microsoft PromptWizard, ChatGPT prompting tips, Hierarchical communication with markdown, GPTs in conversation


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Fount AI Character Interactions Framework, Gideon project

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (327 messages🔥🔥):

Gemini 2.5 Pro Access and Limitations, OpenRouter AI SDK Configuration, Free Models with Function Calling, Token Per Second Performance for Coding Models, OpenAI Responses API

Links mentioned:


MCP (Glama) ▷ #general (299 messages🔥🔥):

MCP server config, Prompts and ICL, Ollama models and MCP, Google search integration, Oterm client and MCP

Links mentioned:


MCP (Glama) ▷ #showcase (9 messages🔥):

Canvas MCP, Docker Compose for MCP Servers, Model Context Protocol (MCP) Explanation, Speech MCP, Gradescope Integration

Links mentioned:


aider (Paul Gauthier) ▷ #general (216 messages🔥🔥):

R1 vs O3 Mini, Anthropic Thoughts Microscope, GPT-4o Update, OpenRouter Limits, Running Local Aider Branch with UV

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (31 messages🔥):

AiderMacs, Cargo Build Integration, Gemini 2.5 Pro Rate Limits, Aider Architect Mode, Model Combinations

Links mentioned:


Latent Space ▷ #ai-general-chat (26 messages🔥):

GPT-4o Update, OpenAI Image Generation Policy, Devin Wiki Launch, AI Writing Editing

Links mentioned:


Latent Space ▷ #ai-announcements (3 messages):

Dharmesh Shah, HubSpot, Agent.ai, hybrid teams, Claude Plays Pokemon hackathon

Link mentioned: The Agent Network — Dharmesh Shah: Dharmesh Shah on Intelligent Agents, Market Inefficiencies, and Building the Next AI Marketplace


Latent Space ▷ #ai-in-action-club (189 messages🔥🔥):

LLM Codegen Workflow, Documentation for LLMs, Memory-Ref Tool, Cursor IDE, Self-Improving Agents

Links mentioned:


LM Studio ▷ #announcements (1 message):

LM Studio 0.3.14 Release, Multi-GPU Controls, GPU Management Features, Beta Releases, Advanced GPU Controls

Links mentioned:


LM Studio ▷ #general (74 messages🔥🔥):

Threadripper vs EPYC, LM Studio UI, Visualize LLM calculations, Model details error in LM Studio, Continue VSCode extension

Links mentioned:


LM Studio ▷ #hardware-discussion (71 messages🔥🔥):

ROCm Support, P100 vs 6750xt, Nvidia vs AMD, Mac Pro 2013 for LLMs


Eleuther ▷ #general (23 messages🔥):

transformer storage errors, torchtune use cases, self-awareness in language models, bias-augmented consistency training (BCT), adaptive compression + intelligent routing for distributed systems

Link mentioned: Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought: While chain-of-thought prompting (CoT) has the potential to improve the explainability of language model reasoning, it can systematically misrepresent the factors influencing models' behavior--for...


Eleuther ▷ #research (2 messages):

Architectural inductive biases, Neural-guided CoT, Reasoning-adjacent work


Eleuther ▷ #interpretability-general (83 messages🔥🔥):

Neural Networks as Bodies Without Organs (BwO), Mechanistic Interpretability (Mech Interp) Critique, Specialized Heads in Neural Networks, The Hydra Effect, Reasoning Models for AI Safety

Links mentioned:


Eleuther ▷ #lm-thunderdome (19 messages🔥):

MMLU pro dataset path, MMLU pro process_doc function, MMLU pro eval modifications, MMLU pro COT content, LM harness selecting dataset

Links mentioned:


Eleuther ▷ #gpt-neox-dev (2 messages):

Dependency Issue, Test Understanding


GPU MODE ▷ #triton (9 messages🔥):

local tensor element repetition, torch.Tensor.expand() porting to triton, tl.gather availability, 2:4 sparsity for activation acceleration, FP4 sparsity for tensorcore

Link mentioned: Accelerating Transformer Inference and Training with 2:4 Activation Sparsity: In this paper, we demonstrate how to leverage 2:4 sparsity, a popular hardware-accelerated GPU sparsity pattern, to activations to accelerate large language model training and inference. Crucially we ...


GPU MODE ▷ #cuda (4 messages):

CUDA Profiling, Nsight Compute, Nvidia's Profiling Software


GPU MODE ▷ #torch (1 message):

PyTorch Profiler, save calls, detach calls, copy calls


GPU MODE ▷ #jobs (3 messages):

Red Hat, Software Engineer, C++, GPU kernels, CUDA


GPU MODE ▷ #pmpp-book (1 message):

PMPP 4th edition errata, Fig 5.2 error


GPU MODE ▷ #off-topic (5 messages):

Miyazaki AI Art Scolding, AI Art Ethics, Studio Ghibli AI Art

Link mentioned: Tweet from Nuberodesign (@nuberodesign): Since this utter garbage is trending, we should take a look at what Hayao Miyazaki, the founder of Studio Ghibli, said about machine created art. Quoting Grant Slatton (@GrantSlatton) tremendous alpha ...


GPU MODE ▷ #triton-puzzles (7 messages):

Triton Puzzle 12, tl.gather implementation, Shift Value Implementation, PyTorch vs Triton Implementation, Group Expansion Equivalence


GPU MODE ▷ #metal (10 messages🔥):

Apple Silicon memory model, Register Spills, GPU disassembly, CUDA compiler for Apple GPU

Link mentioned: GitHub - dougallj/applegpu: Apple G13 GPU architecture docs and tools: Apple G13 GPU architecture docs and tools. Contribute to dougallj/applegpu development by creating an account on GitHub.


GPU MODE ▷ #reasoning-gym (2 messages):

Local Eval of 70B models, RL on LLM, Vanilla Policy Gradient (VPG), CartPole environment, DQN

Link mentioned: AI-Playground/rl-from-scratch/VPG-from-scratch.ipynb at main · Adefioye/AI-Playground: Contribute to Adefioye/AI-Playground development by creating an account on GitHub.


GPU MODE ▷ #gpu模式 (1 message):

nuttt233: Because in batch GEMM the first two dimensions are treated as batch strides by default, and only the last two dimensions are row and col.
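The layout convention in that message can be illustrated with a minimal sketch. The message does not say which library is being discussed, so this assumes a dense, row-major 4-D operand; the helper names `contiguous_strides` and `split_batched_gemm_layout` are hypothetical and only serve the illustration.

```python
def contiguous_strides(shape):
    """Row-major (C-order) strides, in elements, for a dense tensor."""
    strides = [1] * len(shape)
    # Walk from the second-to-last dimension back to the first.
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    return tuple(strides)


def split_batched_gemm_layout(shape):
    """Split a 4-D operand into batch dims and (row, col) matrix dims.

    Assumption for this sketch: the two leading dimensions index the
    batch, and the two trailing dimensions are the matrix itself.
    """
    if len(shape) != 4:
        raise ValueError("this sketch assumes a 4-D operand")
    strides = contiguous_strides(shape)
    return {
        "batch_shape": tuple(shape[:2]),
        "batch_strides": strides[:2],
        "matrix_shape": tuple(shape[2:]),
        "matrix_strides": strides[2:],
    }


# Six 4x5 matrices arranged as a 2x3 batch grid.
layout = split_batched_gemm_layout((2, 3, 4, 5))
print(layout)
```

Under this assumption, stepping along either of the first two dimensions jumps to a different matrix in the batch, while the last two dimensions move within a single matrix.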


GPU MODE ▷ #general (2 messages):

.cu file upload errors, CUDA inline fix, Leaderboard submissions

Link mentioned: reference-kernels/problems/pmpp/vectoradd_py/solutions/correct/submission_cuda_inline.py at main · gpu-mode/reference-kernels: Reference Kernels for the Leaderboard. Contribute to gpu-mode/reference-kernels development by creating an account on GitHub.


GPU MODE ▷ #submissions (66 messages🔥🔥):

Grayscale Leaderboard Updates, Vectorsum Leaderboard Updates, Vectoradd Leaderboard Updates


Yannick Kilcher ▷ #general (54 messages🔥):

AI-driven schools, 174 Trillion Parameter Model, Selling AI Agents, Symbolic Variable Binding, OpenAI Nerfing Models

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (20 messages🔥):

Anthropic's Tracing Thoughts, Transformer Circuits Pub Updates, Rolling Diffusion, Erdős, Selfridge, and Strauss N! Product

Links mentioned:


Yannick Kilcher ▷ #ml-news (22 messages🔥):

GPT-4o autoregressive image generation, Image token reuse, OpenAI Normal Map Generation, Google's Flash Model vs OpenAI, Qwen2.5-Omni multimodal model

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (19 messages🔥):

GPT-4o Update, Anthropic's Economic Index, Softmax Organic Alignment, Musk's xAI acquires X

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (8 messages🔥):

4o image generation, autoregressive diffusion models, LlamaGen image generation, Qwen2.5-Omni multimodal model

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (49 messages🔥):

Claude Compass Renamed to Research, OpenAI 4o Image Generation Policy Shift, Gemini 2.5 Pro Crushes Wordle, Allen AI's Ai2 PaperFinder, Claude Reward Hacking

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (16 messages🔥):

White House Ghibli Tweet Deletion, 4o First Place Coding, Alignment Problem Solved Parody, Capybara GPU Smuggling YOLO Run

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (1 message):

Coding Agents, Symflower blogpost

Link mentioned: How well can coding agents be installed with a good cheap model, transpile a repository, and then generate & execute tests?: Evaluating all major coding agents: All-Hands, Cline, Goose, gptme, SWE-Agent, VS Code Copilot Agent, ...


Interconnects (Nathan Lambert) ▷ #expensive-queries (2 messages):

LaTeX spacing


Torchtune ▷ #general (2 messages):

FP8 QAT, TorchAO

Link mentioned: FP8 QAT / FP8 block-wise quantization · Issue #1632 · pytorch/ao: Having QAT for FP8 would be a great addition, and FP8-blockwise quantization in general.


Torchtune ▷ #dev (69 messages🔥🔥):

GRPO PRs, RL/RLHF, vLLM, Anthropic confidence intervals

Links mentioned:


Nous Research AI ▷ #general (64 messages🔥🔥):

Claude UI Update, DeepSeek diffusion transformers, U.S. TinyZero model, EXAONE Deep, Ghibli gen

Links mentioned:


Nous Research AI ▷ #ask-about-llms (4 messages):

Hermes-3, OLMoE-1B-7B

Link mentioned: allenai/OLMoE-1B-7B-0125-Instruct · Hugging Face: no description found


Nous Research AI ▷ #research-papers (1 message):

teknium: https://x.com/yangjunr/status/1904943713677414836?s=46


HuggingFace ▷ #general (49 messages🔥):

DeepSeek combines diffusion and transformers like GPT-4o multimodal, ZeroGPU quota not resetting, Hugging Face library and tutorials on training an image dataset for fine-tuning an LLM, Offloading models from memory once the task is complete, Minor bug in the Hugging Face Transformers library

Links mentioned:


HuggingFace ▷ #i-made-this (12 messages🔥):

Teachable Machine Alternatives, Linuxserver.io desktop environment, GUI agent demos, OpenAI CUA model

Links mentioned:


HuggingFace ▷ #smol-course (2 messages):

smol-course credits, agent course credits, HuggingFace credits


HuggingFace ▷ #agents-course (7 messages):

Evaluating Toxicity LLM-as-a-Judge in Langfuse, Base Models vs Instruct Models, Adjusting Agent System Prompt After Initializations

Link mentioned: Reddit - The heart of the internet: no description found


Notebook LM ▷ #use-cases (1 message):

Streamlining Job Applications, Company Research, Cover Letter Generation


Notebook LM ▷ #general (29 messages🔥):

Mindmapping, Uploading sources, Versioning, Pasted sources naming, Readability of lecture transcripts

Link mentioned: Tole Cat GIF - Tole Cat Cute - Discover & Share GIFs: Click to view the GIF


LlamaIndex ▷ #blog (2 messages):

LlamaCloud MCP Server, LlamaIndex MCP Client, AI Agent Systems, Text-to-SQL Conversion


LlamaIndex ▷ #general (18 messages🔥):

ChatMessage history to the FunctionAgent workflow, Support rich content in agent responses, Custom telemetry attributes when interacting with LlamaIndex's LLM, Selectors, Agents, VannaPack and adding a memory with history = []

Links mentioned:


LlamaIndex ▷ #ai-discussion (1 message):

LlamaParse PDF Issues, Multi-PDF Parsing


Cohere ▷ #「💬」general (13 messages🔥):

Cohere "Command" naming, Coral Model Selection, Job opportunities at Cohere

Link mentioned: Careers | Cohere: Our team of ML/AI experts is passionate about helping developers solve real-world problems. From our offices in Toronto, London, and Palo Alto, we work at the cutting edge of machine learning to unloc...


Cohere ▷ #「🤖」bot-cmd (2 messages):

Testing Bot Commands


Cohere ▷ #「🤝」introductions (4 messages):

Full-Stack Web Development, Mobile App Development, AI Solutions, Cloud Technologies, Oracle ERP Fusion


Nomic.ai (GPT4All) ▷ #general (7 messages):

GPT4All usability issues, Mistral Small 3.1 and Gemma 3 implementation, GPT4All advantages, GPT4All v4.0.0 expectations, GPT4All model settings page


tinygrad (George Hotz) ▷ #general (1 message):

georgehotz: can everyone close open PRs and issues that are stale?


tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

TinyGrad Codegen, TinyGrad indexing


DSPy ▷ #general (1 message):

DSPy output validation, DSPy handling invalid outputs


DSPy ▷ #examples (3 messages):

Optimizers in DSPy, Declarative Self-improving Python, Modular AI systems

Link mentioned: DSPy: The framework for programming—rather than prompting—language models.


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Entrepreneurship Track Mentorship, Office Hours with Sponsors


Codeium (Windsurf) ▷ #announcements (2 messages):

Gemini 2.5 Pro release, Windsurf rate limits

Link mentioned: Tweet from Windsurf (@windsurf_ai): Gemini 2.5 Pro is now available in Windsurf! ✨


Modular (Mojo 🔥) ▷ #mojo (1 message):

self parameter, Foo[1] default parameter




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}