Frozen AI News archive

not much happened today

**OpenAI** announced the new **GPT-4o** model with enhanced instruction-following, complex problem-solving, and native image generation capabilities. The model shows improved performance in math, coding, and creativity, with features like transparent background image generation. Discussions around content filtering and policy for image generation emphasize balancing creative freedom and harm prevention. **DeepSeek V3-0324** APIs, available on **Hugging Face** and powered by **SambaNovaAI**, outperform benchmarks and models like **Gemini 2.0 Pro** and **Claude 3.7 Sonnet**. **Gemini 2.5 Pro** is recommended for coding, and **Gemini 3** can be deployed easily on Google Cloud Vertex AI via the new Model Garden SDK. The **Gemma 3 Technical Report** has been released on arXiv.

Canonical issue URL

AI News for 3/26/2025-3/27/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (230 channels, and 7972 messages) for you. Estimated reading time saved (at 200wpm): 757 minutes. You can now tag @smol_ai for AINews discussions!

There's a new 4o model in ChatGPT, but there's no blogpost and not much detail beyond the announcement tweet so there's not much to report. However you can see that the time between SOTA models has been shortening recently.

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

GPT-4o and Multimodal Models

DeepSeek and Gemini

AI Safety and Interpretability

AI Tools and Frameworks

Trends and Opinions

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek V3 0324 on Livebench Surpasses Claude 3.7 with Hallucination Issues

Theme 2. Microsoft's KBLaM: Plug-and-Play Knowledge in LLMs

Theme 3. New QVQ-Max Feature on Qwen Chat Enhances User Experience

Theme 4. Gemini 2.5 Pro Faces Performance Criticism Despite ASIC Advantage

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

debugging issues with our pipelines, sorry...


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Gemini 2.5 Pro: Rate Limits, Pricing, and Performance Hype

Theme 2. OpenAI's GPT-4o: Updates, Image Generation, and Policy Shifts

Theme 3. Model Context Protocol (MCP) Gains Momentum and Faces Challenges

Theme 4. Local LLM and Tooling Updates: Unsloth, LM Studio, and Aider

Theme 5. Turing Institute Turmoil and Open Source RL System DAPO


PART 1: High level Discord summaries

Cursor Community Discord


Perplexity AI Discord


Manus.im Discord Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


Modular (Mojo 🔥) Discord


HuggingFace Discord


MCP (Glama) Discord


Notebook LM Discord


Yannick Kilcher Discord


GPU MODE Discord


LlamaIndex Discord


Latent Space Discord


Torchtune Discord


Cohere Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


DSPy Discord


Codeium (Windsurf) Discord


Nomic.ai (GPT4All) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor Community ▷ #general (1297 messages🔥🔥🔥):

Gemini 2.5 Pro Pricing and Access, Windsurf vs. Cursor: Pros and Cons, Context Window Limitations, Model Performance and Preferences, Workflow Strategies

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Perplexity Discord Bot, Testing the Discord Bot, Discord Bot Feedback


Perplexity AI ▷ #general (748 messages🔥🔥🔥):

GPT-4.5 Discontinued, Complexity Extension, MCP Servers, Perplexity Pro API vs Subscription, CEO Suggests Ad Removal

Links mentioned:


Perplexity AI ▷ #sharing (6 messages):

Perplexity AI Search, Android 15, Bluetooth Toggle


Perplexity AI ▷ #pplx-api (27 messages🔥):

sonar API issues, llama-3.1-sonar-small-128k-online problems, Tier 3 access needed, Perplexity API parameter error handling

Link mentioned: no title found: no description found


Manus.im Discord ▷ #general (1045 messages🔥🔥🔥):

Gemini 2.5 Pro, Manus Invitation Code Wait Times, Discord Sidebar Changes, Manus Staging WordPress Issues, Manus and N8N Workflow Automation

Links mentioned:


LMArena ▷ #general (652 messages🔥🔥🔥):

Livebench benchmark discussion, Gemini 2.5 Pro performance, Censorship in AI models, Qwen 3 release, DeepSeek V3 0324 performance

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (391 messages🔥🔥):

Unsloth Dynamic 4-bit Quantization, Qwen/Qwen2.5-Omni-7B in Unsloth, GRPO research, TTS fine-tuning, Llama 3.2 vision

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

YouTube feed algorithms, context length limitations


Unsloth AI (Daniel Han) ▷ #help (67 messages🔥🔥):

Gemma3 finetuning issues, Dynamic 4-bit quantization, Qwen2.5VL-7B finetuning, Toxicity injection attacks, Llama 3 fine-tuning with LoRA

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (65 messages🔥🔥):

ByteDance training policy, Dr GRPO paper, Catastrophic overtraining, Low precision training, Nvidia pruning paper

Links mentioned:


aider (Paul Gauthier) ▷ #general (452 messages🔥🔥🔥):

Gemini 2.5 Pro, Rate Limits with Gemini 2.5 Pro, Model Context Protocol (MCP), OpenAI's GPT-4o Update, Aider's New /context Command

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (48 messages🔥):

Readonly file addition PR, Gemini issues with OpenAI API, /context mode explanation, Aider git issues, Setting different models as architect and coder


OpenAI ▷ #ai-discussions (222 messages🔥🔥):

Keyboard Remapping, Dashes vs. Semicolons, Sora Prompts and AI Image Generation, Midjourney vs Sora image generation, NSFW content and AI


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

Context Window, Image generation


OpenAI ▷ #prompt-engineering (42 messages🔥):

Sora Prompt Engineering, AI for Academic Research, Arxiv's role in STEM publishing, AI peer review, Translating Foreign Language Data


OpenAI ▷ #api-discussions (42 messages🔥):

Sora Prompts, AI Research Paper, Arxiv, Meta-Prompting, AI Peer Review


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Gemini 2.5, OpenRouter tips, Cursor IDE integration

Link mentioned: Tweet from OpenRouter (@OpenRouterAI): To maximize your free Gemini 2.5 quota:1. Add your AI Studio API key in https://openrouter.ai/settings/integrations. Our rate limits will be a “surge protector” for yours.2. Set up OpenRouter in your ...


OpenRouter (Alex Atallah) ▷ #general (268 messages🔥🔥):

Stripe security, Gemini 2.5 Pro, OpenRouter and OpenAI SDK compatibility, Deepseek R1 provider issues, OpenRouter provider routing

Links mentioned:


LM Studio ▷ #announcements (1 messages):

LM Studio 0.3.14, Multi-GPU Controls, NVIDIA GPUs, AMD GPUs

Links mentioned:


LM Studio ▷ #general (135 messages🔥🔥):

Vision Model Plugins, LM Studio Download Speed, VRAM Requirements for Models, Github Copilot vs Cursor, Fine-tuning Models with Unsloth

Links mentioned:


LM Studio ▷ #hardware-discussion (67 messages🔥🔥):

Gemma 3, 9070XT, ROCm, P100, RTX 4060ti 16gb


Eleuther ▷ #general (88 messages🔥🔥):

Deepseek V3 on Mac Studios vs. Cloud Instances, ICLR 2025 Meetup, Qwen2.5-Omni-7B Audio Testing, Qwen 32B Model Evaluation with LLM Harness, Transformers Library Errors

Links mentioned:


Eleuther ▷ #research (2 messages):

Catastrophic Overtraining, OLMo-1B instruction-tuned model, Gemma Team

Links mentioned:


Eleuther ▷ #interpretability-general (86 messages🔥🔥):

privileged basis, neural networks and fixed mechanisms, CoT for reward hacking, manifold manipulation

Links mentioned:


Eleuther ▷ #lm-thunderdome (2 messages):

AlpacaFarm logprob/loss implementation, Instruction tuning EOS token

Link mentioned: [Discussion] about compute_logprobs · Issue #56 · tatsu-lab/alpaca_farm: alpace_farm implementation https://github.com/tatsu-lab/alpaca_farm/blob/94b02079b74af731b2671e3691a5080d5d340fd8/src/alpaca_farm/models/rl_models.py#L97C30-L97C46 DeepSpeedExamples implementation ...


Eleuther ▷ #gpt-neox-dev (7 messages):

GPT-NeoX Data Chunking, Cross-Document Attention, FA3 Support for H100s, FP8 and H100 performance


Interconnects (Nathan Lambert) ▷ #news (96 messages🔥🔥):

Gemini 2.5 Pro, Claude 3.5 Sonnet, OpenAI Revenue, Midjourney CEO, 4o Image Generation

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (23 messages🔥):

Ghibli model training and copyright, Anthropic referral program, CoT French, OpenAI's image generation policy

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (43 messages🔥):

GPT-4o Image Generation Rollout, Naming Conventions for Models, Gary Marcus on AI Economics, White House Deletes Ghibli-Style Tweet

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (6 messages):

Alan Turing Institute Crisis, WanTeam's AI Failure Paper, dewey_en_beta embedding model

Links mentioned:


Interconnects (Nathan Lambert) ▷ #expensive-queries (7 messages):

Gemini 2.5, AI Studio, Long Contexts

Link mentioned: ‎Gemini - LaTeX Typos and Formatting Issues : Created with Gemini Advanced


Modular (Mojo 🔥) ▷ #mojo (168 messages🔥🔥):

Unit Scaling, SI Units as a Closed Set, Return Type Logic, Conditional Type, Extension Methods

Links mentioned:


HuggingFace ▷ #general (52 messages🔥):

Model Parameters Explained, ComfyUI InfiniteYou, HuggingFace Inference API Pricing, OpenAI 4o dataset

Links mentioned:


HuggingFace ▷ #today-im-learning (1 messages):

LLMs, Transformers, Guidance for Newcomers


HuggingFace ▷ #cool-finds (6 messages):

Windsurf for Vite Frontend, ComfyUI InfiniteYou Integration

Link mentioned: GitHub - ZenAI-Vietnam/ComfyUI_InfiniteYou: An implementation for InfiniteYou: An implementation for InfiniteYou. Contribute to ZenAI-Vietnam/ComfyUI_InfiniteYou development by creating an account on GitHub.


HuggingFace ▷ #i-made-this (12 messages🔥):

sieves zero-shot NLP pipeline, llama-cpp-connector updates for vision models, HFInheritedModelConfig for custom model building, Morphos web tool

Links mentioned:


HuggingFace ▷ #computer-vision (6 messages):

Image Reference Points, Qwen 2.5 VL Models on Kaggle, Memory Errors with Qwen 2.5 VL, Flash Attention 2 for GPU Offloading


HuggingFace ▷ #NLP (16 messages🔥):

SetFit v4 release, Reranker models, LLMs generating JSON, Converting PDFs to JSON, Training models on precaution data

Link mentioned: Training and Finetuning Reranker Models with Sentence Transformers v4: no description found


HuggingFace ▷ #smol-course (4 messages):

AI Agents Course, Smol Course

Links mentioned:


HuggingFace ▷ #agents-course (16 messages🔥):

Course Unit Release Dates, Hugging Face Token Setup, Gemini vs. You.com, Agent Building Ideas, LLM Evaluator Issues


MCP (Glama) ▷ #general (94 messages🔥🔥):

MCP adoption across OpenAI products, MCP impact on businesses and the future, Cloudflare's MCP tooling, Security risks of MCPs from GitHub, MCP server implementation issues with Claude

Links mentioned:


MCP (Glama) ▷ #showcase (12 messages🔥):

Canvas MCP, Truto's SuperAI, Model Context Protocol (MCP), Gradescope integration, Docker Compose for MCP servers

Links mentioned:


Notebook LM ▷ #announcements (1 messages):

Mind Map public release


Notebook LM ▷ #use-cases (8 messages🔥):

Spanish podcasts not working, Sharing Notebooks issues, Company research for cover letters and resumes


Notebook LM ▷ #general (83 messages🔥🔥):

Gemini 2.5 Pro Turkey Test, Gemini Advanced Research Limit, NotebookLM API and Podcast Creation, Mind Map Improvements, Gemini 2.0 Flash Readability


Yannick Kilcher ▷ #general (33 messages🔥):

Sketch-to-Model Pipeline, Alternatives to Kernel Attention (KA), AI Solving Puzzles, ChatGPT and Grok 3 for UX/UI, Information Theory in AI/ML

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (18 messages🔥):

Discord Timestamps, Chain-of-Bias issues, Paper Discussion Format, Tracing Thoughts in a Language Model, Attribution Graphs

Link mentioned: Scaling Laws of Synthetic Data for Language Models: Large language models (LLMs) achieve strong performance across diverse tasks, largely driven by high-quality web data used in pre-training. However, recent studies indicate this data source is rapidly...


Yannick Kilcher ▷ #ml-news (14 messages🔥):

Alan Turing Institute Crisis, GPT-4o Autoregressive Image Generation, Image Token Reusal

Links mentioned:


GPU MODE ▷ #general (2 messages):

Data distribution in DP and TP ranks, TRL handling of data distribution


GPU MODE ▷ #triton (8 messages🔥):

Pre/Post Hooks in Triton, num_ctas for Hopper, Local Tensor expansion


GPU MODE ▷ #cuda (4 messages):

Memory Coalescing, CUDA Memory Hierarchy


GPU MODE ▷ #torch (1 messages):

PyTorch profiler, profiler trace


GPU MODE ▷ #jobs (3 messages):

Red Hat, Software Engineer, C++, GPU Kernels, CUDA


GPU MODE ▷ #beginner (1 messages):

Knowledge Distillation for Video Models, Estimating Model Parameters, Estimating Inference Throughput on Consumer GPUs


GPU MODE ▷ #torchao (5 messages):

Blocksparse, TorchAO, Pull Request #1734, Pull Request #1974

Links mentioned:


GPU MODE ▷ #off-topic (1 messages):

Hayao Miyazaki on AI art, Studio Ghibli anime filter

Link mentioned: Tweet from Nuberodesign (@nuberodesign): Since this utter garbage is trending, we should take a look at what Hayao Miyazaki, the founder of Studio Ghibli, said about machine created art.Quoting Grant Slatton (@GrantSlatton) tremendous alpha ...


GPU MODE ▷ #sparsity-pruning (1 messages):

srns27: gosh I'm so blind thanks man haha


GPU MODE ▷ #gpu模式 (1 messages):

nuttt233: 因为batch gemm中默认前两个维度是batch stride,后两维才是row col


GPU MODE ▷ #general (3 messages):

ComfyUI, CUDA, load_inline, Triton

Link mentioned: reference-kernels/problems/pmpp/vectoradd_py/solutions/correct/submission_cuda_inline.py at main · gpu-mode/reference-kernels: Reference Kernels for the Leaderboard. Contribute to gpu-mode/reference-kernels development by creating an account on GitHub.


GPU MODE ▷ #submissions (25 messages🔥):

Modal Runners, vectorsum, grayscale


LlamaIndex ▷ #blog (1 messages):

LlamaCloud, MCP Server, Claude Desktop


LlamaIndex ▷ #general (22 messages🔥):

LlamaExtract Schema Inference, TS Chatbot with Postgres DB, E-commerce Chatbot Architecture, SQL Query Generation Issues, Structured Prediction Bug

Links mentioned:


LlamaIndex ▷ #ai-discussion (13 messages🔥):

PDF Parsing Tools, LlamaParse and Image Reading, LLMs for Image Captioning, Hybrid Chunking, OCR for Scanned Documents


Latent Space ▷ #ai-general-chat (25 messages🔥):

Nvidia Acquires Lepton AI, Model Context Protocol, Replit Agent v2, GPT-4o Update, OpenAI Image Generation Policy

Links mentioned:


Torchtune ▷ #general (4 messages):

FP8 QAT, Optimizer State with Fake Quant

Link mentioned: FP8 QAT / FP8 block-wise quantization · Issue #1632 · pytorch/ao: Having QAT for FP8 would be a great addition, and FP8-blockwise quantization in general.


Torchtune ▷ #dev (18 messages🔥):

Deprecated code deletion, Linter installation issues, Anthropic using TensorFlow, GRPO PRs, JoeI sora

Link mentioned: Full train_on_input deprecation, removing other deprecated components by RdoubleA · Pull Request #2533 · pytorch/torchtune: ContextWhat is the purpose of this PR? Is it to add a new feature fix a bug update tests and/or documentation other (please add here)ChangelogWhat are the changes made in this PR?Use mas...


Cohere ▷ #「💬」general (12 messages🔥):

Vector Database Options, Hosting Vector DB Online, AI Agent Pricing, Cohere at QCon London

Link mentioned: Integrating Embedding Models with Other Tools — Cohere: Learn how to integrate Cohere embeddings with open-source vector search engines for enhanced applications.


Cohere ▷ #「🤝」introductions (2 messages):

Refugee Organization, Peacebuilding, Livelihood Opportunities


tinygrad (George Hotz) ▷ #general (12 messages🔥):

Budget AI Rig, AX650N NPU, Tinygrad PRs

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

TinyGrad Code Generation, Codegen Translators


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (7 messages):

sharing lecture recordings, mentorship deadline extension, mentorship for Entre track


DSPy ▷ #papers (3 messages):

Atom of Thoughts (AOT), Tree of Thoughts (ToT), Markovian Reasoning, Two-phase Transition, Atomic Granularity & Dependencies


DSPy ▷ #general (1 messages):

MiproV2 Issues, ValueError in DSPy


Codeium (Windsurf) ▷ #announcements (2 messages):

Gemini 2.5 Pro, Windsurf credits, Rate limiting

Link mentioned: Tweet from Windsurf (@windsurf_ai): Gemini 2.5 Pro is now available in Windsurf! ✨


Nomic.ai (GPT4All) ▷ #general (1 messages):

GPT4All issues, Model import problems, User experience frustrations




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}