Frozen AI News archive

not much happened today

**GPT-4.5** sparked mixed reactions on Twitter, with **@karpathy** noting users preferred **GPT-4** in a poll despite his personal favor for GPT-4.5's creativity and humor. Critics like **@abacaj** highlighted **GPT-4.5's slowness** and questioned its practical value and pricing compared to other models. Performance-wise, **GPT-4.5** ranks above **GPT-4o** but below **o1** and **Claude 3.5 Sonnet**, with **Claude 3.7** outperforming it on many tasks yet GPT-4.5 praised for its humor and "vibes." Speculation about GPT-4.5's size suggests around **5 trillion parameters**. Discussions also touched on pricing disparities, with **Perplexity Deep Research** at $20/month versus ChatGPT at $200/month. The emotional intelligence and humor of models like **Claude 3.7** were also noted.

Canonical issue URL

AI News for 2/27/2025-2/28/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (221 channels, and 8236 messages) for you. Estimated reading time saved (at 200wpm): 795 minutes. You can now tag @smol_ai for AINews discussions!

Much discussion about the relative merits of GPT 4.5, which you can read below.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

GPT-4.5 Model Performance and User Perception

Model Architecture, Scaling Laws and Efficiency

Open Source Models, Tools, and Frameworks

AI Applications and Industry Use Cases

AI Research and Papers

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek Realse: Revolutionary Storage and Data Processing Tech

Theme 2. French Reasoning Model: Economical and Effective

Theme 3. Sesame Realtime Voice Model Rivals OpenAI

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Humorous and Creative Applications of GPT 4.5

Theme 2. Innovations in AI Video and Audio Processing

Theme 3. AI Identity Confusions and Hallucinations

Theme 4. AI Tools Streamlining Programming and Writing


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. GPT-4.5 Enters Arena, but Claude 3.7 Still King of the Code

Theme 2. IDE Wars: Cursor and Windsurf Trade Blows Over AI Coding Supremacy

Theme 3. Hardware Hustle: DeepSeek's DualPipe and TinyLM Offer Glimmers of Innovation

Theme 4. Pricing Pressure: GPT-4.5 API Costs Spark Outrage, Open Source Alternatives Beckon

Theme 5. Community Pulse: From Robotics Arms to LeetCode for CUDA, Innovation Thrives


PART 1: High level Discord summaries

Cursor IDE Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Codeium (Windsurf) Discord


GPU MODE Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Interconnects (Nathan Lambert) Discord


Latent Space Discord


Nous Research AI Discord


HuggingFace Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


Yannick Kilcher Discord


Cohere Discord


LlamaIndex Discord


DSPy Discord


Torchtune Discord


Notebook LM Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (975 messages🔥🔥🔥):

GPT-4.5 performance, Claude 3.7 Sonnet, Cursor bugs, Windsurf vs Cursor, Memory bank usefulness

Links mentioned:


aider (Paul Gauthier) ▷ #general (1144 messages🔥🔥🔥):

GPT-4.5 Analysis, Claude 3.7 vs o3-mini, Aider Improvements, deepseek R2, GPT-4o versus 4.5

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (74 messages🔥🔥):

aider auto-retry mode, Deepseek Model Reliability, Aider and Venice AI, Aider install on offline computer, Using Claude 3.7 with Aider

Links mentioned:


OpenAI ▷ #annnouncements (3 messages):

GPT-4.5 release, ChatGPT Pro users, Scaling unsupervised learning, Multimodal features


OpenAI ▷ #ai-discussions (618 messages🔥🔥🔥):

Sonnet 3.7 vs GPT 4.5, Grok Model Speculation, GPT-4.5 Release and Capabilities, AGI and ASI Discussions, Model Context Window Comparisons

Links mentioned:


OpenAI ▷ #gpt-4-discussions (9 messages🔥):

Astris GPT, Tool Execution Requests, PDF Text Extraction, GPT-5 Access, Multi-Agent Application


OpenAI ▷ #prompt-engineering (29 messages🔥):

Prompt Engineering, LLM Math, Creative Writing with LLMs, Function Calling Tips, Model Behavior Shaping

Link mentioned: OpenAI Model Spec: The Model Spec specifies desired behavior for the models underlying OpenAI's products (including our APIs).


OpenAI ▷ #api-discussions (29 messages🔥):

Prompt Engineering, LLMs for Education, Creative Writing with ChatGPT, Function Calling in Assistants, ChatGPT Disassembler


Unsloth AI (Daniel Han) ▷ #general (557 messages🔥🔥🔥):

Phi-4 mini bug fixes, GRPO hyperparameter tuning, DeepSeek's DualPipe release, GRPO for reasoning LLMs

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (29 messages🔥):

EPYC chip arrival, Thinking OnePicyeah model, Claude's capabilities, Pycraft engine by Deepseek, Open Source vs. Early Access


Unsloth AI (Daniel Han) ▷ #help (39 messages🔥):

Ollama Think Token, Qwen 2.5 VL loading issues, Unsloth pricing for 8x4090, ONNX vs TFLite, Fine-tuning Qwen 2.5 VL

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

ifeval, Instruction-following eval

Link mentioned: GitHub - oKatanaaa/ifeval: A clean IFEval implementation: A clean IFEval implementation. Contribute to oKatanaaa/ifeval development by creating an account on GitHub.


Unsloth AI (Daniel Han) ▷ #research (4 messages):

Emergent Misalignment Paper, Mercury dLLM, Diffusion vs Transformers

Links mentioned:


Codeium (Windsurf) ▷ #announcements (1 messages):

Claude 3.7 Sonnet, Prompt Flow Actions, Credit Multiplier Adjustment


Codeium (Windsurf) ▷ #discussion (25 messages🔥):

Codeium.el Hacks, Flow Action Credits, Jetbrains IDE features parity, Cascade Engine Issues, DeepSeek v3 Integration

Link mentioned: Codeium Feedback: Give feedback to the Codeium team so we can make more informed product decisions. Powered by Canny.


Codeium (Windsurf) ▷ #windsurf (579 messages🔥🔥🔥):

Claude 3.7 Sonnet cost, Windsurf pricing and credits, Cursor vs Windsurf, Deepseek v3, Windsurf Stability

Links mentioned:


GPU MODE ▷ #general (36 messages🔥):

Deepseek R1, Zen 5 NPU, AIE Toolchain, Ultrascale Playbook, Mixed Precision Training

Links mentioned:


GPU MODE ▷ #triton (46 messages🔥):

INT4 TC, FP4 vs INT4, reinterpret_cast on tl.tensor, Threads in the block with lock, Packed Integer Values

Links mentioned:


GPU MODE ▷ #cuda (61 messages🔥🔥):

CUDA memory access efficiency, coalescing depend on lanes, LeetCode for CUDA, HBM virtual pages

Links mentioned:


GPU MODE ▷ #torch (4 messages):

MPS Development, CI-based development


GPU MODE ▷ #announcements (1 messages):

Nouamane Tazi, Ultra-Scale Playbook, LLM training, 5D Parallelism

Link mentioned: The Ultra-Scale Playbook - a Hugging Face Space by nanotron: no description found


GPU MODE ▷ #algorithms (1 messages):

Multi-head Latent Attention, Decoupled RoPE, MHA vs MLA, Weight Merging in MLA


GPU MODE ▷ #cool-links (10 messages🔥):

DualPipe, GPU Architecture Fundamentals, CUDA Leetcode, Diffusion Models, TinyLM

Links mentioned:


GPU MODE ▷ #beginner (7 messages):

HBM Bandwidth Estimation, CUDA Kernel Access Patterns, Mathematics for PMPP/CUDA, Discord Scams


GPU MODE ▷ #self-promotion (5 messages):

CUDA C++ and CUDA Python Tutorials, Accelerated Python Profiling Tools Survey, L1 store-caching in CUDA, tinylm WebGPU acceleration, LeetCode for CUDA

Links mentioned:


GPU MODE ▷ #reasoning-gym (25 messages🔥):

Reasoning Gym Eval Script, Mercury Diffusion LLMs, GPT-4.5 Release, willccbb/verifiers issue

Links mentioned:


GPU MODE ▷ #gpu模式 (16 messages🔥):

Chinese Internet Trends (Douyin vs. Xiaohongshu), Experiences with NVIDIA Hardware, MLSys and CUDA Discussions on Xiaohongshu, Chinese Room Thought Experiment, CUDA QQ Groups

Link mentioned: 中文房间 - 维基百科,自由的百科全书: no description found


GPU MODE ▷ #general (1 messages):

1000 Submissions Milestone


GPU MODE ▷ #submissions (206 messages🔥🔥):

Grayscale Leaderboard, Histogram Leaderboard, Vectoradd Leaderboard, Vectorsum Leaderboard, Sort Leaderboard


GPU MODE ▷ #ppc (10 messages🔥):

INT8 Matmul, Loop Reordering, CPU optimization


GPU MODE ▷ #feature-requests-and-bugs (6 messages):

Custom Kernel Preprocessing, Bot Submitter Identification, Matmul Preprocessing Time


OpenRouter (Alex Atallah) ▷ #announcements (4 messages):

OpenAI Outage, DeepSeek R1, Claude Sonnet 3.7, GPT-4.5 Preview

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

YPerf, Gemini Flash, Llama 3, Claude 3.5 Sonnet

Link mentioned: YPerf: no description found


OpenRouter (Alex Atallah) ▷ #general (389 messages🔥🔥):

Sonnet 3.7 thinking endpoint, DeepSeek R1 reasoning, OpenAI's GPT 4.5 pricing and performance, OpenRouter Documentation

Links mentioned:


LM Studio ▷ #general (278 messages🔥🔥):

Robotics DIY, LLM backend website, Grok-3 performance vs O3, DeepSeek political controversy, OpenAI defense contracts

Links mentioned:


LM Studio ▷ #hardware-discussion (41 messages🔥):

Framework desktop, Unified RAM, AMD Ryzen AI, GPU Pricing

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (274 messages🔥🔥):

Claude Annual Subscriptions, Microsoft Phi-4 Models, GPT-4.5 System Card, OpenAI Livestream, Meta AI Standalone App

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (4 messages):

Anthropic data collection, Alignment for monitoring

Link mentioned: Tweet from Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 (@elder_plinius): sneaky sneaky, @AnthropicAIcollecting user data from everyone that used the Computer Use API without informed consent or an opt-out option is dirty workusing that data to then train a classifier to im...


Interconnects (Nathan Lambert) ▷ #random (19 messages🔥):

Claude Code access and potential uses, DeepEP analysis, AI competing on Pokemon Red%, Claude 3.7 Sonnet RL issues

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (10 messages🔥):

GPT-4.5 release, DeepSeek r1, Claude Code ls node_modules, Gary Marcus GPT-4.5

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (3 messages):

Alignment, Realism-grounded alignment


Interconnects (Nathan Lambert) ▷ #posts (2 messages):

olmOCR vs Top PDF tools, Pairwise judgments and Elo score

Link mentioned: Tweet from Ai2 (@allen_ai): olmOCR dominates the competition! Our human evaluation using pairwise judgments against top PDF processing tools show olmOCR's rating significantly above other tools. Don't take our word for i...


Latent Space ▷ #ai-general-chat (133 messages🔥🔥):

Speak AI revenue graph, Hume AI's Octave text-to-speech LLM, Levelsio flying project, Perplexity Sonar API Deep Research, Firecrawl Deep Research API

Links mentioned:


Latent Space ▷ #ai-in-action-club (166 messages🔥🔥):

GPT 4.5, Claude 3.7 Sonnet, Model Scaling, Open Source, Every Hiring

Links mentioned:


Nous Research AI ▷ #general (280 messages🔥🔥):

Apple Intelligence Underwhelming, Efficient CoT, GPT-4.5, MoE Models, Wan2.1 video model

Links mentioned:


Nous Research AI ▷ #ask-about-llms (4 messages):

AI Voice Commands, Reasoning in AI Models, Text-to-Speech AI, Elevenlabs, Cartesia

Link mentioned: Deepseek AI Assistant: ALWAYS ON Python AI Agent for Engineers that SHIP: 🔥 Is your Personal AI Assistant truly ALWAYS ON? Discover how Ada, powered by DeepSeek V3, is revolutionizing the way engineers ship code! 🚀🎥 Resources fo...


Nous Research AI ▷ #research-papers (1 messages):

Language Models, REFUTE benchmark, algorithmic problem solving

Link mentioned: Paper page - Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation: no description found


Nous Research AI ▷ #interesting-links (3 messages):

Diffusion LLMs, Mercury dLLM, LLaDA Release

Links mentioned:


Nous Research AI ▷ #research-papers (1 messages):

Language Models, Scientific discovery, REFUTE Benchmark

Link mentioned: Paper page - Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation: no description found


HuggingFace ▷ #general (132 messages🔥🔥):

HuggingFace Spaces licensing, Fal AI vs Deepinfra pricing, Lighteval MMLU-Pro support, LEFFA paper implementation, HuggingMod bot

Links mentioned:


HuggingFace ▷ #today-im-learning (4 messages):

Hiding vs Removing, F2 vs F12, Smol Agents Framework


HuggingFace ▷ #i-made-this (8 messages🔥):

LLM performance benchmark, Face similarity questionnaire, PyTorch library for 360° images, Phi-4 models

Links mentioned:


HuggingFace ▷ #reading-group (2 messages):

Language Models (LMs), REFUTE Benchmark, Reasoning Agents

Link mentioned: Paper page - Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation: no description found


HuggingFace ▷ #computer-vision (2 messages):

``


HuggingFace ▷ #gradio-announcements (1 messages):

FastRTC


HuggingFace ▷ #smol-course (9 messages🔥):

Inference Engine Alternatives, Smolagents Quiz Iframe, Smolagents Quiz Failures, HfApiModel vs LiteLLMModel Confusion, SFT Trainer Loss Function


HuggingFace ▷ #agents-course (129 messages🔥🔥):

Chat templates, agent, and LLM interaction, NVIDIA AI Red Team Prompt Injection, CodeAgent's Python interpreter, Smolagents codeagents to set the system prompts, Agent Laboratory for research reports and code repositories

Links mentioned:


Perplexity AI ▷ #general (264 messages🔥🔥):

Perplexity Pro Flair, New Voice Mode, Disable Web Search, Coding with Perplexity, Gemini Real Time Video Chat

Links mentioned:


Perplexity AI ▷ #sharing (17 messages🔥):

Majorana-1 Quantum, AI Communication, Lab Mice First Aid, House Blueprint, Ransomware Leaks

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity Pro API credits, Obsidian Web Clipper configuration, sonar-deep-research model, Refunds for Perplexity API


Stability.ai (Stable Diffusion) ▷ #announcements (1 messages):

Website Redesign Contest, Stable Diffusion 3.5, AI-generated artwork, US participants only


Stability.ai (Stable Diffusion) ▷ #general-chat (92 messages🔥🔥):

ControlNet models for consistent characters, LLMs referencing real-time data, SDXL alternative with T5 CLIP, Inpaint Anything error, Selling ComfyUI workflows


Eleuther ▷ #general (8 messages🔥):

Hugging Face Deprecation, Best RAG Tool, LLM Pretraining Guide


Eleuther ▷ #research (36 messages🔥):

Data Mixing, DualPipe, DeepSeek, Gemini Flash Thinking, SWE-RL

Links mentioned:


Eleuther ▷ #interpretability-general (22 messages🔥):

Jacobian Sparse Autoencoders, SmolLM2 Intermediate Checkpoints, Mechanistic Interpretability Resources, Saving Weights after Iteration, Open Problems in Mechanistic Interpretability

Links mentioned:


Eleuther ▷ #lm-thunderdome (17 messages🔥):

QA Task Evaluation, ARC-Easy, ARC-hard, Mosaic's Eval Framework, GPQA Diamond COT Zero-Shot Evaluation

Link mentioned: lm-evaluation-harness/lm_eval/tasks/arc/arc_challenge_chat.yaml at main · EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


Yannick Kilcher ▷ #general (58 messages🔥🔥):

Microsoft's survival aided by governments, Deterministic manners of AI models, AI in programming, Agentic systems struggle, Small team build a better browser than Chrome

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (7 messages):

Hash Collisions, KV Similarity

Link mentioned: ClaudePlaysPokemon - Twitch: Claude Plays Pokemon - Debut Stream


Yannick Kilcher ▷ #ml-news (15 messages🔥):

Remarkable Alexa, GPT-4.5 Announcement, DeepSeek AI Open Infra Index

Links mentioned:


Cohere ▷ #discussions (44 messages🔥):

Cohere models in OpenAI SDK, Auto Subtitles, Command R+ update, R7B Arabic vs Fanar and ALLaM

Links mentioned:


Cohere ▷ #announcements (1 messages):

Command R7B Arabic Model, Multilingual AI Model, Arabic Language Optimization

Links mentioned:


Cohere ▷ #cmd-r-bot (3 messages):

Differential Transformers, World Without Coffee Essays


Cohere ▷ #projects (9 messages🔥):

Free auto caption APIs, Adobe Premiere auto transcription


LlamaIndex ▷ #blog (2 messages):

LlamaIndex CentralReach, LlamaExtract Public Beta


LlamaIndex ▷ #general (48 messages🔥):

Data Leak in LlamaParse 0.6.2, Reloading pgvector Index Table, AgentWorkflow Custom Exception Handling, Elasticsearch Metadata Schema, LlamaExtract Documentation Outdated

Links mentioned:


DSPy ▷ #show-and-tell (1 messages):

Prompt Engineering Studio, AI-powered assistant, Reusable templates, Version control, Team collaboration

Link mentioned: Demo: Prompt Engineering Studio · Zoom · Luma: Join us for an exclusive first look at Portkey's Prompt Engineering Studio - the most comprehensive toolkit for building, testing, and deploying AI prompts at…


DSPy ▷ #general (37 messages🔥):

ReAct Agent Integration, DSPy Release Bug, MIPROv2 Optimizer Error, Refine API Feedback, Community Engagement

Links mentioned:


Torchtune ▷ #general (1 messages):

yamashi: Gpt4.5 available on azure


Torchtune ▷ #dev (26 messages🔥):

CI troubles, Activation Offloading, Distributed Torch FL Code, DPO Integration Test

Links mentioned:


Torchtune ▷ #papers (10 messages🔥):

DeepSeek DualPipe, Federated Learning at Scale

Link mentioned: GitHub - deepseek-ai/DualPipe: A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.: A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. - deepseek-ai/DualPipe


Notebook LM ▷ #use-cases (2 messages):

``


Notebook LM ▷ #general (29 messages🔥):

Notebook emoji changes, Arraying instructions with keywords, Sharing Notebooks with groups, Audio overview error, Public link to notebook

Links mentioned:


Modular (Mojo 🔥) ▷ #general (5 messages):

Repo Structure Simplification, Mojo Prioritization, Chris Lattner's Blog Post

Link mentioned: Upcoming changes to our GitHub repositories: Tomorrow (February 27), we’re streamlining our GitHub repositories! The max repo is merging into the mojo repo, bringing everything under one roof. A new subdirectory will house the Mojo standard libr...


Modular (Mojo 🔥) ▷ #mojo (25 messages🔥):

MLIR in stdlib, HyperLogLog in Mojo, MLIR Dialects in Mojo, MAX Graph Compiler, Unions in Mojo

Link mentioned: GitHub - axiomhq/mojo-hyperloglog: Contribute to axiomhq/mojo-hyperloglog development by creating an account on GitHub.


MCP (Glama) ▷ #general (18 messages🔥):

MCP in production, Claude Code diff based editing, Official everything server SSE, Glama AI GitHub App, Claude Code Invite

Links mentioned:


MCP (Glama) ▷ #showcase (5 messages):

Redmine MCP Server, Ableton Voice Control, tinylm library for running LLMs

Links mentioned:


Nomic.ai (GPT4All) ▷ #general (18 messages🔥):

Live Mode, Voice Assistant, GGUF models, Alltalk TTS

Links mentioned:


tinygrad (George Hotz) ▷ #general (12 messages🔥):

GROUP operations AST changes, BEAM search strategies for OptOps, arange GROUP optimization failure, LLVM speed regression

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

``


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Research Plans Announcement, Discord Server Recruitment


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

Research Track, Predictive Decision Making, Long Term Memory in Agents


MLOps @Chipro ▷ #general-ml (1 messages):

tinylm, WebGPU, OpenAI SDK, client-side LLMs

Link mentioned: tinylm - Run Models Locally with WebGPU: no description found



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}