Frozen AI News archive

small news items

**OpenAI** announced plans for **GPT-4.5 (Orion)** and **GPT-5**, with GPT-5 integrating the **o3** model and offering unlimited chat access in the free tier. **DeepSeek R1 Distilled Qwen 1.5B** outperforms OpenAI's **o1-preview** on math benchmarks, while **ModernBERT 0.3b** surpasses **Qwen 0.5b** at MMLU without fine-tuning. **Mistral** and **Perplexity** adopt **Cerebras** hardware for 10x performance gains. OpenAI's **o3** model won a gold medal at the 2024 International Olympiad in Informatics. Partnerships include **Qwen** with **Groq**. Significant RLHF activity is noted in Nigeria and the global south, and **Bytedance** is expected to rise in AI prominence soon. *"GPT5 is all you need."*

Canonical issue URL

AI News for 2/11/2025-2/12/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (211 channels, and 5266 messages) for you. Estimated reading time saved (at 200wpm): 497 minutes. You can now tag @smol_ai for AINews discussions!

No title story but lots of cool updates:


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Models & Performance

Industry & Business

Research & Papers

Tools & Applications

Development & Coding

Humor & Meta


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Revolutionary Latent Space Reasoning in LLMs

Theme 2. AMD's Strategic Moves in AI Hardware Competition

Theme 3. Project Digits: Nvidia’s Next Big Step in AI Workstations

Theme 4. Phi-4's Unconventional Approach to AI Creativity

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. OpenAI's New Models: GPT-4.5 'Orion' and Chain-of-Thought Integration

Theme 2. DeepSearch Goes Mainstream: Plus and Free User Access

Theme 3. Grok 3 Performance Leak and xAI Resignation Fallout

Theme 4. OpenAI Multimodal Models: o1, o3-mini, and o3-mini high


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1: OpenAI Unveils GPT-5 and Opens Floodgates to o1 and o3

Theme 2: GRPO Powers Up AI Models, Sending Performance Soaring

Theme 3: Thomson Reuters Clobbers AI Copycats in Court

Theme 4: DeepScaleR Rockets RL Back into the Spotlight

Theme 5: AI Models Get Curious with Automated Capability Discovery


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


OpenAI Discord


Codeium (Windsurf) Discord


Perplexity AI Discord


LM Studio Discord


Interconnects (Nathan Lambert) Discord


Eleuther Discord


Cursor IDE Discord


GPU MODE Discord


OpenRouter (Alex Atallah) Discord


Nous Research AI Discord


Notebook LM Discord


aider (Paul Gauthier) Discord


Stability.ai (Stable Diffusion) Discord


MCP (Glama) Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


Torchtune Discord


Yannick Kilcher Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


MLOps @Chipro Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (1027 messages🔥🔥🔥):

GRPO Implementation, Dataset Cleaning, MoE Models, Liger Kernel vs Apple Kernel, OpenR1-Math-Raw Dataset

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (12 messages🔥):

Reading resources, Sourcing information on AI/ML/RAG, Reddit as a source, Local Llama hosting


Unsloth AI (Daniel Han) ▷ #help (149 messages🔥🔥):

Fine-tuning Models, GRPO Optimization, Experiment Tracking, Qwen Model Performance, Mistral Benchmarking

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (71 messages🔥🔥):

Reward Function Challenges, In-context Training, GRPO Methodology, Fine-tuning Experiences, Mistral and Numeric Data Limitations

Links mentioned:


OpenAI ▷ #ai-discussions (308 messages🔥🔥):

Voice Chatbot Technology, Home Assistant Voice Integration, Moxie Robot Companion, ESP32 Microcontrollers, ChatGPT and Other AI Models

Links mentioned:


OpenAI ▷ #gpt-4-discussions (9 messages🔥):

Account Management Improvements, Custom GPT Model, Hiring Experts, Discussion Guidelines


OpenAI ▷ #prompt-engineering (12 messages🔥):

Iterative Prompting Benefits, Function Calling Issues, Prompt Engineering Best Practices, Community Engagement, Prompt Sharing Guidelines


OpenAI ▷ #api-discussions (12 messages🔥):

Iterative Prompting, Function Calling Issues, Prompt Sharing Etiquette, Clarifying Model Instructions


Codeium (Windsurf) ▷ #discussion (19 messages🔥):

Codeium Extension Issues, Switching to Alternatives, Pre-release Version Usage, Support Concerns, Recent Updates


Codeium (Windsurf) ▷ #windsurf (281 messages🔥🔥):

Windsurf Issues and Errors, Model Comparisons, User Feedback and Suggestions, Aggregate Model Performances, Support and Stability Concerns

Links mentioned:


Perplexity AI ▷ #general (240 messages🔥🔥):

Perplexity AI models, Sonar vs DeepSeek R1, API functionality, Real-time browsing, User experiences with Perplexity

Links mentioned:


Perplexity AI ▷ #sharing (9 messages🔥):

OpenAI Rebranding, Apple Tabletop Robot Prototype, Largest Structure in Universe, Street Art Discussion, EU AI Investment

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (20 messages🔥):

401 Authorization Required, API 500 Errors, Token Issues


LM Studio ▷ #general (85 messages🔥🔥):

LM Studio and Model Usage, Deepseek Model Applications, Audio Models Support, Markdown Input Rendering Issue, Coding LLM Preferences

Links mentioned:


LM Studio ▷ #hardware-discussion (175 messages🔥🔥):

GPU Comparison: 3080 Ti vs 3090 vs 4090 vs 5090, Importance of PCI-E Bandwidth in Gaming and Inference, AMD vs NVIDIA: Current Performance and Recommendations, Potential Issues with 5090 GPU, Building a Multi-GPU AI Setup

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (80 messages🔥🔥):

Thomson Reuters AI copyright case, Current AI fundraising, OpenAI's GPT-4.5 and GPT-5 roadmap, GRPO training improvements, DeepSeek-R1 and reasoning models

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (24 messages🔥):

Notebook LM alternatives, Long context models, Elicit.com and PaperQA, Claude performance, Gemini capabilities

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (21 messages🔥):

Grok 3 Controversy, Resignation from xAI, Grok 3 Performance Speculations, Free Speech Concerns at xAI, DeepSeek Comparisons

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (78 messages🔥🔥):

Voiceover Techniques, Claude Reasoning Model, OpenThinker-32B Release, Nuclear Physicists Comparison, RL Roundup Challenges

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (5 messages):

Goldman Sachs AI Interview Controversy, Deepseek Model Insights, OpenAI's Strategy Shift

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (18 messages🔥):

Invention of Science, Dwarkesh Podcast with Jeff Dean and Noam Shazeer, Metascience and History of Science Discussions

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (13 messages🔥):

Naming Schemes for LLMs, Philosophical Views on Kuhn's Book, Future of LLMs vs LRMs


Interconnects (Nathan Lambert) ▷ #policy (7 messages):

Shitty slides, Instructor concerns, Course content, Immortalization of students


Interconnects (Nathan Lambert) ▷ #expensive-queries (8 messages🔥):

Deep Research and O1 Pro Combo, ChatGPT UX Feedback, RL Question Misinformation, Switching Models in ODR Chat


Eleuther ▷ #general (22 messages🔥):

Deep Learning Loss Issues, Deepfrying Phenomenon, Pythia's Safety Training, Dataset Perplexity Evaluation, Polyglot Project Interest


Eleuther ▷ #research (198 messages🔥🔥):

Long-Term Memory models, Memory in Transformers, Automated Capability Discovery, Reinforcement Learning via Self-Play, Recursive Inference Scaling

Links mentioned:


Eleuther ▷ #interpretability-general (4 messages):

Fine-tuning and recognition, Testable hypotheses in AI


Cursor IDE ▷ #general (167 messages🔥🔥):

Cursor Documentation Updates, MCP Server Issues, O3-Mini Performance, Pricing Models for AI Services, Claude Model Performance

Links mentioned:


GPU MODE ▷ #general (2 messages):

NVIDIA GB200 images, Discord server purpose


GPU MODE ▷ #triton (1 messages):

Error in default mode vs INTERPRET mode, Triton kernel comparison, Matrix multiplication differences

Link mentioned: error is significantly larger in default mode than INTERPRET mode · Issue #5895 · triton-lang/triton: Describe the bug For the simple 2D matrix multiplation, difference between my triton kernel and torch are: INTERPRET mode: 9.5367431640625e-07 (set os.environ["TRITON_INTERPRET"] = "1&q...


GPU MODE ▷ #cuda (7 messages):

CUDA Memory Model Confusion, PTX Instruction Explanation, Blackwell Tensor Memory Management

Link mentioned: CUDA memory model: why acquire fence is not needed to prevent load-load reordering?: I am reading the book "Programming Massively Parallel Processors" and noticed the below code snippets to achieve "domino-style" scan: if (threadIdx.x == 0) {&#x...


GPU MODE ▷ #torch (18 messages🔥):

CPUOffload functionality, CPU optimizer step, Backward pass derivation, Shared memory techniques, Gradient checking issues


GPU MODE ▷ #cool-links (1 messages):

iron_bound: https://github.com/RC4ML/LoHan


GPU MODE ▷ #beginner (27 messages🔥):

CUDA Installation Issues, Global Memory Coalescing in CUDA, Feedback on CUDA Code Structure, Error Handling in CUDA, Memory Coalescing Visualization

Links mentioned:


GPU MODE ▷ #pmpp-book (2 messages):

CUDA memory model, Atomic Operations in CUDA, Thread Synchronization, GPU vs CPU Architecture, Understanding Scan/Prefix Sum

Link mentioned: CUDA memory model: why acquire fence is not needed to prevent load-load reordering?: I am reading the book "Programming Massively Parallel Processors" and noticed the below code snippets to achieve "domino-style" scan: if (threadIdx.x == 0) {&#x...


GPU MODE ▷ #torchao (2 messages):

FP8 Dynamic Quantization, INT8 Dynamic Quantization, Issue Resolution


GPU MODE ▷ #off-topic (2 messages):

Quantization-Aware Training, QuEST Method, YouTube Animation Parody

Links mentioned:


GPU MODE ▷ #irl-meetup (1 messages):

Nebius Meetup, Kubernetes operator for Slurm, Test-time computation in agentic systems

Link mentioned: Nebius AI Cloud Unveiled. San Francisco Meetup: Discover the most efficient way to build, tune and run your AI models and applications on top-notch NVIDIA® GPUs.


GPU MODE ▷ #bitnet (5 messages):

T-mac paper inquiries, FMHA BWD Kernel example, Contributions to BWD Kernels

Link mentioned: tilelang/examples/flash_attention/example_mha_bwd.py at main · tile-ai/tilelang: Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels - tile-ai/tilelang


GPU MODE ▷ #liger-kernel (2 messages):

FSDP, Liger Kernel


GPU MODE ▷ #self-promotion (3 messages):

Tilelang v0.1.0 Release, Community Engagement, High-Performance AI Kernels

Link mentioned: GitHub - tile-ai/tilelang: Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels: Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels - tile-ai/tilelang


GPU MODE ▷ #🍿 (42 messages🔥):

DeepSeek-R1 Model for GPU Kernel Generation, Collaboration on Project Popcorn, KernelBench Benchmark Discussion, Performance of Generated Kernels

Link mentioned: Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling | NVIDIA Technical Blog: As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is emerging. Also known as AI reasoning ...


GPU MODE ▷ #thunderkittens (2 messages):

Complex MatMul Performance, Reinterpreting ST to CST


GPU MODE ▷ #reasoning-gym (49 messages🔥):

Prompt Optimization, Dataset Evaluation, Model Formatting Flexibility, CodeSteer Release, Math Verification

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

OpenAI o1 and o3 availability, Groq-powered Llamas introduction, Nitro feature upgrade, Groq DeepSeek R1 70B performance

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (163 messages🔥🔥):

Chat History Issues, Provider Routing and Performance, Model Performance Comparisons, OpenRouter Updates, Credit and Subscription Concerns

Links mentioned:


Nous Research AI ▷ #general (149 messages🔥🔥):

Deep Hermes Model Release, Speculative Decoding in LM Studio, Calibration Dataset Characteristics, Quantization Strategy for LLMs, Hugging Face Agent Certification Course

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

LoRA Forget Less and Learn Less, Automated Capability Discovery, Diversity in Technology, Self-Exploration in AI Models

Links mentioned:


Nous Research AI ▷ #interesting-links (3 messages):

AI value systems, Superagent Hackathon, cRACK'd Den event, AI safety declaration summit

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

LoRA Forget Less and Learn Less, Diversity in Technology, Automated Capability Discovery, Foundation Models Self-Exploration

Links mentioned:


Notebook LM ▷ #announcements (1 messages):

Google Sheets Support, NotebookLM Feedback Survey

Link mentioned: Google sheets for NotebookLM: For those of you who have asked to be able to ingest Google Sheets into NotebookLM, we are looking for your feedback to better understand your use case!


Notebook LM ▷ #use-cases (15 messages🔥):

Health Tracking with Notebook LM, NotebookLM for Writing Assistants, Anki Flashcards Creation, Using NotebookLM for Resume Review, AI-Podcast Creation

Links mentioned:


Notebook LM ▷ #general (90 messages🔥🔥):

User Limits and Sharing Notebooks, Student Experiences with NotebookLM, Audio Features and Personalization, Feedback on Product Limits, Source Handling Issues

Links mentioned:


aider (Paul Gauthier) ▷ #general (56 messages🔥🔥):

OpenRouter access to o1 and o3, Aider multi-session support, Editor model for code editing, GPT-5 roadmap update, User experiences with o3-mini

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (24 messages🔥):

Implementing AI Agents in Testing, Aider Multi-Step Execution, Managing Aider Configuration, Using OpenRouter with Aider, Tracking Aider Configuration Sources

Links mentioned:


aider (Paul Gauthier) ▷ #links (9 messages🔥):

Aider Mention on LinkedIn, CodeSteer-v1 Discussion, PyCharm Plugin Feasibility, Microsoft Support for JetBrains, O3 High and 3.5 Sonnet PRs

Link mentioned: Paper page - CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance: no description found


Stability.ai (Stable Diffusion) ▷ #general-chat (85 messages🔥🔥):

Differences between SDXL and SD 1.5, Performance of Flux model, Distillation methods in AI models, Human preference benchmarks, Linux transition issues with ComfyUI

Link mentioned: segmind/Segmind-Vega · Hugging Face: no description found


MCP (Glama) ▷ #general (48 messages🔥):

Server Author Flair, Trust Issues with Crypto/NFTs, Code Review Process, Open Source LLM Models, Generative Dashboards with Clickhouse and Streamlit


Modular (Mojo 🔥) ▷ #general (1 messages):

eggsquad: new Modular job postings 👀


Modular (Mojo 🔥) ▷ #mojo (20 messages🔥):

stdlib meetings, Mojo language features, sum types and parameterized traits, List with trait as element type, Bug report submitted

Link mentioned: [BUG][stdlib] Limitation of parallelize with Mojo code that interacts with Python · Issue #3993 · modular/mojo: Bug description Actual behavior When a parallelized function call interacts with Python, it crashes at runtime under certain conditions. Referring to the code example below, struct Bar.start() uses...


Modular (Mojo 🔥) ▷ #max (24 messages🔥):

MAX -> Wasm compilation, Running Mojo programs, ONNX model execution, CUDA library issues, Mojo API considerations

Link mentioned: Modular: Build the future of AI with us and learn about MAX, Mojo, and Magic.


Latent Space ▷ #ai-general-chat (37 messages🔥):

VAE Reparameterization Trick, OpenAI IOI Paper, Scaled Cognition APT-1, Glean Agents Launch, OpenAI Roadmap Update

Links mentioned:


Torchtune ▷ #general (15 messages🔥):

Checkpointing in Torchtune, MLFlow Logger Integration, Distributed Inference with Torchtune, Crypto Portfolio Growth

Links mentioned:


Torchtune ▷ #dev (10 messages🔥):

Testing Assumptions, Gradient Accumulation Issue, Checkpointing Functionality, DPO Loss Calculation, Memory Management with Opt-in BWD

Links mentioned:


Torchtune ▷ #papers (4 messages):

RWKV RNN, Scaling Challenges of RNNs, State Space Models, Importance of Attention Mechanisms


Yannick Kilcher ▷ #general (5 messages):

Training ML models with limited datasets, Accessing research papers, Tinystories paper, Anna's Archive

Link mentioned: Anna’s Archive: no description found


Yannick Kilcher ▷ #paper-discussion (6 messages):

Reasoning over Latent Spaces, Multilingual Language Models, Llama-2 and Linguistic Bias, Latent Languages in Transformers

Link mentioned: Do Llamas Work in English? On the Latent Language of Multilingual Transformers: We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language mo...


Yannick Kilcher ▷ #agents (4 messages):

Tokenization Issues, Character Representation in LLMs


Yannick Kilcher ▷ #ml-news (12 messages🔥):

DeepScaleR Model, EU AI Funding, Thomson Reuters Copyright Case, OpenAI Roadmap Update, Literature Review Tools

Links mentioned:


tinygrad (George Hotz) ▷ #general (20 messages🔥):

CUDA backend on Windows, PR feedback and testing issues, Windows CI environment variable propagation, Testing failures with recursion vs iteration approach

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (5 messages):

Graph Scheduler Tests, Tinygrad vs PyTorch


LlamaIndex ▷ #blog (2 messages):

Open Source Engineer Position, Agentic Document Workflows, Nomic AI Embedding Model


LlamaIndex ▷ #general (17 messages🔥):

Building RAG systems, Batch processing PDFs, Creating query engine tools, Exhaustive RAG search methods, Vector databases


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

LLM Agents MOOC Hackathon, Global participation, Winning teams announcement, Top represented countries and universities, Hackathon website

Link mentioned: Tweet from Dawn Song (@dawnsongtweets): 🎉 Excited to announce the winning teams of LLM Agents MOOC Hackathon! We’re thrilled by the amazing participation and enthusiasm from the global AI community:🌍 ~3,000 participants from 127 countries...


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

Spring 2025 MOOC Launch, Advanced LLM Topics, Dawn Song Announcement, Course Attendance Stats

Link mentioned: Tweet from Dawn Song (@dawnsongtweets): Really excited to announce our Advanced LLM Agents MOOC (Spring 2025)!Building on the success of our LLM Agents MOOC from Fall 2024 (15K+ registered learners, ~9K Discord members, 200K+ lecture views ...


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (5 messages):

MOOC Curriculum Updates, Hackathon Participation


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (2 messages):

MooC student applications, Research subjects


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (3 messages):

DeepScaleR Model, Assignments Deadline

Link mentioned: Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.: A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team


Nomic.ai (GPT4All) ▷ #general (6 messages):

Steam Gift Card Giveaway, Voice Model Updates, TextWeb-UI Installation, Session Memory Management, Mobile Application Usability


Cohere ▷ #discussions (2 messages):

Login Issues, API Request Blocking


MLOps @Chipro ▷ #events (1 messages):

Podcasting Workshop, AI Voice Generation, ElevenLabs and PlayHT

Links mentioned:




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}