Frozen AI News archive

not much happened today

**DeepSeek-R1 surpasses OpenAI in GitHub stars**, marking a milestone in open-source AI with rapid growth in community interest. **AlphaGeometry2 achieves gold-medalist level performance with an 84% solving rate on IMO geometry problems**, showcasing significant advancements in AI reasoning. **LangChain releases a tutorial for building AI agents in JavaScript**, enhancing developer capabilities in agent deployment. Reflections on **Anthropic's Claude model** reveal early access and influence on AI development timelines. Lighthearted AI humor includes calls to ban second-order optimizers and challenges in web development longevity. The AI Engineer Summit 2025 workshops were announced, continuing community engagement and education.

AI News for 2/6/2025-2/7/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (210 channels, and 6269 messages) for you. Estimated reading time saved (at 200wpm): 638 minutes. You can now tag @smol_ai for AINews discussions!

For the curious, the SmolLM2 paper, the AlphaGeometry 2 paper and the AIME2025 results were candidate stories for today.


Workshops for AI Engineer Summit 2025 were announced with the Latent Space Pydantic AI episode. All Workshops for AI Engineer 2024 are now released!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap


AI Reddit Recap

/r/LocalLLaMA Recap

Theme 1. DeepSeek Model Developments and Market Impact

Theme 2. Dolphin3.0-R1: Performance and Community Insights

Theme 3. OpenAI Chain of Thought Updates Triggered by DeepSeek

Theme 4. Kokoro WebGPU: Local Real-time TTS Innovation

Theme 5. Cerebras Mistral Le Chat: Instant Inference Revolution

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. Theoretical Insights into the Superiority of RNNs Over Feedforward Models

Theme 2. o3-mini's Updated Chain of Thought: Clarifying AI Reasoning

Theme 3. MistralAI Launches Fast, Competitive Mobile LLM Application


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Here's a summary of key discussion themes across the provided Discord channels:

Theme 1. DeepSeek Models: Performance, Security, and Open Source Buzz

Theme 2. Gemini Models: Image Generation Glory and API Integration Teasers

Theme 3. Efficiency and Optimization Frenzy: Squeezing Performance from GPUs and Models

Theme 4. AI Agents and Tooling: Navigating the Agentic Landscape

Theme 5. Ethical Quandaries and Oversight: Navigating the Murky Waters of AI

X.ai Grok-beta

Theme 1: AI Model Capabilities and Concerns

Theme 2: AI Tools and Frameworks

Theme 3: AI Development and Optimization

Theme 4: AI in Specialized Fields

Theme 5: AI Community and Policy

X.ai Grok-2

Theme 1: Model Performance and Optimization

Theme 2: AI Model Security and Reliability

Theme 3: AI Tool Integration and Workflow Efficiency

Theme 4: AI Model Capabilities and Applications

Theme 5: AI Ethics and Regulation

Claude 3.5 Sonnet

1. DeepSeek Security and Performance Concerns

2. Meta's Book Torrenting and Cerebras-Mistral Partnership

3. Breakthrough Research in AI Models

4. Developer Tools and Infrastructure Updates

o1-mini-2024-09-12

Theme 1. AI Models Battle Greatness and Glitches

Theme 2. AI Tools and Integration Innovations

Theme 3. Performance Hacks and GPU Glory

Theme 4. AI Research and Interpretability Insights

Theme 5. Policy, Security, and Ethical AI Developments

o1-preview-2024-09-12

Theme 1. New AI Models Make a Splash

Theme 2. Developers Navigate AI Tool Turbulence

Theme 3. AI Security Breaches Cause Alarm

Theme 4. AI Ethics and Regulations Tighten

Theme 5. Community Collaborations Fuel AI Progress

o1-2024-12-17

Theme 1. Model Rivalries: GPT-4, DeepSeek, and Aider Power-Ups

Theme 2. AI for Creating: Art, 3D Dogs, and YouTube Summaries

Theme 3. Security Stumbles and Bans: DeepSeek, Altman, and the EU

Theme 4. GPU Acceleration: Big Gains, Kernel Fusions, and HPC Feats

Theme 5. Agents, Tools, and the AI Frontier

o3-mini-2025-01-31-low

1. Gemini and DeepSeek Innovations

2. LM Studio Performance and Quantization

3. AI Agent Frameworks and Integrations

4. GPU Optimization and Triton Advances

5. NotebookLM Capabilities and Limitations

o3-mini-2025-01-31-medium

1. DeepSeek & Security Concerns

2. GPU and Low-Level Optimization

3. LLM Agents and Summarization Tools

4. API and Integration Challenges

5. Model Interpretability and Research

o3-mini-2025-01-31-high

1. DeepSeek Innovations & Security Issues

2. Gemini Multimodal Capabilities

3. GPU and Triton Optimizations

4. LLM Agents & Workflow Enhancements

5. OpenRouter and API Integrations

GPT-4o 0513

1. Gemini AI Image Generation

2. DeepSeek Model Issues

3. GPU Optimization Techniques

4. AI Agents and Tools

5. AI Model Benchmarking

GPT-4o 0806

1. DeepSeek Model Performance and Security Concerns

2. AI Art Generation and Prompt Techniques

3. Optimizing GPU and Model Inference

4. Open Source AI and Community Contributions

5. LLM Model Limitations and Improvements


PART 1: High level Discord summaries

OpenAI Discord


Stability.ai (Stable Diffusion) Discord


LM Studio Discord


Cursor IDE Discord


Perplexity AI Discord


Codeium (Windsurf) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


MCP (Glama) Discord


HuggingFace Discord


GPU MODE Discord


OpenRouter (Alex Atallah) Discord


Yannic Kilcher Discord


Notebook LM Discord


Nomic.ai (GPT4All) Discord


Eleuther Discord


LLM Agents (Berkeley MOOC) Discord


LlamaIndex Discord


Modular (Mojo 🔥) Discord


Cohere Discord


tinygrad (George Hotz) Discord


Torchtune Discord


DSPy Discord


Gorilla LLM (Berkeley Function Calling) Discord


MLOps @Chipro Discord


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

OpenAI ▷ #ai-discussions (729 messages🔥🔥🔥):

Gemini AI Image Generation, AI Art and Human Perception, AI Setup Recommendations, DeepSeek Performance Comparison, AI Model Limitations

Links mentioned:


OpenAI ▷ #gpt-4-discussions (7 messages):

User Reactions to GPT-4, AVM Wait Anxiety


OpenAI ▷ #api-discussions (7 messages):

Word counting in Python, Controlling AI output, Batch API assistance, Bot response stability, Indirect prompt injection vulnerability


Stability.ai (Stable Diffusion) ▷ #general-chat (472 messages🔥🔥🔥):

AI Art and Prompts, 3D Model Generation, AI Models and Platforms, AI Tools for Art, US Government and AI Policy

Links mentioned:


LM Studio ▷ #general (313 messages🔥🔥):

DeepSeek R1 Qwen 14B Performance, Optimizing GPU Offload, Template Prompts in LM Studio, Quantization and Model Comparisons, Uncensored Vision LLMs

Links mentioned:


LM Studio ▷ #hardware-discussion (59 messages🔥🔥):

Memtest86 and stress testing, LM Studio settings for performance, ML performance on M1 Max and M2 Ultra, GPU overclocking for inference speed, Standardized questions for benchmarking

Links mentioned:


Cursor IDE ▷ #general (335 messages🔥🔥):

MCP Servers, Cursor Model Comparisons, Cursor Features and Configuration, AI Workflow Improvements, GitHub Copilot Agent

Links mentioned:


Perplexity AI ▷ #announcements (1 message):

Perplexity file uploads, Image uploads, Expanded context window


Perplexity AI ▷ #general (269 messages🔥🔥):

Perplexity Pro features, R1 model performance, Context limits, DeepSeek model, Model selection and API usage

Links mentioned:


Perplexity AI ▷ #sharing (15 messages🔥):

Open-Source Strategy, EU AI Regulations, Super-Earth Discoveries, CME Gap in Trading, Carbon Dating Definition


Perplexity AI ▷ #pplx-api (7 messages):

Sonar API Usage, API Source Limitations, Chatbot Context Management, Case Study Discussion


Codeium (Windsurf) ▷ #discussion (40 messages🔥):

Codelens in VSCode, Model Credit System in Extensions, Supercomplete Support for JetBrains, Extension Performance Issues, Server Activity Concerns

Link mentioned: Supercomplete for Jetbrains | Feature Requests | Codeium: I think jetbrains lack the most in the field of "consecutive action proposals". Supercomplete would be a thing that would be first-of-its-kind in this


Codeium (Windsurf) ▷ #windsurf (245 messages🔥🔥):

Gemini 2.0 Features, Windsurf Usage Challenges, Model Performance Comparisons, User Experiences with Credits, Windsurf Development and Requests

Links mentioned:


aider (Paul Gauthier) ▷ #announcements (1 message):

Aider v0.74.0, Bugfixes, Docker improvements, Support for new models, Markdown generation

Link mentioned: Release history: Release notes and stats on aider writing its own code.


aider (Paul Gauthier) ▷ #general (193 messages🔥🔥):

Job Transition, Code Development in Rust, DeepSeek Security Issues, OpenAI Data Breach, Aider Performance and Features

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (32 messages🔥):

Aider Commands, Using OpenRouter, Architect Mode Behavior, Voice Chat Utilization, Aider Installation Issues

Link mentioned: OpenRouter: aider is AI pair programming in your terminal


Nous Research AI ▷ #general (206 messages🔥🔥):

GRPO Performance, Model Quantization, Conciseness in Reasoning, Using LLMs for Reward Functions, Benchmarking Models

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

LIMO Model Performance, AI Oversight Challenges

Links mentioned:


Nous Research AI ▷ #interesting-links (8 messages🔥):

Meta's Torrenting Practices, Mistral's Collaboration with Macron, Cerebras Powers Mistral's Le Chat, Mistral's Performance Comparison, UAE's Investment Plans

Links mentioned:


MCP (Glama) ▷ #general (148 messages🔥🔥):

MCP CLI usage, MCP Server Development, Building Docker Images for MCP, Embedding Models Performance, Using MCP with Various LLMs

Links mentioned:


MCP (Glama) ▷ #showcase (64 messages🔥🔥):

MCP Web Research Setup, Tool Support and Challenges, Sampling Support in MCP, Claude's Research Framework, Integration of Tools

Links mentioned:


HuggingFace ▷ #general (56 messages🔥🔥):

DeepSeek R1, AI agents and summarization, Frugal AI Challenge, Slither-audited-smart-contracts dataset, NotebookLM

Links mentioned:


HuggingFace ▷ #today-im-learning (2 messages):

DeepSeek Download, Creating AI Agents, Agent Framework


HuggingFace ▷ #i-made-this (90 messages🔥🔥):

FastAPI tool calling, Model similarity study, MLPwned project, Kokoro TTS integration

Links mentioned:


HuggingFace ▷ #computer-vision (8 messages🔥):

Uncertainty Quantification in VLMs, Open-source alternatives to GPT-4, InternVL2.5 MPO overview, Qwen 2.5 VL model in manufacturing


HuggingFace ▷ #NLP (4 messages):

NLP Transfer Learning, Japanese BERT Model, Twitter Corpus, Data Source Selection


HuggingFace ▷ #smol-course (4 messages):

Smol Agents Course, Resource Sharing


HuggingFace ▷ #agents-course (1 message):

Agent as Assistant, FreeCAD Methodology, Dataset Automation, DeepSeek R1 Integration

Link mentioned: Paper page - Executable Code Actions Elicit Better LLM Agents


HuggingFace ▷ #open-r1 (17 messages🔥):

Open-R1 vs SearX, Math-500 Evaluation, API Provider Challenges, H200 vs A100 Performance, R1 Traces Dataset

Links mentioned:


GPU MODE ▷ #general (3 messages):

Economizing AI research, Reinforcement learning paradigms, Optimizer interactions, Data formulation, Sampling rollouts


GPU MODE ▷ #triton (14 messages🔥):

Open Source Triton Contribution, Improving Triton Code Performance, Triton Implementations on GitHub, Debugging Triton Programs, Atomic Operations in Triton

Links mentioned:


GPU MODE ▷ #cuda (22 messages🔥):

Triton Performance Optimization, Kernel Fusion in CUDA Streams, Memory Bandwidth Analysis, PTX Code Extraction, Unit Testing for GPU Code

Link mentioned: Triton — Codefile: Create collaborative code files online for your technical interviews, pair programming, teaching, etc.


GPU MODE ▷ #torch (3 messages):

debugging performance in PyTorch, torch.profiler, memory tooling for GPU issues

Link mentioned: Understanding GPU Memory 1: Visualizing All Allocations over Time: During your time with PyTorch on GPUs, you may be familiar with this common error message:


GPU MODE ▷ #algorithms (9 messages🔥):

Grouped GEMM Implementation, Performance of cuOpt LP Solver, GPU Architecture Performance, Batch Processing for Small LPs, Warp Divergence in GPU Solvers

Link mentioned: Accelerate Large Linear Programming Problems with NVIDIA cuOpt | NVIDIA Technical Blog: The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method (IPM). The introduction of primal-dual...


GPU MODE ▷ #cool-links (7 messages):

Keep Your Internal Pressure High, C++ Concepts, C++ Standards, CUDA Support, PyTorch and template constraints

Links mentioned:


GPU MODE ▷ #jobs (1 message):

vish_44: Absolutely loved the GPU Glossary!


GPU MODE ▷ #beginner (7 messages):

Video Frame Classification, Memory Optimization Techniques, Profiler Issues with CUDA


GPU MODE ▷ #self-promotion (11 messages🔥):

Flash Attention with CUDA, Fused SwiGLU kernel, Performance benchmarks on GPUs, Self-Attention and MLP optimization

Links mentioned:


GPU MODE ▷ #avx (24 messages🔥):

Optimizing Adam Implementation, FSDP2 CPU Offloading, PyTorch SIMD instructions, Numerical Precision in Optimization, Memory Bottlenecks in HPC

Links mentioned:


GPU MODE ▷ #thunderkittens (1 message):

DSM utilization, Memory operations in ThunderKittens

Link mentioned: ThunderKittens/include/ops/warp/memory/util/tma.cuh at main · HazyResearch/ThunderKittens: Tile primitives for speedy kernels. Contribute to HazyResearch/ThunderKittens development by creating an account on GitHub.


GPU MODE ▷ #reasoning-gym (58 messages🔥🔥):

re-arc dataset development, Tsumego puzzles implementation, scoring methodology for answers, self-reference logic puzzles, reasoning_gym updates

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Authentication issues, Reasoning tokens visibility


OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

Chat-Thyme, Discord bots, OpenAI compatibility, Search capabilities with Exa


OpenRouter (Alex Atallah) ▷ #general (129 messages🔥🔥):

Downtime Issues, DeepSeek R1 Differences, Gemini Model Capabilities, OpenRouter API Usage, Reasoning Content Handling

Links mentioned:


Yannic Kilcher ▷ #general (35 messages🔥):

Anthropic Code Leak, OpenAI Trademark Filing, Dolphin 3.0 Model Release, Collaboration on Synthetic Dataset, RL and LLM Resources

Links mentioned:


Yannic Kilcher ▷ #paper-discussion (76 messages🔥🔥):

OmniHuman Framework, DeepSeek's AI Chips, Sparse Autoencoders Research, Linear Probes Investigation, Chinese HBM Production

Links mentioned:


Yannic Kilcher ▷ #agents (2 messages):

Reinforcement Learning for AI agents, VectorDB for memory storage, Genuine RL vs evaluation frameworks, Adaptive agent behavior, RL papers for agentic frameworks


Yannic Kilcher ▷ #ml-news (6 messages):

GitHub Copilot Agent Mode, Meta PARTNR Collaboration Video, AlphaGeometry2, Machine Learning without LLM, Stake Promotion

Links mentioned:


Notebook LM ▷ #use-cases (13 messages🔥):

Using NotebookLM for Poetry Analysis, Challenges Reviewing Multiple Documents, Case Study Summarization, AI in RPG Game Reviews, Utilizing AI for Medical Jargon


Notebook LM ▷ #general (69 messages🔥🔥):

NotebookLM Sharing Issues, Gemini 2.0 Capabilities, Notebook Creation Limit, Document Reading Functionality, Source Footnote Visibility

Links mentioned:


Nomic.ai (GPT4All) ▷ #general (50 messages🔥):

LocalDocs functionality, Model memory limitations, Use of historical chat data, Debugging model setup, User feedback on interface improvements

Links mentioned:


Eleuther ▷ #announcements (2 messages):

Image Classifiers and Concept Erasure, Skip Transcoders vs Sparse Autoencoders, Quadratic Feature Removal Methods

Links mentioned:


Eleuther ▷ #general (15 messages🔥):

Using Accelerate with DeepSpeed, Stable Chaos Model, CLIP Fine-Tuning for Different Languages, Interpretability/Explainability Resources, Linear Attention Improvement

Links mentioned:


Eleuther ▷ #research (29 messages🔥):

Learning coefficients from data, Quadratic fitting and trust regions, User preferences and reward models, AI reasoning framework, Token prediction and MTP paper


Eleuther ▷ #lm-thunderdome (2 messages):

Turkish MMLU Config Update, Main Evaluation Modifications

Link mentioned: Turkish mmlu Config Update by ArdaYueksel · Pull Request #2678 · EleutherAI/lm-evaluation-harness: Structural Change now matches Huggingface Dataset Card. Before it was 0-4 for class labels, now A-E. Config change addresses it.


Eleuther ▷ #gpt-neox-dev (1 message):

Query Repetition


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (46 messages🔥):

Certificate Issuing Confusion, Article Assignment Requirements, Quizzes Submission Deadlines, Email Communication Hiccups, Course Enrollment and Accessibility

Links mentioned:


LlamaIndex ▷ #blog (3 messages):

YouTube Summarization Bot, LlamaParse Gemini 2.0


LlamaIndex ▷ #general (33 messages🔥):

Multi-Agent Workflow with Tavily, Llama Index Node Editor Playground, Troubles with Image Descriptions using Ollama, Custom Prompt Templates for FunctionAgent, Token Counting in LLM Workflows

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (13 messages🔥):

LinkedList iterator implementation, Mojo Style Guide, Mojo Documentation, Compiler and Undefined Behavior


Modular (Mojo 🔥) ▷ #max (6 messages):

MAX Graphs in MAX-nightly, Python MAX Graph API, Mojo MAX Graph API support

Links mentioned:


Cohere ▷ #discussions (7 messages):

Using Accelerate with DeepSpeed, Cohere Free API Rate Limits, Command-Medium Model Status, Job Application Advice

Link mentioned: Working with Cohere's API and SDK — Cohere: Cohere's NLP platform provides customizable large language models and tools for developers to build AI applications.


Cohere ▷ #api-discussions (6 messages):

LibreChat API Endpoints, Cohere Base URL, Curl Testing


Cohere ▷ #cmd-r-bot (5 messages):

Febryanvaldo's Commands, Cmd R Bot Responses


tinygrad (George Hotz) ▷ #general (14 messages🔥):

HEVC cuviddec Location, LLVM and Z3 Dependency, YAML File Formatting, Tinygrad CPU Speed Project, LLM Browser Demo Testing

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (1 message):

Discord Rules Update, ChatGPT Feedback


Torchtune ▷ #general (3 messages):

Hugging Face Tokenizers, Torchtune Configuration

Link mentioned: HF tokenizers: initial base tokenizer support by ebsmothers · Pull Request #2350 · pytorch/torchtune: Fixes #2212. This is an initial PR to support general tokenizers from Hugging Face via a tokenizer.json file. This is just a starting point to parse relevant JSON files, infer BOS and EOS, and defin...


DSPy ▷ #general (2 messages):

DSPy release schedule, Task simplification in DSPy


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (2 messages):

RAFT method for synthetic data, Prompt quantity for synthetic data generation, Using Llama 7B for synthetic dataset, Custom templates for synthetic data, CoT prompts and accuracy


MLOps @Chipro ▷ #events (1 message):

MLOps Workshop, Feature Store, GCP with BigQuery, Simba Khadder, Cloud DataProc

Link mentioned: MLOps Workshop: Building a feature store on GCP with BigQuery: Join our 1-hr webinar with Simba Khadder as he demos building a feature store on GCP with Bigquery, BigLake, and Cloud DataProc!


{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}