Frozen AI News archive

DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level

**DeepSeek** released **DeepSeek R1**, a significant upgrade over **DeepSeek V3** from just three weeks prior, featuring 8 models including full-size 671B MoE models and multiple distillations from **Qwen 2.5** and **Llama 3.1/3.3**. The models are MIT licensed, allowing finetuning and distillation. Pricing is notably cheaper than **o1** by 27x-50x. The training process used **GRPO** (reward for correctness and style outcomes) without relying on PRM, MCTS, or reward models, focusing on reasoning improvements through reinforcement learning. Distilled models can run on **Ollama** and show strong capabilities like writing **Manim code**. The release emphasizes advances in **reinforcement-learning**, **fine-tuning**, and **model-distillation** with a novel RL framework from DeepSeekMath.

Canonical issue URL

AI News for 1/17/2025-1/20/2025. We checked 7 subreddits, 433 Twitters and 34 Discords (225 channels, and 8019 messages) for you. Estimated reading time saved (at 200wpm): 910 minutes. You can now tag @smol_ai for AINews discussions!

We knew that we'd get an open weights release of DeepSeek at some point, and DeepSeek is already well known for their papers and V3 was the top open model in the world, but all our AI sources could not take their eyes off the DeepSeek R1 release today.

image.png

R1's performance which turned out to be leaps and bounds above DeepSeek V3 from literally 3 weeks ago:

image.png image.png

When we say "R1", it's ambiguous. DeepSeek actually dropped 8 R1 models - 2 "full" models, and 6 distillations on open models:

Other notables from the launch:

Surprises from the paper:


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

DeepSeek-R1 Model Developments

Benchmarking and Performance Comparisons

Reinforcement Learning in LLM Training

Open-Source Models and Distillation

AI Research Papers and Technical Insights

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek-R1 Distilled Models Showcase Exceptional SOTA Performance

Theme 2. DeepSeek-R1 Models Outprice OpenAI's High-Cost Tokens

Theme 3. DeepSeek-R1 Embraces Full MIT License for Models

Theme 4. DeepSeek-R1 Distilled Models Revolutionize Precision Benchmarks

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. DeepSeek-R1 Launches Open-Source Model at Hardware Cost

Theme 2. AI Autonomy in Job Applications with Browser-Use Tool

Theme 3. Critique of OpenAI's Marketing and AGI Promises

Theme 4. Criticism of Perplexity AI's Reliability and Bias Concerns


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. Open-Source LLM Rivalries

Theme 2. Code & Agentic Tools

Theme 3. RL & Reasoning Power-Ups

Theme 4. HPC & Hardware High Jinks

Theme 5. Partnerships & Policy Kerfuffles


PART 1: High level Discord summaries

Codeium (Windsurf) Discord


Perplexity AI Discord


Cursor IDE Discord


Nous Research AI Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


aider (Paul Gauthier) Discord


Stackblitz (Bolt.new) Discord


LM Studio Discord


Latent Space Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Notebook LM Discord Discord


MCP (Glama) Discord


Yannick Kilcher Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


Mozilla AI Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Wave 2 features, Cascade web and doc search, Cascade autogenerated memories, Performance improvements, Status updates

Links mentioned:


Codeium (Windsurf) ▷ #discussion (226 messages🔥🔥):

Windsurf Error Messages, Deepseek R1 Release, Codeium Features, User Support Issues, API Key Usage in Windsurf

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (577 messages🔥🔥🔥):

Windsurf Performance Issues, Deepseek R1 Discussion, Cascade History Management, User Experience with Long Chats, AI Integration with Development Tools

Links mentioned:


Perplexity AI ▷ #general (624 messages🔥🔥🔥):

Perplexity's Model Changes, User Feedback and Issues, New AI Tools and Alternatives, DeepSeek-R1 Integration, User Interactions and Community Support

Links mentioned:


Perplexity AI ▷ #sharing (24 messages🔥):

RedNote App, FBI Malware Uninstallation, Gaia Sky Scan Co., Perplexity AI Acquisition, ISO27001 and NIS2 Controls

Links mentioned:


Perplexity AI ▷ #pplx-api (3 messages):

CrewAI models, Litellm monkey fix, Unnecessary pings


Cursor IDE ▷ #general (588 messages🔥🔥🔥):

Cursor Performance Issues, DeepSeek R1, Agent Functionality Comparison, Slow Request Concerns, GitHub Integrations

Links mentioned:


Nous Research AI ▷ #general (522 messages🔥🔥🔥):

DeepSeek-R1, AI and Crypto, MiniCPM-o 2.6, Reasoning Models, Reinforcement Learning

Links mentioned:


Nous Research AI ▷ #ask-about-llms (36 messages🔥):

High accuracy handwritten text OCR models, Contrast between MOEs and dense models, Efficiency of structured sparsity in AI models, Learning rate scheduling in LLM training

Link mentioned: Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines | NVIDIA Technical Blog: Deep learning is achieving significant success in various fields and areas, as it has revolutionized the way we analyze, understand, and manipulate data. There are many success stories in computer&#82...


Nous Research AI ▷ #research-papers (2 messages):

Climate Change Impact on Agriculture, Mind Evolution in LLMs

Link mentioned: Tweet from AK (@_akhaliq): Google presents Evolving Deeper LLM ThinkingControlling for inference cost, we find that Mind Evolution significantly outperforms other inference strategies such as Best-of-N and Sequential Revision i...


Nous Research AI ▷ #interesting-links (4 messages):

Liquid AI LFM-7B, Recurrent models influence, New business model, Mistral Ministral 3B, Codestral 2501

Link mentioned: Introducing LFM-7B: Setting New Standards for Efficient Language Models: The world’s best-in-class English, Arabic, and Japanese model, native in French, German, and Spanish, optimized to be the substrate for private enterprise chat, code, fast instruction following, and a...


Nous Research AI ▷ #research-papers (2 messages):

Collaborative Research on Climate Change, Google's Mind Evolution in LLMs

Link mentioned: Tweet from AK (@_akhaliq): Google presents Evolving Deeper LLM ThinkingControlling for inference cost, we find that Mind Evolution significantly outperforms other inference strategies such as Best-of-N and Sequential Revision i...


Unsloth AI (Daniel Han) ▷ #general (450 messages🔥🔥🔥):

DeepSeek R1 Models, Unsloth Training Script, Quantization Methods, Windows Installation Issues, VTube Models and Rigging

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (11 messages🔥):

OpenRouter for LLM comparison, Open source web UI options, Running models locally, Flowise as a chat framework

Link mentioned: GitHub - open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...): User-friendly AI Interface (Supports Ollama, OpenAI API, ...) - open-webui/open-webui


Unsloth AI (Daniel Han) ▷ #help (77 messages🔥🔥):

Fine-tuning Models, Model Saving Techniques, Performance Issues with Models, Inference Sampling, Using Unsloth Docs

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (20 messages🔥):

Chatterbox Dataset Builder, Sky-T1 Model Performance, Synthetic Datasets, LLM Integration, Docker-Compose Setup

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (8 messages🔥):

Dataset usage for model training, LLM Research Cohort at Cohere For AI, Deep Learning resources for beginners

Link mentioned: Tweet from Mayank Bhaskar (@cataluna84): From the BIRDS(Beginners in Research Driven Studies) organized by @akankshanc of @cohere Open Science Community, we're thrilled to announce our new LLM Cohort! 🎉 🚀This isn't just another lea...


Eleuther ▷ #general (154 messages🔥🔥):

RWKV Model Discussions, Model Quantization Formats, Mixture of Experts (MoE), Performance of AI Models, AI Development and Career Sharing

Links mentioned:


Eleuther ▷ #research (297 messages🔥🔥):

DeepSeek R1, Gradient Spikes, Optimization Techniques, Titan Models and Memorization, RL Training in LLMs

Links mentioned:


Eleuther ▷ #interpretability-general (4 messages):

Steering LLMs with SAE features, Open source steering libraries

Links mentioned:


Eleuther ▷ #lm-thunderdome (63 messages🔥🔥):

Qwen2.5 performance discrepancies, Few-shot prompting techniques, VLLM evaluation issues, Quantization effects on performance, MMLU-PRO evaluation insights

Links mentioned:


Eleuther ▷ #multimodal-general (1 messages):

phi 3 and 3.5 vision, MPS device errors


Eleuther ▷ #gpt-neox-dev (8 messages🔥):

Host RAM Requirements, Vocab Size Optimization, 3D Parallelism with ZeRO Stage 1, Issue Raising for Hangs, Updating Markdown Files

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (237 messages🔥🔥):

DeepSeek-R1 Release, Kimi 1.5 Paper Insights, GRPO and RLHF, Benchmarking Evaluations, Impacts of MIT Licensing

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (27 messages🔥):

O1 pro streaming summary, Test-time search vs forward passes, Use of self-consistency in reasoning, Gflownet in training O1, Asymmetry in RL setups

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (51 messages🔥):

MosaicAI Departures, OpenAI Transparency Issues, Epoch AI and FrontierMath, Perceptron Inc's New Venture, AGI Buzz

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (76 messages🔥🔥):

Molmo AI, DeepSeek Model Insights, VLM Performance, Trae AI IDE, Chinese Startup Landscape

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

Vagueposting, AI Moats, Amanda Askell

Link mentioned: Tweet from Minh Nhat Nguyen (@menhguin): the only moat left in AI is amanda askell


Interconnects (Nathan Lambert) ▷ #rl (6 messages):

Reinforcement Learning for Robotics, Vision & Language Models, Computer Vision Reinforcement Learning, Robotics Perception Models

Link mentioned: Tuning computer vision models with task rewards: Misalignment between model predictions and intended usage can be detrimental for the deployment of computer vision models. The issue is exacerbated when the task involves complex structured outputs, a...


Interconnects (Nathan Lambert) ▷ #reads (21 messages🔥):

Post-Training for AI Applications, Challenges with Devin vs. Cursor, AI Researchers' Overestimation, Reinforcement Learning (RL) Discussions, SOP-Agents Framework

Links mentioned:


Interconnects (Nathan Lambert) ▷ #lectures-and-projects (13 messages🔥):

RLHF Book Progress, Outcome Reward Models, CS329A Course Overview, Reward Modeling Techniques, Value Networks

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (3 messages):

Meta Glasses Integration, WhatsApp Bot Functionality

Link mentioned: GitHub - josancamon19/meta-glasses-gemini: Meta + Rayban Glasses whatsapp bot integration: Meta + Rayban Glasses whatsapp bot integration. Contribute to josancamon19/meta-glasses-gemini development by creating an account on GitHub.


Interconnects (Nathan Lambert) ▷ #policy (3 messages):

Executive Order on AI, NAIRR Event

Link mentioned: Tweet from Charles Foster (@CFGeek): The US President has rescinded the previous administration’s major Executive Order on AI (EO 14110).


aider (Paul Gauthier) ▷ #announcements (1 messages):

Aider v0.72.0 Release, DeepSeek R1 Support, Kotlin Syntax Support, File Writing Enhancements, Bugfix Updates


aider (Paul Gauthier) ▷ #general (334 messages🔥🔥):

DeepSeek R1 performance, Aider benchmarks, Kimi k1.5 model, Data privacy in AI models, Local model usage

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (74 messages🔥🔥):

Aider Usage with Language Models, OpenRouter vs Anthropic API, DeepSeek Model Issues, File Management in Aider, API Key Configuration Problems

Links mentioned:


Stackblitz (Bolt.new) ▷ #announcements (1 messages):

Bolt.new update, Setup issues, Prompt accuracy

Link mentioned: Tweet from bolt.new (@boltdotnew): Bolt 🧠 update:bolt․new is now more accurate at picking & configuring the right template — making the setup spot on, from the first prompt, every time!


Stackblitz (Bolt.new) ▷ #discussions (367 messages🔥🔥):

Bolt error loops, RLS policy issues, Stripe integration, Payment processing options, Community support and resources

Links mentioned:


LM Studio ▷ #announcements (1 messages):

LM Studio 0.3.7 Release, DeepSeek R1 Support, New Features in Mission Control, KV Cache Quantization Updates

Links mentioned:


LM Studio ▷ #general (179 messages🔥🔥):

Model Performance Comparisons, File Attachment in LM Studio, DeepSeek R1 Model Discussion, Using Multiple Images with Models, LM Studio Updates and Features

Links mentioned:


LM Studio ▷ #hardware-discussion (186 messages🔥🔥):

NVIDIA Digits, GPU Comparisons, Quality of Model Performance, LM Studio vs Ollama, Kaggle Notebooks

Links mentioned:


Latent Space ▷ #ai-general-chat (97 messages🔥🔥):

DeepSeek R1 Release, Transcription Tools, OpenAI Operator Leaks, Liquid Foundation Model, Claude AI Alignment Perspectives

Links mentioned:


Latent Space ▷ #ai-announcements (4 messages):

O1 podcast discussion, DeepSeek v3, SGLang framework, Mission Critical Inference, Kubernetes challenges

Links mentioned:


Latent Space ▷ #ai-in-action-club (220 messages🔥🔥):

AI tooling for accessibility, MCP server framework, Whisper for STT, YouTube captions, Live captions in Windows 11

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (4 messages):

DeepSeek R1 Launch, Performance Comparison with OpenAI, Censorship-Free Access, Llama Endpoints Shutdown

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (258 messages🔥🔥):

DeepSeek R1 Launch, OpenAI Model Rate Limits, User Experience with DeepSeek, Web Search API in OpenRouter, Reasoning Content Access

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (249 messages🔥🔥):

Using Stable Diffusion for Photorealism, E-commerce Text-To-Image Models, Artistic Style Consistency in LoRA Training, Image Generation Issues and Solutions, AI Tools for Background Editing

Links mentioned:


Notebook LM Discord ▷ #use-cases (27 messages🔥):

Podcast creation and voice integration, Gemini Advanced Deep Research workflow, Using NotebookLM for college courses, Experiences with sourcing tools, Community introductions

Links mentioned:


Notebook LM Discord ▷ #general (212 messages🔥🔥):

Google One AI Premium, NotebookLM Plus, Podcast Generation Issues, Document Uploading Issues, Language Support in Interactive Podcast

Links mentioned:


MCP (Glama) ▷ #general (193 messages🔥🔥):

MCP server feedback, Roo Cline features, Rate limits with Claude, Chat log summarization, User interface concerns in MCP clients

Links mentioned:


MCP (Glama) ▷ #showcase (30 messages🔥):

Figma MCP contribution, MCP Logic Calculator, LibreChat performance, TestFlight feedback, Anthropic model compatibility

Links mentioned:


Yannick Kilcher ▷ #general (167 messages🔥🔥):

GPU vs CPU Performance, Agent Learning Models, Self-Adaptive LLMs, AI Tools Evaluation, Online Community Dynamics

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (37 messages🔥):

Lightning Attention Paper Discussion, rStar-Math Research Findings, Tensor Product Attention (TPA) Mechanics, Linear Tensor Product Lightning Attention, DeepSeek's Group Relative Policy Optimization

Links mentioned:


Yannick Kilcher ▷ #agents (3 messages):

Titans, Adaptive Transformers, RNN testing, 760M model performance, BABILong

Link mentioned: no title found: no description found


Yannick Kilcher ▷ #ml-news (15 messages🔥):

Microsoft OpenAI partnership concerns, AI security vulnerability findings, AI compliance tools for trading, TikTok ownership and ban implications, FrontierMath funding controversies

Links mentioned:


Cohere ▷ #discussions (81 messages🔥🔥):

Konkani Language AI Model, Cohere's Accessibility, Project Ideas, API Access and Limitations

Link mentioned: Once you have an error uploading a model, your account (web and api) corrupts and Dataset/Model environment will no longer work · Issue #632 · cohere-ai/cohere-python: Using your example with your CSV file. import cohere co = cohere.Client() # upload a dataset my_dataset = co.datasets.create( name="datasettest", data=open("./Arts.Class.1000.csv",...


Cohere ▷ #questions (11 messages🔥):

Billing Issues, AI Behavior Management, Invoices and Receipts, AI Project Feedback


Cohere ▷ #api-discussions (12 messages🔥):

Command-R Model Versioning, Embed Job Concurrent Limits, Dify.ai Integration Issues


Cohere ▷ #cmd-r-bot (32 messages🔥):

Cohere Models Overview, Tool Calling and Code Generation, Understanding AGI


Cohere ▷ #cohere-toolkit (4 messages):

Cohere's Math Performance, Limitations of LLMs, Tool Usage Tips


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

MOOC course confirmation, Spring mailing list


Mozilla AI ▷ #announcements (1 messages):

Document to Podcast blueprint, Open source projects, Community engagement



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}