Frozen AI News archive

not much happened to end the year

**Reinforcement Fine-Tuning (RFT)** is introduced as a **data-efficient** method to improve **reasoning in LLMs** using minimal **training data** with strategies like **First-Correct Solutions (FCS)** and **Greedily Diverse Solutions (GDS)**. **DeepSeek-V3**, a **671B parameter MoE language model** trained on **14.8 trillion tokens** with **FP8 mixed precision training**, highlights advances in large-scale models and open-source LLMs. Predictions for **AI in 2025** include growth in **smaller models**, **multimodality**, and challenges in **open-source AI**. The impact of AI on software development jobs suggests a need for **higher intelligence** and **specialization** as AI automates low-skilled tasks. Enhancements to **CodeLLM** improve coding assistance with features like **in-place editing** and **streaming responses**. **Natural Language Reinforcement Learning (NLRL)** offers better interpretability and richer feedback for AI planning and critique. AI hiring is growing rapidly with startups seeking strong engineers in **ML** and **systems**. New AI-powered tools such as **Rivet**, **Buzee**, and **Konfig** improve real-time applications, search, and SDK generation using technologies like **Rust** and **V8 isolates**.

AI News for 12/30/2024-12/31/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (215 channels, and 1948 messages) for you. Estimated reading time saved (at 200wpm): 238 minutes. You can now tag @smol_ai for AINews discussions!

In case you are lacking in "Year In Review" type content, you might enjoy the Latent.Space 2024 Year in Review and 2025 AI Engineer Reading List.


AInews ad slots are open for 2025! Email [email protected] cc [email protected] to get your stuff in front of 30k AI Engineers daily.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Models and Research

AI Predictions and Trends

AI Tools and Development

AI Industry and Employment

AI Policy, Ethics, and Society


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek V3: Hardware Requirements and Performance

Theme 2. Alibaba's LLM Price Cuts: A Disruptive Move

Theme 3. Qwen: The Preferred LLM for Varied Applications

Theme 4. DeepSeek in 2024: Influence and Market Penetration

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. DeepSeek Versus OpenAI o1: Disputed Claims and Community Reactions

Theme 2. RAG for Email Knowledge Retention: Privacy Concerns & Implementations


AI Discord Recap

A summary of Summaries of Summaries by o1-mini-2024-09-12

Theme 1. AI Model Performance Battles Intensify

Theme 2. AI Tools and Platform Enhancements

Theme 3. Data Privacy and AI Ethics Concerns

Theme 4. Hardware and GPU Optimization Strategies

Theme 5. Technical Issues and Community Support Challenges


PART 1: High level Discord summaries

Codeium (Windsurf) Discord


Nous Research AI Discord


OpenAI Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Unsloth AI (Daniel Han) Discord


Stackblitz (Bolt.new) Discord


Cursor IDE Discord


OpenRouter (Alex Atallah) Discord


Interconnects (Nathan Lambert) Discord


Notebook LM Discord Discord


GPU MODE Discord


Perplexity AI Discord


Modular (Mojo 🔥) Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


LlamaIndex Discord


Latent Space Discord


Cohere Discord


tinygrad (George Hotz) Discord


Axolotl AI Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LAION Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The OpenInterpreter Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium (Windsurf) ▷ #discussion (60 messages🔥🔥):

User Prompt Credits Issues, Flex Credits Delays, Windows Compatibility Concerns, Code Completion Context, Subscription Confusions


Codeium (Windsurf) ▷ #windsurf (320 messages🔥🔥):

Windsurf feedback and performance, Windsurf features and limitations, Codeium tools comparison, User experiences and issues, Data privacy and AI ethics


Nous Research AI ▷ #general (269 messages🔥🔥):

Use of AI in therapy and confidentiality, Comparison of LLMs in code generation, Data privacy and security in healthcare, Functionality vs. conciseness in AI responses, Ethics of AI and patient information


Nous Research AI ▷ #ask-about-llms (6 messages):

LlamaCpp Discord, Hermes 3 Amnesia Replication


OpenAI ▷ #ai-discussions (166 messages🔥🔥):

OpenAI's Discord Engagement, Competition with Gemini 2 Flash, Usage of APIs and Model Testing, Content Moderation Challenges, User Insights on AI Models


OpenAI ▷ #gpt-4-discussions (6 messages):

Script updates, Coding assistance, Community support

Link mentioned: Discord - Group Chat That’s All Fun & Games: Discord is great for playing games and chilling with friends, or even building a worldwide community. Customize your own space to talk, play, and hang out.


OpenAI ▷ #prompt-engineering (18 messages🔥):

Prompt Clarity, Markdown Usage in Discord, LexiDeck Framework, Streamlining Feature Research, Discord Prompt Library


OpenAI ▷ #api-discussions (18 messages🔥):

Effectiveness of Direct Prompts, Markdown Usage in Discord, LexiDeck Framework, Researching for Feature Production, Prompting Techniques


LM Studio ▷ #general (43 messages🔥):

LM Studio image generation, LM Studio update issues, Job opportunities in ML/LLM, Steiner reasoning model, Cloud VM management


LM Studio ▷ #hardware-discussion (149 messages🔥🔥):

Llama 3.2, Coral AI TPUs, GPU Alternatives, Groq LPU Inference Engine, MacBook Pro Performance


aider (Paul Gauthier) ▷ #general (130 messages🔥🔥):

DeepSeek performance, Video transcription solutions, O1 API access criteria, Architect mode, Model limitations and improvements


aider (Paul Gauthier) ▷ #questions-and-tips (39 messages🔥):

Aider Command Execution, Token Limit Errors, Using File-based Prompts, Model Switching, Web UI Development


aider (Paul Gauthier) ▷ #links (2 messages):

WebDev Arena, AI Battle Rankings, Claude Model Scores, Gemini Performance, GPT-4o Updates


Unsloth AI (Daniel Han) ▷ #general (118 messages🔥🔥):

Unsloth integration, Hymba model discussion, Fine-tuning techniques, Continued pretraining, Community feedback for Unsloth


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

Discord server appreciation, Framework feedback


Unsloth AI (Daniel Han) ▷ #help (5 messages):

Unsloth Documentation, Fine-tuning LLaMA 3, Personal Assistant Creation

Link mentioned: Tutorial: How to Finetune Llama-3 and Use In Ollama | Unsloth Documentation: Beginner's Guide for creating a customized personal assistant (like ChatGPT) to run locally on Ollama


Unsloth AI (Daniel Han) ▷ #showcase (8 messages🔥):

Test Time Training (TTT), ARC performance improvement, RL comparisons, Model parameter updates

Link mentioned: The Surprising Effectiveness of Test-Time Training for Abstract Reasoning: Language models have shown impressive performance on tasks within their training distribution, but often struggle with novel problems requiring complex reasoning. We investigate the effectiveness of t...


Stackblitz (Bolt.new) ▷ #prompting (6 messages):

Token Spending Concerns, Project Reloading Methods, Data Gathering for UI Design, Table Data Formatting in Bolt, Coding Issues and Language Compatibility

Link mentioned: Vite + React + TS: no description found


Stackblitz (Bolt.new) ▷ #discussions (106 messages🔥🔥):

Bolt Pro Subscription, Git Integration for Bolt, Facebook API Integration Challenges, User Experience with AI Tools, New Year Greetings


Cursor IDE ▷ #general (105 messages🔥🔥):

DeepSeek, Web Hosting Options, Chatbot Development, Debugging and Errors, New GitHub Features


OpenRouter (Alex Atallah) ▷ #general (71 messages🔥🔥):

OpenRouter Model Additions, DeepSeek v3 Performance, Gemini 2.0 Limitations, Sonnet Comparison, Self-Moderated Chat Models


Interconnects (Nathan Lambert) ▷ #random (22 messages🔥):

Model Evaluation Strategies, Office Setup Upgrades, Eye-Contact Camera Techniques, Reinforcement Learning Dynamics, AI Self-Correction Mechanisms

Link mentioned: Tweet from Aidan McLau (@aidan_mclau): you should basically pretend that getting a model to think for longer is the same as building a bigger model. following the math is quite fun and uncovers some neat things about industry progress


Interconnects (Nathan Lambert) ▷ #reads (13 messages🔥):

Gary Marcus's Predictions, Nvidia's Acquisition of Run:ai, GPT-4 Model Developments, Hallucination Issues in AI, Corporate AI Spending Trends


Interconnects (Nathan Lambert) ▷ #posts (21 messages🔥):

2024 Interconnects Year in Review, Meta's AI Strategy, Algos and Engagement on Social Media, Open Source Models, Slow and Steady Development


Notebook LM Discord ▷ #use-cases (9 messages🔥):

Google AI sensitivity, Podcast generation issues, Use of Google Maps, LLMs in Spanish conversations, NotebookLM Plus account


Notebook LM Discord ▷ #general (34 messages🔥):

Podcast Audio Overview, NotebookLM Plus Features, YouTube Video Uploads, Voice Model Performance, User Interface Feedback


GPU MODE ▷ #general (4 messages):

CUDA programming projects, Overlap data transfer in CUDA, Fluids simulation optimization


GPU MODE ▷ #triton (1 message):

Image Analysis Feedback


GPU MODE ▷ #algorithms (1 message):

Genesis Simulator, Benchmark Corrections

Link mentioned: Tweet from Stone Tao (@Stone_Tao): Yesterday the hyped Genesis simulator released. But it's up to 10x slower than existing GPU sims, not 10-80x faster or 430,000x faster than realtime since they benchmark mostly static environments...


GPU MODE ▷ #jobs (1 message):

Cracked Research Engineer Job, CUDA Engineer Roles, Remote LLM Infra Positions, Triton Kernel Development

Link mentioned: Cracked Engineers: Hire the best ai and software engineers for your startup.


GPU MODE ▷ #beginner (4 messages):

SSH into Vast AI GPU, Using CUDA on Ubuntu, PyTorch image for GPU, SSH key generation


GPU MODE ▷ #off-topic (1 message):

iron_bound: https://www.youtube.com/watch?v=VpAZPPCLCUI


GPU MODE ▷ #triton-puzzles (19 messages🔥):

Triton Performance vs Torch, Benchmarking Add Function, Triton Environment Variable, GPU Configuration, Code Comparison


GPU MODE ▷ #thunderkittens (4 messages):

Contributions to Triton, Integer Quantization Challenges, Triton's Optimization Claims


GPU MODE ▷ #edge (7 messages):

Raspberry Pi 5 testing, Bielik model performance, OpenBLAS effects, PP and TG test names, GPU in Raspberry Pi 5


GPU MODE ▷ #arc-agi-2 (1 message):

SSH into Ubuntu, GPU rental process, Creating instances


Perplexity AI ▷ #general (33 messages🔥):

Pro reasoning mode, DeepSeek regulations, Joke-telling abilities of AI, New Year's celebrations, AI predictions for 2025


Perplexity AI ▷ #sharing (4 messages):

YouTube Random Video Button, Content Optimization Techniques, OpenAI Public Benefit Corporation, Tibet Mega Dam Approval, Encyclopedia Britannica Updates

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (3 messages):

Sonar models usage, Perplexity AI API features, Discord bot with premium features


Modular (Mojo 🔥) ▷ #mojo (30 messages🔥):

BoxPointer renamed to OwnedPointer, Pointer issues in Mojo, Self-referential structures, Mojo update compatibility concerns

Link mentioned: [BUG] --debug-level full crashes when importing · Issue #3917 · modularml/mojo: Bug description Running a mojo script using the debugger seg faults, as opposed to when running regular mojo, which runs to completion (although I have noticed strange behavior in the regular scrip...


Modular (Mojo 🔥) ▷ #max (2 messages):

Mojo APIs for max, API modernization, Type system enhancements


Stability.ai (Stable Diffusion) ▷ #general-chat (31 messages🔥):

Scammers in Discord, Issues with SD3, Faceswap functionality in Stability.ai API, Checkpoint and Lora models, Need for better verification processes

Link mentioned: FRRouting: no description found


Eleuther ▷ #general (14 messages🔥):

Lipschitz-1 RMSNorm Replacement, Estimated Tokens in the Pile Dataset, Residual Flows Implementations, Training with Lipschitz Constants, Applications for Neural SDFs and NeRFs


LlamaIndex ▷ #blog (1 message):

Optimized RAG Pipeline, LlamaParse


LlamaIndex ▷ #general (10 messages🔥):

Anomaly Detection, Vector Store Embeddings, Chatbot Background Process, Finetuning Llama Model


Latent Space ▷ #ai-general-chat (10 messages🔥):

ModernBERT finetunes, Return of Sesame Street models, AI progress and saturation charts, OpenAI's transition to for-profit, New agentic systems from Hugging Face


Cohere ▷ #discussions (4 messages):

Tokenization in HMM, New Year Celebrations


Cohere ▷ #questions (6 messages):

Payment issues with Cohere, Switching to OpenAI, RBI guideline changes affecting transactions


tinygrad (George Hotz) ▷ #general (5 messages):

Reversible transformations in machine code, Application of pcode concepts in tinygrad, Getting started with tinygrad contributions, Tutorial resources for tinygrad, User onboarding in tinygrad

Link mentioned: GitHub - mesozoic-egg/tinygrad-notes: Tutorials on tinygrad: Tutorials on tinygrad. Contribute to mesozoic-egg/tinygrad-notes development by creating an account on GitHub.


tinygrad (George Hotz) ▷ #learn-tinygrad (1 message):

tinygrad internals, tinygrad notes

Link mentioned: tinygrad-notes/20241231_intro.md at main · mesozoic-egg/tinygrad-notes: Tutorials on tinygrad. Contribute to mesozoic-egg/tinygrad-notes development by creating an account on GitHub.


Axolotl AI ▷ #general (2 messages):

GH200 Access, D2H Memory Transfer Issue


Nomic.ai (GPT4All) ▷ #general (2 messages):

DeepSeek Coder V2 Lite, GigaChat, ModernBERT, Embedding backend for localdocs


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 message):

sevastopaul2041: Hey, what's the last date to signup for the Advanced LLM Agents MOOC ?


{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}