Frozen AI News archive

not much happened today

**ChatGPT**, **Sora**, and the **OpenAI API** experienced a >5 hour outage but are now restored. Updates to **vLLM** enable **DeepSeek-V3** to run with enhanced **parallelism** and **CPU offloading**, improving **model deployment flexibility**. Discussions on **gradient descent** in **top-k routing MoE** and adoption of **FP8 precision** focus on **training efficiency** and **memory optimization**. **AIDE**, an **AI voice medical assistant** by **Team Therasync**, leverages **Qdrant**, **OpenAI**, and **Twilio**. **DeepSeek-Engineer** offers AI-powered coding assistance with structured outputs. **LlamaIndex** integrates **LlamaCloud** and **ElevenLabs** for large-scale **document processing** and voice interaction. Insights on **version control** with **ghstack** and advocacy for **linear decay learning rate schedules** highlight best practices in AI development. Experts predict **smaller, tighter models**, **true multimodal models**, and **on-device AI** in 2025. Proposals for **planetary-scale federated learning** and community AGI moonshots emphasize future AI directions. Discussions on **agentic systems**, **multi-agent workflows**, and **deliberative alignment** through **chain of thought reasoning** underscore AI safety and alignment efforts.

Canonical issue URL

AI News for 12/26/2024-12/27/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (215 channels, and 5579 messages) for you. Estimated reading time saved (at 200wpm): 601 minutes. You can now tag @smol_ai for AINews discussions!

ChatGPT, Sora, and the OAI API had a >5 hour outage. They are back up.

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Infrastructure & Optimization

AI Applications & Tools

AI Development Practices

AI Innovation & Future Trends

AI Safety & Alignment

AI Infrastructure & Optimization

AI Development Practices

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek's Cost Efficiency and Comparative Performance vs. 4o

Theme 2. DeepSeek-V3 Architecture: Leveraging 671B Mixture-of-Experts

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. OpenAI's Growing Capital Needs and Funding Plans

Theme 2. Criticism of 'Gotcha' Tests to Determine LLM Intelligence

Theme 3. AI and Mathematics: Progress and Limitations Highlighted


AI Discord Recap

A summary of Summaries of Summaries by o1-mini-2024-09-12

**Theme 1: DeepSeek Dominates the AI Race

**Theme 2: Integrating AI Like a Pro (or Not)

**Theme 3: Ka-Ching! Pricing Models Shake Up AI Access

**Theme 4: GPU Gurus and Training Tricks

**Theme 5: Creativity Meets Code (and Ethics)


PART 1: High level Discord summaries

Cursor IDE Discord


Codeium (Windsurf) Discord


aider (Paul Gauthier) Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


Nous Research AI Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


Stackblitz (Bolt.new) Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


Notebook LM Discord Discord


Interconnects (Nathan Lambert) Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


Cohere Discord


Latent Space Discord


LlamaIndex Discord


Torchtune Discord


DSPy Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


OpenInterpreter Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


LAION Discord


MLOps @Chipro Discord


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (1129 messages🔥🔥🔥):

Cursor IDE Functionality, DeepSeek V3 Performance, Claude Sonnet Comparison, Context Management in AI Tools, Model Efficiency and Costs

Links mentioned:


Codeium (Windsurf) ▷ #content (1 messages):

Windsurf innovation, Behind the scenes of Windsurf, Holiday messages

Link mentioned: Tweet from Windsurf (@windsurf_ai): What exactly is Windsurf? Watch how we dared to innovate by breaking every industry convention 🌊


Codeium (Windsurf) ▷ #discussion (202 messages🔥🔥):

Windsurf performance issues, Codeium Pro plan frustrations, Integration problems with IDEs, Macbook M1 terminal issues, Global rules in Cascade

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (557 messages🔥🔥🔥):

Windsurf Feedback, DeepSeek V3, Credit System, User Experience, AI Tool Comparisons

Links mentioned:


aider (Paul Gauthier) ▷ #announcements (1 messages):

Aider v0.70.0 release, o1 model support, analytics opt-in, error handling improvements, new install methods

Link mentioned: Release history: Release notes and stats on aider writing its own code.


aider (Paul Gauthier) ▷ #general (534 messages🔥🔥🔥):

DeepSeek V3 Performance, AI Coding Tools and Strategies, Aider Integration, Svelte Documentation for LLMs, Developer Skills and Learning

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (63 messages🔥🔥):

DeepSeek V3 Performance, Aider Configuration, Repo Map Functionality, Token Limits Discussion, Model Merging Strategies

Links mentioned:


aider (Paul Gauthier) ▷ #links (1 messages):

GitDiagram, Gitingest

Links mentioned:


Eleuther ▷ #general (9 messages🔥):

Hugging Face Trainer Modification, Pythia Intermediate Checkpoints, Machine Learning Research Interest


Eleuther ▷ #research (278 messages🔥🔥):

Causal Inference in Machine Learning, Intelligence and Learning Models, World Models and Video Generation, Symbolic Representation in AI, Human Learning and Cognition

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Deepseek v3, OpenRouter usage, Model comparisons, Cost of frontier models

Link mentioned: Tweet from OpenRouter (@OpenRouterAI): Deepseek has tripled in usage on OpenRouter since the v3 launch yesterday.Try it yourself, w/o subscription, including web search:Quoting Anjney Midha 🇺🇸 (@AnjneyMidha) Deepseek v3 seems to be a gen...


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

AI Chat Terminal (ACT), Content Identification/Moderation System (CIMS), Google Search for Grounding, RockDev Tool

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (277 messages🔥🔥):

DeepSeek V3 Performance, Model Comparisons, Tool Calling in AI Models, OCR Support in AI Tools, Open Weight Models

Links mentioned:


Nous Research AI ▷ #general (184 messages🔥🔥):

NVMe Performance Insights, Linux Distros for Beginners, Model Comparisons and Experiences, Nous Merch Launch, URL Moderation API Challenges

Links mentioned:


Nous Research AI ▷ #ask-about-llms (81 messages🔥🔥):

Deepseek V3 Performance, RoPE Implementation, Benchmarking Differences, Code Assistance Tools

Links mentioned:


Nous Research AI ▷ #research-papers (1 messages):

real.azure: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf


Nous Research AI ▷ #interesting-links (1 messages):

xebidiah: https://xebidiah.com


Nous Research AI ▷ #research-papers (1 messages):

real.azure: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf


LM Studio ▷ #general (165 messages🔥🔥):

AI Tools and Ethical Concerns, Model Performance and Improvements, Image Generation and Derivative Work, MLX and Memory Leaks, RPG and AI Integration

Links mentioned:


LM Studio ▷ #hardware-discussion (92 messages🔥🔥):

GPU Utilization in LLM Studio, Building a Multi-GPU System, Model Performance with VRAM, Agentic Workflows and Frameworks, Server Hardware Limitations in LLM Studio

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (150 messages🔥🔥):

Forking Repositories, Fine-tuning Models, LoRA Weights vs Full Model Weights, Dynamic Adapter Loading with Hugging Face, Dataset Filtering Techniques

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (3 messages):

Bachelor's thesis data training, Instruction-tuning datasets, Python coding datasets


Unsloth AI (Daniel Han) ▷ #help (66 messages🔥🔥):

Unsloth model functionalities, Finetuning models, GGUF and 4-bit conversion, Vision language models, Model saving issues

Links mentioned:


OpenAI ▷ #ai-discussions (155 messages🔥🔥):

ChatGPT Outages, DeepSeek V3 Performance, Comparison of AI Models, Quantum System Hypothesis

Link mentioned: High error rates for ChatGPT, APIs, and Sora: no description found


OpenAI ▷ #gpt-4-discussions (39 messages🔥):

GPT-03 release timeline, ChatGPT service issues, Humorous NPC concept, User experience with prompts


OpenAI ▷ #prompt-engineering (5 messages):

Project Discussions, Outfit Creation


OpenAI ▷ #api-discussions (5 messages):

Discussing minute durations, Second project updates, Outfit creation for Ziggi_Jo


Stackblitz (Bolt.new) ▷ #prompting (7 messages):

Gabe's new app, Quality Issues on Bolt, Prompting for Code Changes, Claude Load and Performance


Stackblitz (Bolt.new) ▷ #discussions (183 messages🔥🔥):

Using Bolt with OpenAI, Netlify 404 Routing Issues, Public GitHub Repo Importing, Token Usage Issues, Community Support for Bolt

Links mentioned:


Perplexity AI ▷ #general (134 messages🔥🔥):

Perplexity AI Models, DeepSeek, Subscription Issues, AGI Discussion, AI Video Creation Aggregator

Links mentioned:


Perplexity AI ▷ #sharing (17 messages🔥):

OpenAI's humanoid robot plans, AI Pretending to Change Views, Human Spine Grown in Lab, Body-Heat Powered Wearables, Groundbreaking AI Model from India

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (1 messages):

Perplexity API performance, Spaces API usability, Custom frontends support


Stability.ai (Stable Diffusion) ▷ #general-chat (125 messages🔥🔥):

Hunyuan Video Generation, Image Prompting Techniques, Model Compatibility with Loras, AI Video Rendering Challenges, 3D Printing and AI Art

Links mentioned:


Notebook LM Discord ▷ #use-cases (14 messages🔥):

Pathfinder 2 summaries, Audio Overviews for Wikipedia, AI chatbots, NotebookLM capabilities, UFO Discussions

Links mentioned:


Notebook LM Discord ▷ #general (82 messages🔥🔥):

NotebookLM Interactive Mode Issues, Audio Overview Functionality, Subscription Information, Tabular Data in NotebookLM, Sharing AI Generated Podcasts

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (25 messages🔥):

DeepSeek V3 Launch, Multi-Token Prediction Technique, Model Training Efficiency, RL Rewards System, Engineering Innovations in DeepSeek

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (5 messages):

Deepseek Multi-head Latent Attention Mechanism, Deepseek V3 Inference Libraries

Link mentioned: DeepSeek-V3/inference/model.py at main · deepseek-ai/DeepSeek-V3: Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.


Interconnects (Nathan Lambert) ▷ #ml-drama (21 messages🔥):

DeepSeek License Update, Bluesky's AI Backlash, OpenAI Structural Changes, IPO Speculations, Conflict of Interest Concerns

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (6 messages):

Deepseek V3 performance, AI lab requirements, Benchmarking instruction following

Link mentioned: Tweet from Mihir Patel (@mvpatel2000): @teortaxesTex I would guess the pretraining is cracked but post-training lags behind big labs, which accounts for many of these artifacts


Interconnects (Nathan Lambert) ▷ #memes (1 messages):

xeophon.: https://x.com/simonw/status/1872141432544489731


Interconnects (Nathan Lambert) ▷ #nlp (3 messages):

Iterative Preference Learning, Monte Carlo Tree Search, Reasoning Capabilities of LLMs, Self-Evaluation in Models

Link mentioned: Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning: We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by ...


Interconnects (Nathan Lambert) ▷ #reads (8 messages🔥):

RL Training for LLMs, DPO vs PPO, Viewing Parties for Critical Discussions, Incentivizing Better CoTs

Link mentioned: - YouTube: no description found


GPU MODE ▷ #general (22 messages🔥):

DETRs Discussion, DeepSeek-V3 Mixed Precision Training, Block-wise vs Channel-wise Quantization, H800 GPU Training, NVIDIA's Delayed Scaling Technique

Links mentioned:


GPU MODE ▷ #triton (8 messages🔥):

device_print issue in Colab, tl.inf feature comparison, Triton recompilation conditions

Link mentioned: triton/python/triton/runtime/jit.py at 3c058ee7f518da83e99d472f5ebe16fb75e1f254 · triton-lang/triton: Development repository for the Triton language and compiler - triton-lang/triton


GPU MODE ▷ #cuda (8 messages🔥):

DETRs expertise, WGMMA inputs, CUTLASS 3.6.0 discussion

Link mentioned: CUTLASS 3.6.0 · NVIDIA/cutlass · Discussion #2013: Hopper structured sparse GEMM. FP16 FP8 INT8 TF32 A refactor to the CUTLASS 3.x convolution kernel::ConvUniversal API to bring it in line with gemm::GemmUniversal. Now the 3.x convolution API is no...


GPU MODE ▷ #torch (3 messages):

Performance of Compiled Functions with Guards, Ring Attention Doubts


GPU MODE ▷ #cool-links (2 messages):

Character.AI Inference Optimization, AMD Software Stack Gaps, Benchmarking AMD MI300X vs Nvidia H100 + H200

Links mentioned:


GPU MODE ▷ #beginner (10 messages🔥):

Learning ML Tools, vLLM Token Throughput, CUDA Resources, Attention Mechanisms

Links mentioned:


GPU MODE ▷ #pmpp-book (1 messages):

tando.: The video lecture helps me a lot to understand concept combining with the book


GPU MODE ▷ #lecture-qa (1 messages):

Occupancy vs. Utilization


GPU MODE ▷ #bitnet (6 messages):

Torchcompiled forward passes, Bitblas Conv2D generation, Mixed precision training options

Link mentioned: Tweet from Ethan (@torchcompiled): This is a cool idea, but you won't have a good time past the MNIST toy example. No backprop means needing... 128 forward passes, for grad estimate with only 0.009 cos similarity with true grad.inc...


GPU MODE ▷ #sparsity-pruning (1 messages):

Sparsify Function, Dense Matrix Compression, Model Masking Solutions

Link mentioned: ao/torchao/sparsity/README.md at 567cb46409f5f9a761429a87d27b1d5312642888 · pytorch/ao: PyTorch native quantization and sparsity for training and inference - pytorch/ao


GPU MODE ▷ #metal (1 messages):

archit3ch: Is it possible to take a .air file compiled for macOS and run it on iPad?


GPU MODE ▷ #arc-agi-2 (2 messages):

Model Task Format Understanding, Benchmarking Limitations, Scaling to AGI Challenges

Link mentioned: Tweet from Mikel Bober-Irizar (@mikb0b): When models can't understand the task format, the benchmark can mislead, introducing a hidden threshold effect.And if there's always a larger version that humans can solve but an LLM can't...


tinygrad (George Hotz) ▷ #general (6 messages):

matching engine performance, rewrite bounty, testing environment concerns

Link mentioned: Issues · tinygrad/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️ - Issues · tinygrad/tinygrad


tinygrad (George Hotz) ▷ #learn-tinygrad (51 messages🔥):

Tinygrad vs PyTorch performance, JIT implementation issues, Beam search functionality, RTX 4070 usage, Model conversion to Tinygrad

Links mentioned:


Cohere ▷ #discussions (11 messages🔥):

Christmas greetings, Introduction to AI and ML, Community welcome messages, Lighthearted banter

Link mentioned: Pricing - Affordable Enterprise Generative AI Models: Access our models directly through our API to create scalable production workloads.


Cohere ▷ #questions (25 messages🔥):

Command R Plus Updates, r7b Initial Impressions, Cows and AI Bot Interaction, Emojis in Communication, Bethlehem Star Inquiry


Cohere ▷ #api-discussions (5 messages):

Image Embeds Rate Limits, Holiday Hours Impact


Cohere ▷ #cmd-r-bot (9 messages🔥):

Command R, Command R+, Retrieval-Augmented Generation


Cohere ▷ #projects (2 messages):

Content Identification/Moderation System (CIMS), Companion Discord Chatbot, Content Flagging and Deletion Features

Link mentioned: Home: An AI-powered Discord bot blending playful conversation with smart moderation tools, adding charm and order to your server. - rapmd73/Companion


Latent Space ▷ #ai-general-chat (22 messages🔥):

Orion delays, OpenAI outage, Deepseek pricing and performance, Illuminate tool, Frontier vs Foundation models

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

AI Engineer Summit NYC, Event Calendar Updates, Latent Space Events

Link mentioned: Latent Space (Paper Club & Other Events) · Events Calendar: View and subscribe to events from Latent Space (Paper Club & Other Events) on Luma. Latent.Space events. PLEASE CLICK THE RSS LOGO JUST ABOVE THE CALENDAR ON THE RIGHT TO ADD TO YOUR CAL. "Ad...


LlamaIndex ▷ #blog (2 messages):

Report Generation Agent, Conversational Voice Assistant with RAG


LlamaIndex ▷ #general (18 messages🔥):

LlamaIndex Assistant RAG App, Payroll PDF Parsing, LlamaIndex Roadmap Update, Running Non-Quantized Models with Ollama, Image Data Extraction

Link mentioned: Tags · llama3.2-vision: Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.


LlamaIndex ▷ #ai-discussion (3 messages):

LlamaIndex discussion, Docling from IBM, Open Source Library

Link mentioned: - YouTube: no description found


Torchtune ▷ #general (6 messages):

Flex Compilation Issues, Nested Compiling Dilemmas, Graph Break Concern

Link mentioned: torchtune/torchtune/modules/attention_utils.py at main · pytorch/torchtune: PyTorch native post-training library. Contribute to pytorch/torchtune development by creating an account on GitHub.


Torchtune ▷ #papers (17 messages🔥):

DeepSeek V3, H800 GPUs, FP8 Training Techniques, NVLink Bandwidth Innovations, Triton vs CUDA Implementations

Link mentioned: DeepSeek-V3/DeepSeek_V3.pdf at main · deepseek-ai/DeepSeek-V3: Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.


DSPy ▷ #general (15 messages🔥):

Glossary Generation Script, TypedDict in Pydantic, Elegant Pydantic Designs, Schema Descriptions in Prompts

Link mentioned: A script to generate a glossary of key terms from your Jekyll posts. We're using DSPy to handle LLM interactions; it helps with boilerplate prompt context and parsing responses into Pydantic objects. To run this, put this script in a folder named 'scripts' (or whatever) in your Jekyll site directory. Then plug in your Anthropic API key (or point DSPy to the LLM endpoint of your choice). It will output a YAML file named 'glossary.yaml' to your '_data' directory.: A script to generate a glossary of key terms from your Jekyll posts. We're using DSPy to handle LLM interactions; it helps with boilerplate prompt context and parsing responses into Pydantic o...


Modular (Mojo 🔥) ▷ #general (4 messages):

Mojo swag, Modular merch, Merch quality


Modular (Mojo 🔥) ▷ #mojo (1 messages):

Copyable Traits Design


Modular (Mojo 🔥) ▷ #max (8 messages🔥):

MAX and XLA comparison, Mojo vs Python APIs, Compiler optimizations, Community engagement in development, Endia and Basalt project updates


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (10 messages🔥):

Certificate Declaration Form, Next Course Dates, Quiz Form Accessibility, Advanced LLM Agents MOOC

Links mentioned:


OpenInterpreter ▷ #general (5 messages):

OCR API Issues, Desktop Version Release, Voice to Voice Chat App, Open-Interpreter OS Mode


OpenInterpreter ▷ #ai-content (2 messages):

Claude 3.5 Opus, Comparison with O1 and O1 Pro


Nomic.ai (GPT4All) ▷ #general (7 messages):

Copy Button for AI Code, WASM Package Availability, Vulcan Version Inquiry, Mouse and Keyboard Functionality, New Template Usage


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (3 messages):

Inference scaling, Post-training techniques, Tool-augmented LLM, Leaderboards for model validation


LAION ▷ #general (2 messages):

Whisper's capabilities, Voice Activity Detection


MLOps @Chipro ▷ #general-ml (1 messages):

ML Ops frameworks for HPC, Guild AI stability, DIY ops frameworks





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}