Frozen AI News archive

not much happened today

**NVIDIA** has launched **Cosmos**, an open-source video world model trained on **20 million hours of video**, aimed at advancing **robotics** and **autonomous driving**. The release sparked debate over its open-source status and technical approach. Additionally, **NVIDIA** announced **Digits**, a **$3,000** personal AI supercomputer designed to democratize AI computing. The AI community expresses mixed feelings about rapid AI progress, with concerns about **AGI**, job displacement, and investment hype. Discussions also highlight upcoming tools for fine-tuning AI models at home and foundation models for AI robotics.

Canonical issue URL

AI News for 1/6/2025-1/7/2025. We checked 7 subreddits, 433 Twitters and 32 Discords (218 channels, and 3342 messages) for you. Estimated reading time saved (at 200wpm): 365 minutes. You can now tag @smol_ai for AINews discussions!

Happy 2hr Jensen keynote day.

https://www.youtube.com/watch?v=K4qQtPpSn-k


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Theme 1. NVIDIA Cosmos: Revolutionizing Robotics and Autonomous Systems

Theme 2. Overwhelmed by AI Advancements: Navigating Uncertainty


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. NVIDIA Digits: $3K AI Supercomputer Could Revolutionize Local AI

Theme 2. Fine-Tuning Success: 3B Model Excel in Math After Hugging Face Training

Theme 3. Criticisms of RTX 5090 for AI Use: Balancing VRAM & Performance

Theme 4. NVIDIA & AMD in THE AI Tech Race: Digits vs Strix Halo

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. NVIDIA Cosmos: Revolutionizing Robotics and Autonomous Systems**

Theme 2. Overwhelmed by AI Advancements: Navigating Uncertainty


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. GPU Hype and Infrastructure

Theme 2. Fine-Tuning and LoRA Adventures

Theme 3. Tools, Function Calling, and Agents

Theme 4. Payment and Privacy Dramas

Theme 5. MLOps, LLM Security, and What’s Next


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


LM Studio Discord


Codeium (Windsurf) Discord


Stability.ai (Stable Diffusion) Discord


Stackblitz (Bolt.new) Discord


Cursor IDE Discord


Interconnects (Nathan Lambert) Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


Notebook LM Discord Discord


Nous Research AI Discord


Perplexity AI Discord


AI21 Labs (Jamba) Discord


OpenAI Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Cohere Discord


GPU MODE Discord


LlamaIndex Discord


OpenInterpreter Discord


Axolotl AI Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


LAION Discord


Mozilla AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (687 messages🔥🔥🔥):

Unsloth updates and troubleshooting, Tokenization issues with trained LORA adapters, Fine-tuning Llama 3.2, Hardware and memory considerations for AI processing, Using cloud resources for large models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

Gemini 1207 knowledge cutoff, Picotron codebase for fine tuning


Unsloth AI (Daniel Han) ▷ #help (26 messages🔥):

Merging LoRA adapters, Multiple datasets for finetuning, Deploying LLaMA models, Multi-GPU training support

Link mentioned: Google Colab: no description found


Unsloth AI (Daniel Han) ▷ #research (1 messages):

Token Embeddings, Ontological Concepts, Semantic Meaning


LM Studio ▷ #announcements (1 messages):

LM Studio 0.3.6 Release, Function Calling API, Qwen2VL and QVQ Support, New Installer Features, In-App Updates

Links mentioned:


LM Studio ▷ #general (201 messages🔥🔥):

LM Studio in AMD presentation, Function calling API updates, Model loading issues with Qwen-VL, Performance benchmarks with 4090 GPU, Feedback on new UI design

Links mentioned:


LM Studio ▷ #hardware-discussion (227 messages🔥🔥):

NVIDIA Project DIGITS, Speculative Decoding, AI model performance, AMD vs NVIDIA GPUs, Local LLM inference

Links mentioned:


Codeium (Windsurf) ▷ #discussion (71 messages🔥🔥):

DeepSeek vs Codeium Models, Codeium Subscription Issues, Codeium Chat Functionality, AI Model Support and Testing, User Concerns about Windsurf Performance

Link mentioned: Cline (prev. Claude Dev) - Visual Studio Marketplace: Extension for Visual Studio Code - Autonomous coding agent right in your IDE, capable of creating/editing files, running command...


Codeium (Windsurf) ▷ #windsurf (242 messages🔥🔥):

Windsurf Errors, Cascade Autocomplete Issues, User Experience Feedback, Internal Server Errors, Feature Requests and Suggestions

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (268 messages🔥🔥):

Project DIGITS, Stable Diffusion Licensing, Image Generation Quality, Flux Generation Times, NVIDIA Cosmos

Links mentioned:


Stackblitz (Bolt.new) ▷ #prompting (9 messages🔥):

Exporting Bolt projects, Using external LLMs, Manually uploading projects

Link mentioned: Vite + React + TS: no description found


Stackblitz (Bolt.new) ▷ #discussions (258 messages🔥🔥):

Token Consumption Concerns, Chat App Development with Supabase, Bolt and GitHub Integration Issues, Framework Selection for Mobile Apps, Account Migration and Preview Issues

Links mentioned:


Cursor IDE ▷ #general (191 messages🔥🔥):

Cursor IDE performance issues, Modularity in code structure, AI behavior in coding tasks, Cursor extension for project understanding, Issues with Composer agent

Link mentioned: Reddit - Dive into anything: no description found


Interconnects (Nathan Lambert) ▷ #events (3 messages):

Embarcadero Meetup, Meeting Schedule, Shack15 Location


Interconnects (Nathan Lambert) ▷ #news (38 messages🔥):

OpenAI AI agents launch, Devin valuation and support, 01.AI startup updates, Anthropic funding, Competition in AI

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (8 messages🔥):

MosaicML researchers, ChatGPT transcription versions, Token usage in responses


Interconnects (Nathan Lambert) ▷ #random (39 messages🔥):

Nvidia Project Digits Supercomputer, Challenges with Nvidia ARM CPUs, Community collaboration and funding for AI, Open-source software compatibility

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (4 messages):

AI2 community involvement, Kling v1.6 and trolley problem, Nextcloud support challenges

Link mentioned: Tweet from fofr (@fofrAI): I tried to see how Kling v1.6 would handle the trolley problem.But it just backed away slowly.


Interconnects (Nathan Lambert) ▷ #rl (67 messages🔥🔥):

Agents in RL Training, Function Calling and Tool Usage, Self-Correction Mechanisms, Reward Models and Gaming Behavior, Reasoning Traces Generation

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (9 messages🔥):

MeCo Method, Contextual Artifacts in LM Training, Danqi's Contributions, Physics of LLM Papers, Impact of Timestamps

Link mentioned: Tweet from Tianyu Gao (@gaotianyu1350): Introducing MeCo (metadata conditioning then cooldown), a remarkably simple method that accelerates LM pre-training by simply prepending source URLs to training documents.https://arxiv.org/abs/2501.01...


Interconnects (Nathan Lambert) ▷ #policy (1 messages):

Agents and Labor Policy, National Security, Model Shops and AI Proliferation


Eleuther ▷ #general (22 messages🔥):

Training High-Parameter LLMs, Deepspeed Zero-3 Memory Issues, Gradient Checkpointing, Ethics Dataset Evaluation, Learning and Contribution in AI


Eleuther ▷ #research (9 messages🔥):

Cerebras AI Grant Proposals, Inference-Aware Fine-Tuning for LLMs, In-Context Learning Representations, Tensor-GaLore for Neural Network Training, Cut Cross-Entropy Loss Method

Links mentioned:


Eleuther ▷ #lm-thunderdome (112 messages🔥🔥):

Evaluation of Chat Templates vs No Chat Templates, Logprob Analysis of Multiple Choice Questions, Instruct Model Performance, Arc Challenge and Generation Tasks, Chat Format Impact on Model Responses


Eleuther ▷ #gpt-neox-dev (5 messages):

Llama2 Checkpoints Conversion, Optimizer Support in NeoX, Scheduler Syntax in Configs, Mixed Precision Loss Scaling, Pythia Batch Size Calculation

Link mentioned: gpt-neox/megatron/training.py at main · EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries - EleutherAI/gpt-neox


OpenRouter (Alex Atallah) ▷ #general (138 messages🔥🔥):

OpenRouter Payment Issues, Model Performance Concerns, DeepSeek V3 Reliability, Using Crypto for Payments, LLM Limitations in Game Development

Links mentioned:


aider (Paul Gauthier) ▷ #general (73 messages🔥🔥):

Aider's utility in coding, Issues with O1 Pro, Using Continue.dev alongside Aider, Tips for effective AI interactions, Challenges with command execution

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (50 messages🔥):

Aider prompt caching, Custom LLM usage with Aider, Terminal display issues, Color themes for terminal, Troubleshooting file updates in Aider

Links mentioned:


aider (Paul Gauthier) ▷ #links (2 messages):

Aider workflow adaptation, LLM-guided interviews


Notebook LM Discord ▷ #use-cases (14 messages🔥):

NBA Game Recaps, AI in Virtual Sportscasting, Sources and Citation Practices, AI for Contract Review, NotebookLM's Capabilities

Links mentioned:


Notebook LM Discord ▷ #general (86 messages🔥🔥):

NotebookLM Usage Limits, NotebookLM Plus Features, Audio Overview Length, Missing Features, Google Workspace Questions

Links mentioned:


Nous Research AI ▷ #general (78 messages🔥🔥):

Nous Forge API updates, Performance comparisons of RTX GPUs, NVIDIA Project DIGITS, AI bot behavior tweaks, USB-C for networking

Links mentioned:


Nous Research AI ▷ #ask-about-llms (1 messages):

Reputation concerns, Privacy issues, Profit-driven motivations


Nous Research AI ▷ #interesting-links (3 messages):

Structure of Neural Embeddings, MiniMind Training Pipeline, MiniMind Model Overview

Links mentioned:


Perplexity AI ▷ #general (58 messages🔥🔥):

Perplexity performance issues, Concerns about privacy and ads, User interface feedback, SOC 2 compliance inquiries, Subscription and usage questions

Link mentioned: Trust Center | Powered by Drata: Ready to turn trust into your competitive advantage? Sprint through security reviews and quickly share key security information with Trust Center.


Perplexity AI ▷ #sharing (10 messages🔥):

NASA's Moon Micro-Mission, AgiBot's Humanoid Robot Training Dataset, Microsoft's AGI Development, Disney's New Projects, Gen Z Looksmaxxing Trend


Perplexity AI ▷ #pplx-api (1 messages):

Mail from December 19, Concerns about online models


AI21 Labs (Jamba) ▷ #general-chat (66 messages🔥🔥):

AI21 Labs Token, Scam Concerns, Social Media Communication, Token Audits

Link mentioned: DEXTools: DEXTools, the gateway to DEFI, real-time charts, history and all token info from blockchain.


OpenAI ▷ #ai-discussions (18 messages🔥):

AGI and Innovation, AI as a Tool, Recent Advances in AI Technology, Fine-tuning AI Models, RTX 5000 DLSS 4


OpenAI ▷ #gpt-4-discussions (9 messages🔥):

Convo transfer from 4o to 1o, Mini O1 vs GPT-4, Ubuntu setup and GPU compatibility, O1 Pro upgrade discussion


OpenAI ▷ #prompt-engineering (15 messages🔥):

Midjourney SREF prompt in Dall-E, JSON schema responses, Retry implementations, Style naming for prompts


OpenAI ▷ #api-discussions (15 messages🔥):

Midjourney prompt in Dall-E, JSON schema return issue, Retries not working, Prompt engineering concerns


Latent Space ▷ #ai-general-chat (47 messages🔥):

Foundation Models in Science, NVIDIA's Cosmos, Vercel's AI SDK, AI in Whale Conservation, FP4 Wars

Links mentioned:


Modular (Mojo 🔥) ▷ #general (3 messages):

Modular docs font weight, Font readability


Modular (Mojo 🔥) ▷ #mojo (37 messages🔥):

Mojo Debugger, Mojo Project Structure, Static Lists in Mojo, Indexing with Runtime Variables, Static Analysis Methods

Links mentioned:


Cohere ▷ #discussions (7 messages):

AI-Plans Hackathon, Best AI Models, Command R+ Performance, AI Alignment Research


Cohere ▷ #questions (2 messages):

Evals, Mechanistic Interpretability, Object Detection in AR


Cohere ▷ #api-discussions (4 messages):

Embed API Usage, Response Structure, Image Encoding

Links mentioned:


Cohere ▷ #cmd-r-bot (16 messages🔥):

Neural Network in JavaScript, Discord Restart Issues, Cohere Billing Policies


Cohere ▷ #projects (4 messages):

AR projects for object detection, Live AR asset implementation


GPU MODE ▷ #triton (10 messages🔥):

Array Operations in Triton, Config Management in Projects, Performance of MMAs with wgmma, Memory Layout and Data Movement, Kernel Compilation and Autotuning


GPU MODE ▷ #cuda (1 messages):

Output fragment register layout, WMMA loading and storing, Experimenting with matrix copying


GPU MODE ▷ #torch (2 messages):

Custom Autograd Functions, Guard Failures in PyTorch

Link mentioned: Extending PyTorch — PyTorch main documentation: no description found


GPU MODE ▷ #cool-links (3 messages):

Picotron framework, DeepSeek-v3 paper, LLM infrastructure videos

Links mentioned:


GPU MODE ▷ #beginner (3 messages):

Journey sharing, ONNX to TensorRT conversion issues


GPU MODE ▷ #off-topic (4 messages):

Nvidia's Project DIGITS, Grace Blackwell Superchip, Training Small Models

Link mentioned: NVIDIA Project DIGITS: The World’s Smallest AI Supercomputer. : Reserve yours today.


GPU MODE ▷ #rocm (3 messages):

hipDeviceAttributeMaxBlocksPerMultiProcessor, CUDA vs HIP attributes comparison, AMD hardware max occupancy, Thread block discussions

Links mentioned:


GPU MODE ▷ #🍿 (5 messages):

Discord based leaderboard, GPU Glossary resources


LlamaIndex ▷ #blog (4 messages):

LlamaIndex and MLflow integration, Multi-agent systems with NVIDIA AI, Cohere models usage with LlamaIndex

Links mentioned:


LlamaIndex ▷ #general (9 messages🔥):

LlamParse Error, LlamIndex Tutorial Notebook, Text-to-SQL Capabilities, Documentation Links

Links mentioned:


OpenInterpreter ▷ #general (10 messages🔥):

Open Interpreter 1.0 Release, Archiving of Classic OI, Issues with pip installation, Modifications and PR submissions, Local Model Performance

Links mentioned:


Axolotl AI ▷ #general (8 messages🔥):

GH200 Utilization, Compilation Challenges, Discord Link Issues


DSPy ▷ #general (7 messages):

MiPROv2 Instructions Flow, Integration of dspy with Langchain


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

Certificate Declaration, Assignment Completion, Certificate Deadlines


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (5 messages):

Declaration Form Acknowledgment, Email Address Consistency for Submissions


Nomic.ai (GPT4All) ▷ #general (5 messages):

Reasoner v1 capabilities, Local Docs indexing issue, Embedding model support


MLOps @Chipro ▷ #events (1 messages):

MLOps and Feature Stores Webinar, Integration of LLMs in MLOps, 2024 MLOps Developments, Trends and Challenges in 2025

Link mentioned: MLOps and Feature Stores in 2025 with Ben Epstein: Join our 1-hr webinar where Simba Khadder of Featureform and Ben Epstein of MLOps Community will chat about upcoming MLOps trends in 2025!


LAION ▷ #research (1 messages):

LLM security testing, Harmful AI Assistant Challenge, GraySwanAI Arena

Link mentioned: Tweet from Gray Swan AI (@GraySwanAI): 🚨 New Arena Launch Alert: Harmful AI Assistant Challenge 🚨💰 $40,000 in Prizes📅 Launch Date: January 4th, 1 PM EST🤖 5 Anonymous Models🔥 Prizes for speed & quantity.🎮 Multi-turn Inputs AllowedYou...


Mozilla AI ▷ #announcements (1 messages):

Common Voice AMA, 2024 Review, Voice Technology Accessibility


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (1 messages):

Dolphin 3.0 Model Series, BFCL Leaderboard

Link mentioned: Dolphin 3.0 - a cognitivecomputations Collection: no description found




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}