Frozen AI News archive

not much happened today

**Harvey** secured a new **$300M funding round**. **OuteTTS 0.3 1B & 500M** text-to-speech models were released featuring **zero-shot voice cloning**, **multilingual support** (en, jp, ko, zh, fr, de), and **emotion control**, powered by **OLMo-1B** and **Qwen 2.5 0.5B**. The **HOVER** model, a **1.5M-parameter neural net** for **agile motor control**, was introduced, leveraging **human motion capture datasets** and **massively parallel reinforcement learning**. **kokoro.js** enables running AI models locally in browsers with minimal dependencies. **Meta AI** awarded **$200K LLM evaluation grants** for projects on **regional language understanding**, **complex reasoning**, and **interactive programming environments**. **Stability AI's Twitter account was hacked**, prompting security warnings. **Alibaba Qwen** improved **Process Reward Models (PRMs)** for better **mathematical reasoning** using a **consensus filtering mechanism**. **DeepSeek V3** uses **pipeline parallelism** to enhance **distributed inference** and **long-context generation efficiency**. Discussions on **AI policy in legal frameworks** and **AI's role in democratizing education** were highlighted. Lighthearted AI-related humor was also shared.

Canonical issue URL

AI News for 1/15/2025-1/16/2025. We checked 7 subreddits, 433 Twitters and 34 Discords (225 channels, and 2732 messages) for you. Estimated reading time saved (at 200wpm): 327 minutes. You can now tag @smol_ai for AINews discussions!

Congrats to Harvey, for their new $300m round.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Developments

AI Tools and Product Releases

Company and Industry News

Technical Insights and Research

Policy and Societal Impact

Memes / Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Google's Neural Memory Architecture Revolution

Theme 2. UMbreLLa Enhances LLM Performance on Consumer GPUs

Theme 3. Wayfarer Model Redefines AI Dungeon Experience

Theme 4. Meta-Prompt Strategies for Improved LLM Task Management

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. Titans: Successor to Transformers with Human-like Memory

Theme 2. Financial Analysis of AI Subscriptions and Usage


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1. AI Tools Garner Funding but Get Stuck in the 'Loop of Doom'

Theme 2. New AI Architectures Promise to Outscale the Titans

Theme 3. AI Ethics Shake-Ups: Data Policies and DMCA Take-Downs

Theme 4. Coders Clash Over AI Coding Companions

Theme 5. Multi-Agent Systems and Tooling Take Center Stage


PART 1: High level Discord summaries

Cursor IDE Discord


MCP (Glama) Discord


Codeium (Windsurf) Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


Stackblitz (Bolt.new) Discord


Stability.ai (Stable Diffusion) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Notebook LM Discord Discord


OpenRouter (Alex Atallah) Discord


Cohere Discord


tinygrad (George Hotz) Discord


OpenAI Discord


Perplexity AI Discord


LM Studio Discord


Nomic.ai (GPT4All) Discord


GPU MODE Discord


Yannick Kilcher Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Interconnects (Nathan Lambert) Discord


LlamaIndex Discord


DSPy Discord


Axolotl AI Discord


MLOps @Chipro Discord


OpenInterpreter Discord


AI21 Labs (Jamba) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LAION Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (450 messages🔥🔥🔥):

Cursor performance issues, New funding announcement, User experiences and productivity, Comparison with other tools, Python environment issues

Links mentioned:


MCP (Glama) ▷ #general (241 messages🔥🔥):

MCP Tool Discovery, Open-Swarm Framework, Semantic Tool Selection, Dynamic Tool Updates, Home Assistant Integration

Links mentioned:


MCP (Glama) ▷ #showcase (24 messages🔥):

MCP-Bridge, SSE Support, Open Source Client Improvements, Open Strategy Partners Tools, Discord Functionality

Links mentioned:


Codeium (Windsurf) â–· #announcements (1 messages):

Student Discount Pricing, Windsurf Editor Launch, Codeium vs GitHub Copilot, Enterprise Plan, Training Data

Link mentioned: Windsurf Editor and Codeium extensions: Codeium is the AI code assistant platform that developers love and enterprises trust. Also the builders of Windsurf, the first agentic IDE.


Codeium (Windsurf) ▷ #discussion (102 messages🔥🔥):

Student Discounts, Customer Service Issues, Windsurf Updates, VSCode and IDE Preferences, Service Outages

Link mentioned: Support | Windsurf Editor and Codeium extensions: Need help? Contact our support team for personalized assistance.


Codeium (Windsurf) ▷ #windsurf (157 messages🔥🔥):

Windsurf functionality issues, Student discount concerns, DeepSeek performance, Feedback on feature requests, Cascade prompt tips

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (171 messages🔥🔥):

Phi-4 Model Fine-tuning, Dynamic Quantization Comparison, Ollama Chat Template Issues, Saving Merged Models, CPT Training Results

Links mentioned:


Unsloth AI (Daniel Han) â–· #off-topic (1 messages):

Onnx, TensorRT output differences


Unsloth AI (Daniel Han) ▷ #help (63 messages🔥🔥):

Flash Attention 2 installation issues, Training Phi-4 on Colab, Model serving using VLLM, Endless generation in Phi-4, Data packing and performance concerns

Links mentioned:


Unsloth AI (Daniel Han) â–· #research (7 messages):

Transition between memorizing and grokking, Grokking phenomenon, Unsloth training techniques, LORA for knowledge distillation

Link mentioned: Activate GROKKING NOW - Performance Phase of LLMs (II): Grokking, or the sudden generalization by AI models to new knowledge - that occurs after prolonged overfitting in LLMs, is a surprising phenomenon that has c...


Eleuther ▷ #general (125 messages🔥🔥):

LLM API for Batch Text Continuations, Training LLMs and Token Prediction, Generative Models and VAEs, Attention Mechanisms vs. MLP, Exploration of LSTMs in Modern AI


Eleuther ▷ #research (65 messages🔥🔥):

Modded NanoGPT Speedrun Record, TruthfulQA Dataset Exploitation, BERT vs. GPT for Classification, Hybrid Attention Models, Euler-Lagrange Equations in Neural Networks

Links mentioned:


Eleuther ▷ #scaling-laws (16 messages🔥):

PDE Foundation Models, Scaling Laws in Models, Implicit vs Explicit Solvers, Model Output vs Training Data


Eleuther ▷ #lm-thunderdome (8 messages🔥):

MATH DMCA Takedown, MATH Dataset Impact, YAML Quickstarter Update

Link mentioned: Tweet from Tom Adamczewski (@tmkadamcz): Hendrycks MATH has just been hit with a DMCA takedown notice. The dataset is currently disabled.https://huggingface.co/datasets/hendrycks/competition_math/discussions/5


Eleuther â–· #gpt-neox-dev (2 messages):

Zero stages in Deepspeed, Model Parallelism challenges, Performance optimization for 30b model, Amd MI250x GPUs performance

Link mentioned: gpt-neox/megatron/training.py at f7a5a6f9da47de4d4d7cdf776c0832b257f329ef · EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries - EleutherAI/gpt-neox


Stackblitz (Bolt.new) â–· #announcements (1 messages):

Bolt project title editing

Link mentioned: Tweet from StackBlitz (@stackblitz): 📢 Fresh Bolt update:You can change the project title now — making it easier to find on the projects list!


Stackblitz (Bolt.new) â–· #prompting (5 messages):

Chat History Snapshot System, Date Input Issue, New User Introduction, GitHub Repo Interaction

Link mentioned: feat: restoring project from snapshot on reload by thecodacus · Pull Request #444 · stackblitz-labs/bolt.diy: Add Chat History Snapshot SystemOverviewThis PR introduces a snapshot system for chat history, allowing the restoration of previous chat states along with their associated file system state. This...


Stackblitz (Bolt.new) ▷ #discussions (178 messages🔥🔥):

Git Support Discussions, Shadcn Default in Bolt, Session and Token Issues, Deployment Challenges, Stripe Integration Problems

Link mentioned: Apes Together Strong 0p1sf GIF - Apes Together Strong 0p1sf - Discover & Share GIFs: Click to view the GIF


Stability.ai (Stable Diffusion) ▷ #general-chat (133 messages🔥🔥):

Swarm vs A1111, Scam Alert on Twitter, Image Generation Metrics, Community License Attribution, Print on Demand with Stable Diffusion

Links mentioned:


aider (Paul Gauthier) ▷ #general (62 messages🔥🔥):

Lint Errors After Update, DeepSeek Usage and Alternatives, Model Performance Concerns, MOE and GPU Efficiency, AI Security Layers

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (64 messages🔥🔥):

Aider API Logging, Git Commit Issues, CEDARScript Integration, Chat Session Management, Agentic Tools for Code Exploration

Links mentioned:


aider (Paul Gauthier) â–· #links (1 messages):

Helicone LLM Observability, LLM Security Features, Local Deployment with Docker

Link mentioned: GitHub - Helicone/helicone: 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓: 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓 - Helicone/helicone


Nous Research AI ▷ #general (52 messages🔥):

Nous Research Affiliation, Merch Funding, Fine-tuning Techniques, LLAMA 1B QLoRA Training, Community Engagement


Nous Research AI ▷ #ask-about-llms (35 messages🔥):

GrokAdamW and Ortho Grad, LLM Memorization, Process Reward Models (PRMs), Chatbot architecture with PDF data

Links mentioned:


Nous Research AI â–· #research-papers (3 messages):

New architecture design, Recurrent models and attention, Neural long-term memory

Link mentioned: Titans: Learning to Memorize at Test Time: Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size memo...


Nous Research AI â–· #interesting-links (2 messages):

GrokFast Optimizer, Orthograd Optimizer, Coconut Repository

Link mentioned: GitHub - facebookresearch/coconut: Training Large Language Model to Reason in a Continuous Latent Space: Training Large Language Model to Reason in a Continuous Latent Space - facebookresearch/coconut


Nous Research AI â–· #research-papers (3 messages):

New Neural Architecture, Recurrent Models and Attention, Neural Long-Term Memory Module

Link mentioned: Titans: Learning to Memorize at Test Time: Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size memo...


Notebook LM Discord ▷ #use-cases (13 messages🔥):

Novice Author Workshop, Digital Pathology Groovy Scripts, Interactivity in NotebookLM, Podcast Sharing, Notebook Usability Feedback

Link mentioned: Easily listen to NotebookLM ➡ Token Wisdom ✨: Quickly and easily listen to NotebookLM ➡ Token Wisdom ✨ for free!


Notebook LM Discord ▷ #general (73 messages🔥🔥):

NotebookLM Plus Access, Podcast Feature Enhancements, Source Management, Google Workspace Licensing, Integration of New Tools

Links mentioned:


OpenRouter (Alex Atallah) â–· #announcements (1 messages):

Minimax-01, Needle-In-A-Haystack test, Requesting models on Discord

Link mentioned: OpenRouter: A unified interface for LLMs. Find the best models & prices for your prompts


OpenRouter (Alex Atallah) ▷ #general (85 messages🔥🔥):

Minimax model performance, DeepSeek issues, OpenRouter regional restrictions, Gemini flash model errors, Activity page functionality

Links mentioned:


Cohere ▷ #discussions (19 messages🔥):

Command R+ Coding Languages, Payment Processing with Stripe, Cohere Models Proxying


Cohere ▷ #questions (13 messages🔥):

Updating Command R models, Rerank 3.5 Coding Capabilities, Embedding Model Limitations, Prompt Composition for Rerank


Cohere ▷ #cmd-r-bot (53 messages🔥):

Random Number Generation, Cohere Rerank Tool, Deep Learning Learning Resources

Link mentioned: LLM University (LLMU): Welcome to LLM University, your premier learning destination for mastering Enterprise AI technologies. Designed for developers and technical professionals, our hub offers comprehensive resources, expe...


tinygrad (George Hotz) ▷ #general (12 messages🔥):

Tinygrad in Browser, JSPI Flag Integration, Cross-Platform Testing, Cloud Computing Goals, Non Uniform Memory Management

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (56 messages🔥🔥):

Tinygrad installation issues, TinyJit performance, Metal backend on pre M3 devices, Model export/import in Tinygrad, Operator fusion notes

Links mentioned:


OpenAI ▷ #ai-discussions (36 messages🔥):

AI Project File Issues, PrivateGPT and Obsidian, ChatGPT Software Engineering Limitations, Generative AI Development Hurdles, Omnimodal Image Generation Delays

Link mentioned: Google Research Unveils "Transformers 2.0" aka TITANS: Have we finally cracked the code on how to give models "human-like" memory? Watch to find out!Join My Newsletter for Regular AI Updates 👇🏼https://forwardfu...


OpenAI ▷ #gpt-4-discussions (15 messages🔥):

Canvas on Web, GPT-4o Tasks, Image Rendering Issues, Custom GPTs, Version History Glitches


OpenAI â–· #prompt-engineering (3 messages):

Prompt Engineering, Writing Books on AI, Self-Discovery Techniques


OpenAI â–· #api-discussions (3 messages):

Prompt Engineering, Writing a Book on Prompting Techniques, OpenAI Documentation Utilization


Perplexity AI ▷ #general (46 messages🔥):

Perplexity AI Issues, Image Generation Quality, Claude Sonnet Performance, Bora's Law, Market Strategies

Links mentioned:


Perplexity AI â–· #sharing (2 messages):

Enron Prank Revival, SEC Sues Elon Musk, TikTok Sale Consideration, 3D Printing AI Tools, 3D Printing Toys

Link mentioned: YouTube: no description found


Perplexity AI â–· #pplx-api (4 messages):

search_domain_filter, API model changes, CrewAI custom stop parameters


LM Studio ▷ #general (51 messages🔥):

Model Search Issues, LM Studio Logging, Context Window Explanation, VRAM vs System RAM, M2 Mac Model Loading Problems

Links mentioned:


Nomic.ai (GPT4All) ▷ #general (50 messages🔥):

Screenplay Feedback, Model Performance & Comparisons, AI Ethical Guardrails, Model Recommendations, Model Management in GPT4All

Links mentioned:


GPU MODE â–· #general (5 messages):

LeetGPU online playground, Discussion channel redirection, Call for contributors

Link mentioned: LeetGPU: no description found


GPU MODE â–· #triton (4 messages):

Triton Initialization Errors, tl.gather Limitations, Optimizing Moe Kernels, Performance Issues with Vectorization

Link mentioned: [Triton] try to optimzie triton moe kernel implmenet with vectorization and tl.gather triton-3… by yiakwy-xpu-ml-framework-team · Pull Request #2913 · sgl-project/sglang: MotivationWhen debugging problematic moe cuda kernels (illegal memory access), I tried to optimize triton moe kernels. The triton moe kernel basically implements radix sort using occurrencies (fi...


GPU MODE â–· #cuda (2 messages):

ONNX to TensorRT conversion, Myelin CUDA error


GPU MODE â–· #torch (4 messages):

Torchinductor, Torch Compile, Machine Learning Frameworks, Caffe's Historical Context

Links mentioned:


GPU MODE â–· #cool-links (6 messages):

GPU internalization, MLPerf participation, MI300X node performance, XCD division in MI300A

Link mentioned: Tweet from Fleetwood (@fleetwood___): Everyone working with GPUs needs to internalise this.A100 version of @vrushankdes's animation


GPU MODE ▷ #beginner (9 messages🔥):

GPU request batching, Kernel implementation feedback, Flash Attention with CUDA, Using Triton for blockwise matmul, Evaluating hardware requirements

Links mentioned:


GPU MODE â–· #off-topic (1 messages):

Torch TensorRT Installation, PyTorch 2.1.1 Documentation


GPU MODE â–· #arm (1 messages):

Linux arm64 Runners, Copilot Chat for Actions Job Failures

Link mentioned: Linux arm64 hosted runners now available for free in public repositories (Public Preview) · GitHub Changelog: Linux arm64 hosted runners now available for free in public repositories (Public Preview)


GPU MODE â–· #liger-kernel (1 messages):

0x000ff4: is there any active topic that I can contribute


GPU MODE â–· #self-promotion (1 messages):

Happy New Year 2025, ArXiv Submission, Researcher Endorsement


GPU MODE ▷ #🍿 (12 messages🔥):

Modal Registry for Popcorn Bot, GPU Type Management, nvidia-smi and deviceQuery, Discord Leaderboard Integration, Function Versioning in Modal

Link mentioned: What utility/binary can I call to determine an nVIDIA GPU's Compute Capability?: Suppose I have a system with a single GPU installed, and suppose I've also installed a recent version of CUDA. I want to determine what's the compute capability of my GPU. If I coul...


GPU MODE â–· #thunderkittens (3 messages):

Documentation for Kernel Options, Onboarding Assistance


Yannick Kilcher ▷ #general (29 messages🔥):

LLM Surprises vs Unsurprises, Discord LLM Recommendations, Model Distillation Approach, ChatGPT Poignant Moment, Document Handling Techniques

Link mentioned: Tutorials | 🦜️🔗 LangChain: New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.


Yannick Kilcher ▷ #paper-discussion (11 messages🔥):

New Google Research Architecture, OpenAI API Policy Changes, OpenAI's Compute Costs, Tensor Product Attention (TPA), Daily Paper Discussion Recording

Links mentioned:


Yannick Kilcher â–· #ml-news (5 messages):

4090 GPU failures, 2-slot 4090 availability

Link mentioned: OEM 48GB RTX 4090 Founders Edition Dual width GPU Graphics card Ganming/ Server | eBay: no description found


Latent Space ▷ #ai-general-chat (38 messages🔥):

Francois Chollet's AI Lab Ndea, Curator for Synthetic Data Generation, Titans Memory Architecture, HAL Holistic Agent Leaderboard, Harvey AI Funding Raise

Links mentioned:


Modular (Mojo 🔥) ▷ #general (4 messages):

Modular subreddit, GitHub organization move


Modular (Mojo 🔥) ▷ #mojo (28 messages🔥):

Recursive Types in Mojo, SIMD Performance Concerns, Variadic Lists to Dictionary Implementation Issues

Links mentioned:


Interconnects (Nathan Lambert) â–· #events (2 messages):

Agent Identity Hackathon, Xeno Grant Applications

Link mentioned: Xeno Grant: Agent Identity Hackathon · Luma: Come join Plastic Labs & Betaworks for an agent identity hackathon to kick off Xeno Grant (powered by $YOUSIM).$5,000 in prizes for the most compelling…


Interconnects (Nathan Lambert) ▷ #news (9 messages🔥):

LiveCodeBench Update, Cerebras Yield Problem, Contextual AI Platform Launch, TGI Backend Expansion, SWE-bench Multimodal Evaluation

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (8 messages🔥):

AMD GPU Support, Intel Sponsorship, Ai2 Funding

Link mentioned: Access MI300X GPU Today | TensorWave | The MI300X Cloud: Access AMD MI300X GPUs today on TensorWave Cloud. Contact us today to get started.


Interconnects (Nathan Lambert) â–· #random (2 messages):

Terminology for non-reasoning models, GPT-4o, Autoregressive models


Interconnects (Nathan Lambert) â–· #cv (1 messages):

natolambert: wow a throwback. this should've been more....


Interconnects (Nathan Lambert) â–· #reads (4 messages):

Human Decision-Making vs AI, Technology Cycle Expectations, Social Movements against AI, Public Understanding of LLMs


Interconnects (Nathan Lambert) â–· #posts (1 messages):

Project Aria, Meta communications

Link mentioned: Introducing Project Aria, from Meta: Project Aria is a research program from Meta, to help build the future responsibly. Project Aria unlocks new possibilities of how we connect with and experience the world.


LlamaIndex â–· #blog (3 messages):

Vellum AI partnership, Neomagus winning LLM x Law Hackathon, Women in AI RAG Hackathon


LlamaIndex ▷ #general (16 messages🔥):

Signup Issues Resolved, Metadata Filtering in ChromaDB, LLM Integration with llmlingua, Tag Extraction Approaches

Link mentioned: Add longllmlingua2 integration by tituslhy · Pull Request #17531 · run-llama/llama_index: DescriptionAdded LLMLingua 2 integration! LLMLingua2 is an improvement over LLMLingua1 using a smaller sized prompt compression method trained via data distillation from GPT-4 for token classifica...


DSPy â–· #show-and-tell (1 messages):

text-to-SQL pipeline


DSPy â–· #general (1 messages):

jimmc414_00230: Any word on when we might expect DSPy v3?


DSPy â–· #examples (2 messages):

dspy ReAct usage, Addition function error, LLama model issues


Axolotl AI â–· #general (4 messages):

Ideal chat template, Torchtune integration


MLOps @Chipro â–· #events (1 messages):

Cooperative AI Summer School, Confirmed Speakers, Application Details, Sortition in Democracy, Financial Assistance

Links mentioned:


MLOps @Chipro â–· #general-ml (2 messages):

Cost of solutions, Churn prevention strategies


OpenInterpreter â–· #general (2 messages):

Bora's Law, Open Interpreter functionalities, Autonomous systems development

Link mentioned: Bora's Law: Intelligence Scales With Constraints, Not Compute: This is a working paper exploring an emerging principle in artificial intelligence development.


AI21 Labs (Jamba) â–· #general-chat (2 messages):

Jamba API Performance, Comparative Analysis with OpenAI







{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}