Frozen AI News archive

not much happened today

**DeepSeek-V3**, a **671 billion parameter mixture-of-experts model**, surpasses **Llama 3.1 405B** and **GPT-4o** in coding and math benchmarks. **OpenAI** announced the upcoming release of **GPT-5** on **April 27, 2023**. **MiniMax-01 Coder mode** in **ai-gradio** enables building a chess game in one shot. **Meta** research highlights trade-offs in scaling visual tokenizers. **Google DeepMind** improves diffusion model quality via inference-time scaling. The **RA-DIT** method fine-tunes LLMs and retrievers for better RAG responses. The U.S. proposes a three-tier export restriction system on AI chips and models, excluding countries like **China** and **Russia**. Security vulnerabilities in AI chatbots involving CSRF and prompt injection were revealed. Concerns about superintelligence and weapons-grade AI models were expressed. **ai-gradio** updates include NVIDIA NIM compatibility and new models like **cosmos-nemotron-34b**. **LangChain** integrates with **Claude-3-haiku** for AI agents with persistent memory. **Triton Warp specialization** optimizes GPU usage for matrix multiplication. **Meta's** fine-tuned **Llama** models, **OpenBioLLM-8B** and **OpenBioLLM-70B**, target personalized medicine and clinical trials.

Canonical issue URL

AI News for 1/16/2025-1/17/2025. We checked 7 subreddits, 433 Twitters and 34 Discords (225 channels, and 2327 messages) for you. Estimated reading time saved (at 200wpm): 298 minutes. You can now tag @smol_ai for AINews discussions!

o3-mini is coming.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Releases and Evaluations

Research Papers and Technical Insights

AI Policy, Regulation, and Security

Tools, Frameworks, and Development

AI in Industry & Use Cases

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. ElevenLabs' TTS: Factors Behind Outstanding Quality

Theme 2. OpenWebUI's Canvas: Enhanced Multi-Language Support

Theme 3. DeepSeek V3 vs Claude 3.5 Sonnet: Analyzing the Practical Edge

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. OpenAI's Task Management Imperfections: User Frustrations Unveiled


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1. Major Funding Rounds and Company Milestones

Theme 2. Advances in AI Model Development and Performance

Theme 3. AI Tools and Integrations Enhancing Developer Workflows

Theme 4. Challenges and Issues in AI Model Usage and Implementation

Theme 5. Community Initiatives and Events in AI


PART 1: High level Discord summaries

Stackblitz (Bolt.new) Discord


Eleuther Discord


Cursor IDE Discord


Unsloth AI (Daniel Han) Discord


MCP (Glama) Discord


Interconnects (Nathan Lambert) Discord


Codeium (Windsurf) Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Notebook LM Discord Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


Nomic.ai (GPT4All) Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


Yannick Kilcher Discord


OpenAI Discord


LM Studio Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


LlamaIndex Discord


DSPy Discord


Axolotl AI Discord


OpenInterpreter Discord


AI21 Labs (Jamba) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LAION Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Stackblitz (Bolt.new) ▷ #announcements (1 messages):

Lucide Icons, Bolt update, Error resolution

Link mentioned: Tweet from StackBlitz (@stackblitz): Bolt 🧠 update: Deterministic IconsLLMs tend to hallucinate icon names which causes errors, but Bolt's agent can now access entire icon libraries and pick the perfect one — every time. (No extra t...


Stackblitz (Bolt.new) ▷ #prompting (6 messages):

NPM usage in React, Prompting for specific additions, Instructions for PDF annotation web app, File structure documentation, TraycerAI extension review

Link mentioned: Tweet from Sanket Dongre (@sanketdongre369): Just tried out @TraycerAI extension and it works in @cursor_ai!It keep track of the entire codebase. You can create specific tasks to implement, improve or fix features. It generates a plan which you ...


Stackblitz (Bolt.new) ▷ #discussions (283 messages🔥🔥):

Challenges with Bolt's functionality, User experiences with Supabase, Git integration for Bolt, Domain checking tool development, Efficient collaboration strategies

Links mentioned:


Eleuther ▷ #general (227 messages🔥🔥):

RWKV Model Performance, Box Puzzle AI Challenge, Model Size vs. Inference Speed, QRWKV Project, CoT Tuning

Links mentioned:


Eleuther ▷ #research (39 messages🔥):

BERT and GPT Attention Mechanisms, Gradient Sparsity and Compression, NanoGPT Speedrun Achievements, SIRENs and Derivatives, Context Length Warmup Strategies

Links mentioned:


Eleuther ▷ #scaling-laws (19 messages🔥):

Low Resolution Data Effects, Model Approximation Phenomena, Precision vs Accuracy in Model Training, Ground Truth Data Challenges, Finite Element Method (FEM) Convergence


Eleuther ▷ #lm-thunderdome (3 messages):

HF Fair Use Claim, Git Repository Availability


Cursor IDE ▷ #general (236 messages🔥🔥):

Cursor IDE Performance, Claude Integration, Funding Confirmation, User Experience Issues, O1 Model Feedback

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (136 messages🔥🔥):

Issues with Model Saving, Hugging Face API Links, Colab GPU Instances, Qwen2 and Multi-gpu Support, Updates on Kaggle and Unsloth

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (1 messages):

Prompt tracking tools for LLMs, Open-source LLM comparison


Unsloth AI (Daniel Han) ▷ #help (42 messages🔥):

Docker image for Unsloth AI, Inference speed comparison between Unsloth and Hugging Face, Finetuning support for Molmo, Error messages during Docker image setup, LoRa adapter training differences

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

Thinking Models, Llama Model Fine-tuning, Benchmarking AI Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (1 messages):

KD Full Fine-tuning, Selective Weights, LORA


MCP (Glama) ▷ #general (100 messages🔥🔥):

MCP terminology confusion, MCP tools and clients, Sage and its marketplace feature, Timeout limitations in MCP, Integration and testing of MCP SDK

Links mentioned:


MCP (Glama) ▷ #showcase (25 messages🔥):

MCP-Bridge Usage, Discord Bot Development, David Shapiro's ACE Framework, Drinks Blog on User Simulation, GitHub Project by frgmt0

Link mentioned: GitHub - frgmt0/blnk: Contribute to frgmt0/blnk development by creating an account on GitHub.


Interconnects (Nathan Lambert) ▷ #news (102 messages🔥🔥):

SWE-bench release, WeirdML benchmark, OpenAI vagueposting, Deepseek R1 release rumors, Concerns over AI transparency

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (4 messages):

NeurIPS PC criticism, Personalities in ML conferences, Career transitions in ML

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (6 messages):

Olmo 2 Pod Experience, Internet Businesses, Resource Allocation, H100 vs CS-2 Comparison

Link mentioned:

Auphonic

</a>: no description found

Interconnects (Nathan Lambert) ▷ #nlp (3 messages):

Survey Papers on Reasoning Models, Criticism of Survey Papers


Interconnects (Nathan Lambert) ▷ #reads (4 messages):

Debt: The First 5000 Years, Devin AI, Answer AI's Impact

Links mentioned:


Interconnects (Nathan Lambert) ▷ #lectures-and-projects (1 messages):

H2O.ai products, H2O LLM Datastudio, Data preparation for LLMs

Link mentioned: H2O LLM DataStudio | Docs | H2O LLM DataStudio | Docs: <H2OHome title="H2O LLM DataStudio" description="A no-code application and toolkit to streamline data curation, preparation, and augmentation tasks related to Large Language Models (...


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Wave 2 Release, Cascade Web Search, Cascade Autogenerated Memories, Performance Improvements, System Status

Links mentioned:


Codeium (Windsurf) ▷ #discussion (28 messages🔥):

Codeium customer service issues, Student discount program expansion, IDE features and bug reports, Login issues with Codeium extension, Bugs fixes and new features feedback

Link mentioned: Open VSX Registry: no description found


Codeium (Windsurf) ▷ #windsurf (90 messages🔥🔥):

Windsurf account issues, Integration of tools in Windsurf, Pricing discrepancies for students, User experience with Windsurf features, Bug reports and support queries

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (116 messages🔥🔥):

Activity Page Confusion, Gemini Model Endpoint Changes, OpenRouter API and Regional Restrictions, DeepSeek Performance, BYOK API Key Integration

Links mentioned:


aider (Paul Gauthier) ▷ #general (45 messages🔥):

DeepSeek 3 Performance and Configuration, Aider and GitHub Milestone, OpenRouter Provider Settings, Quantization Impact on DeepSeek, CodeGate Security Features

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (60 messages🔥🔥):

Agentic Tools for Code Exploration, User Customization of Aider, Scraping Limitations with Aider, Context Limits Issue, Sparse Priming Representation

Links mentioned:


aider (Paul Gauthier) ▷ #links (2 messages):

Helicone LLM Observability, Activepieces Tool Overview, LLM Documentation Resources

Links mentioned:


Nous Research AI ▷ #general (58 messages🔥🔥):

Nous funding, OpenAI compensation structures, Employee ownership and shares, Model hosting options, Training frameworks

Links mentioned:


Nous Research AI ▷ #ask-about-llms (17 messages🔥):

RAG chatbot architecture, Small model alternatives, LLM modification suggestions, Structured output challenges


Nous Research AI ▷ #research-papers (6 messages):

New Neural Architecture, Titans Implementation in PyTorch, Large Language Models Book

Links mentioned:


Nous Research AI ▷ #research-papers (6 messages):

New neural long-term memory module, Titans implementation in PyTorch, Overview of large language models

Links mentioned:


Notebook LM Discord ▷ #use-cases (11 messages🔥):

NotebookLM prompts, Virtual travel agent bot, NotebookLM for science, AI Studio reliability, NotebookLM for computer topics

Links mentioned:


Notebook LM Discord ▷ #general (76 messages🔥🔥):

Podcast Generation Limitations, Notebooks Usage Questions, YouTube Video Link Handling, Interactive Mode and Bugs, Audio Overview Customization

Links mentioned:


Perplexity AI ▷ #general (68 messages🔥🔥):

Perplexity Pro Issues, Model Performance, Image Generation Problems, Subscription Activation Challenges, Student Discounts Inquiry


Perplexity AI ▷ #sharing (5 messages):

Starship 7 incident, China's Space Solar Array, Apple's USA-Made iPhone Chips, OpenAI's Economic Blueprint, Time Travel Discussion

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (5 messages):

Sonar and Sonar-Pro models, Custom stop parameters error, crewAI model troubleshooting

Link mentioned: no title found: no description found


Stability.ai (Stable Diffusion) ▷ #general-chat (76 messages🔥🔥):

David Lynch in the Lodge, Using Stable Diffusion for business, Image generation and controlnet issues, Training LoRA with personal photos, Switching between Stable Diffusion webUIs

Link mentioned: GitHub - lllyasviel/stable-diffusion-webui-forge: Contribute to lllyasviel/stable-diffusion-webui-forge development by creating an account on GitHub.


Nomic.ai (GPT4All) ▷ #announcements (1 messages):

Nomic Embed Vision, Apache 2.0 License, Multimodal tasks, Open weights and code

Link mentioned: Tweet from Nomic AI (@nomic_ai): Nomic Embed Vision is now under an Apache 2.0 License.- High quality, unified embedding space for image, text, and multimodal tasks.- Outperforms both OpenAI CLIP and text-embedding-3-small - Open we...


Nomic.ai (GPT4All) ▷ #general (66 messages🔥🔥):

Ethics in AI, Model Recommendations, Model Performance, Custom URL Schemes, Template for Qwen2.5-1.5B

Links mentioned:


GPU MODE ▷ #general (8 messages🔥):

LeetGPU Platform, CUDA Learning Resources

Links mentioned:


GPU MODE ▷ #triton (6 messages):

Triton kernel optimizations, Warp specialization, Fusion of Triton kernels

Link mentioned: Automatic Warp Specialization Optimization by htyu · Pull Request #5622 · triton-lang/triton: Warp specialization enhances kernel performance by utilizing an asynchronous execution model, where different parts of the kernel are handled by separate hardware units. The data communication betw...


GPU MODE ▷ #torch (29 messages🔥):

Torch Profiler Issues, Custom Autograd Function Implementation, Double Backward in PyTorch, Resource Sharing for Optimizations, Intermediate Tensor Management in Backward Pass

Link mentioned: torch.func.grad — PyTorch 2.5 documentation: no description found


GPU MODE ▷ #algorithms (1 messages):

Custom torch.autograd.Function, addbmm with Softplus activation, Double backward implementation, PyTorch ops and Triton ops, Mathematical derivation


GPU MODE ▷ #off-topic (6 messages):

LLM Agents, Business Logic vs LLM, Automation in Programming


GPU MODE ▷ #arm (1 messages):

Linux arm64 runners, Copilot chat enhancements

Link mentioned: Linux arm64 hosted runners now available for free in public repositories (Public Preview) · GitHub Changelog: Linux arm64 hosted runners now available for free in public repositories (Public Preview)


GPU MODE ▷ #liger-kernel (1 messages):

0x000ff4: is there any active topic that I can contribute


GPU MODE ▷ #thunderkittens (7 messages):

Hardware requirements for development, CUDA coding without GPU, Ampere architecture necessity, Ideal GPUs for tensor cores, Apple M chip compatibility

Link mentioned: LeetGPU: no description found


GPU MODE ▷ #arc-agi-2 (4 messages):

Familiarity with arc-agi-2 codebase, Self-improvement with grounding signal, Tree-sampling experiments with MCTS, Llama-3.2 performance on math tasks, AI feedback for model training


tinygrad (George Hotz) ▷ #general (10 messages🔥):

Flash Attention in Tinygrad, Memory Control Challenges, GPU Out-Of-Memory Errors, Nested-Loop Representation


tinygrad (George Hotz) ▷ #learn-tinygrad (39 messages🔥):

Operator (Un)Fusion, Flash Attention Challenges, Tinygrad JIT Optimization, Adding FP8 Support, Windows Support in Tinygrad

Links mentioned:


Yannick Kilcher ▷ #general (17 messages🔥):

FORTRAN resurgence, CUDA alternatives, Triton vs CUDA, Complex loss functions, V JEPA paper issues


Yannick Kilcher ▷ #paper-discussion (22 messages🔥):

MiniMax paper discussion, Model training results, Active Inference Insights, Communication non-verbal cues, Machine learning resources

Links mentioned:


Yannick Kilcher ▷ #ml-news (7 messages):

3090 Memory Mods, CaPa Paper Release, Mesh Generation Technologies

Link mentioned: CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation: no description found


OpenAI ▷ #ai-discussions (20 messages🔥):

Google Research TITANS, RunwayML content moderation, Master AI agent for LLM logs, API access limitations, Privacy concerns with OpenAI tokens

Links mentioned:


OpenAI ▷ #gpt-4-discussions (7 messages):

Mind Journal Functionality, DALL·E Integration Issue, Version History Inaccuracy


OpenAI ▷ #prompt-engineering (6 messages):

Prompt Engineering Book, ChatGPT Jailbreaks, Community Moderation


OpenAI ▷ #api-discussions (6 messages):

Prompt Engineering, ChatGPT Jailbreaks, AI Moderation


LM Studio ▷ #general (39 messages🔥):

Molmo Vision Model Errors, Llama Model Loading Issues, Model Performance on Mac, MiniMax-01 Benchmarks, Image Handling Bugs

Links mentioned:


Cohere ▷ #discussions (10 messages🔥):

User Introductions, Student Projects, Billing Support


Cohere ▷ #questions (7 messages):

Prompt engineering for reranking, Chat history relevance, Use case exploration, Vercel hosting inquiry


Cohere ▷ #api-discussions (1 messages):

Command R models cost comparison, Default model timestamp confusion, Issues with 8-2024 version


Cohere ▷ #cmd-r-bot (11 messages🔥):

Free platforms for deep learning, Resources for beginners in deep learning, Cohere LLM University, Cohere Cookbooks, AWS Cloud platform for Cohere


Cohere ▷ #cohere-toolkit (1 messages):

Community Guidelines, Mod Role


Modular (Mojo 🔥) ▷ #general (4 messages):

Modular GitHub Repository Migration, Community Package Showcase, Adding Mojo/MAX Projects to Awesome for Beginners, Concerns about Mojo Language Stability, Request for Mojo-Specific Projects List

Link mentioned: GitHub - MunGell/awesome-for-beginners: A list of awesome beginners-friendly projects.: A list of awesome beginners-friendly projects. Contribute to MunGell/awesome-for-beginners development by creating an account on GitHub.


Modular (Mojo 🔥) ▷ #mojo (18 messages🔥):

Mojo Parallelization Constraints, yyjson Data Structures, Type System and Language Improvements, Using Variant for Sum Type Support, Quantum Country Resource Feedback

Links mentioned:


Modular (Mojo 🔥) ▷ #max (1 messages):

MAX's final form, Composable components, .NET platform comparison


Latent Space ▷ #ai-general-chat (22 messages🔥):

SWEBench Verification, Funding Rounds in AI, Agent Recipes, Cybersecurity Executive Order, OpenAI's webRTC API

Links mentioned:


LlamaIndex ▷ #blog (2 messages):

Women in AI RAG Hackathon, GraphRAG with LlamaIndex and Memgraph


LlamaIndex ▷ #general (18 messages🔥):

Cached Augmented Generation (CAG), Azure AI with OpenAI, Embedding Model Configuration, Research on RAG Domain Influence, Tracking Prompts Across LLMs

Links mentioned:


DSPy ▷ #general (5 messages):

DSPy v3 release, Stable Diffusion optimization, Chain-of-thought style iteration

Link mentioned: Tweet from Thorondor LLC (@ThorondorLLC): New project! Optimize your stable diffusion prompts via a "chain-of-thought" style iteration - built with DSPy


DSPy ▷ #examples (3 messages):

dspy ReAct usage, addition function error, LLama model issues


Axolotl AI ▷ #general (5 messages):

ChatML, Llama3, Torchtune, ShareGPT Dataset, Migration from ShareGPT


OpenInterpreter ▷ #general (1 messages):

Screenshot Analysis, Image Insights


AI21 Labs (Jamba) ▷ #general-chat (1 messages):

``








{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}