small little news items

**Ollama** enhanced its models by integrating **Cohere's R7B**, optimized for **RAG** and **tool use tasks**, and released **Ollama v0.5.5** with quality updates and a new engine. **Together AI** launched the **Llama 3.3 70B multimodal model** with improved reasoning and math capabilities, while **OpenBMB** introduced **MiniCPM-o 2.6**, which outperforms **GPT-4V** on visual tasks. Insights into **Process Reward Models (PRM)** were shared to boost **LLM reasoning**, alongside **Qwen2.5-Math-PRM** models that excel at mathematical reasoning. **OpenAI** rolled out a beta of **Tasks** in **ChatGPT** for Plus, Pro, and Team users, enabling scheduled reminders and summaries, while **LangChain** introduced open-source **ambient agents** for email assistance. AI software engineering is advancing rapidly, with predictions that it will match human capabilities within 18 months. Research on **LLM scaling laws** highlights power-law relationships and plateauing improvements, while **GANs** are experiencing a revival.

AI News for 1/13/2025-1/14/2025. We checked 7 subreddits, 433 Twitters and 32 Discords (219 channels, and 2161 messages) for you. Estimated reading time saved (at 200wpm): 256 minutes. You can now tag @smol_ai for AINews discussions!

ChatGPT Tasks launched. Cursor raised a Series B. Sakana announced an elegant improvement over LoRAs, though with only minor performance gains. Hailuo dropped a giant 456B MoE similar to DeepSeek V3.

Nothing we'd feature as the title story, but nice incremental progress.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Model Releases and Updates

AI Features and Tools

AI Research and Papers

AI Community and Events

AI Industry News and Policy

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Qwen's Math Process Reward Models and Innovations

Theme 2. MiniMax-Text-01: MoE and Long Context Capabilities

Theme 3. Inspiration from LLMs Driving New Open Source Initiatives

Theme 4. RTX Titan Ada 48GB: Unveiling New GPU Potentials

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. AGI: Marketing Hype or Genuine Innovation?


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1. New AI Models: Codestral, MiniMax-01, and DeepSeek V3

Theme 2. AI Tools and IDEs: Performance Hiccups and User Innovations

Theme 3. Advancements in AI Features: From Task Scheduling to Ambient Agents

Theme 4. AI Infrastructure: GPU Access and Support Challenges

Theme 5. AI in Code Development: Practices and Philosophies


PART 1: High level Discord summaries

Codeium (Windsurf) Discord


Unsloth AI (Daniel Han) Discord


Cursor IDE Discord


LM Studio Discord


Eleuther Discord


Stackblitz (Bolt.new) Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Interconnects (Nathan Lambert) Discord


Latent Space Discord


Notebook LM Discord


Perplexity AI Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


GPU MODE Discord


OpenAI Discord


Cohere Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


Torchtune Discord


OpenInterpreter Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


LlamaIndex Discord


LAION Discord


DSPy Discord


MLOps @Chipro Discord


The Axolotl AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium (Windsurf) ▷ #discussion (16 messages🔥):

Support Ticket Issues, Codeium Connectivity Problems, Technical Assistance, Freedom of Speech Concerns, Telemetry Errors on VS Code

Link mentioned: Exafunction/CodeiumVisualStudio: Visual Studio extension for Codeium. Contribute to Exafunction/CodeiumVisualStudio development by creating an account on GitHub.


Codeium (Windsurf) ▷ #windsurf (350 messages🔥🔥):

Windsurf Performance, Improvements in AI Suggestions, Need for Clear Guidelines, Application Development Challenges, User Experience and Feedback


Unsloth AI (Daniel Han) ▷ #general (128 messages🔥🔥):

Unsloth GPU Support, Mistral Code Generation Models, DeepSeek V3 Local Run, Chat Templates in Models, Model Licensing and Availability


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

Dynamic Filters in LLAMA, Language Learning, Video Resources, Pronunciation Practice


Unsloth AI (Daniel Han) ▷ #help (144 messages🔥🔥):

Fine-tuning on Unsloth, Training VLMs, RAG for hallucination control, Using 4bit saving, Dataset preparation for LLM


Unsloth AI (Daniel Han) ▷ #research (1 messages):

fjefo: https://youtu.be/SRfJQews1AU?si=s3CSvyThYNcTetX_


Cursor IDE ▷ #general (209 messages🔥🔥):

Cursor IDE performance issues, Model comparisons, Batch file for snapshots, Using Claude vs O1, User experiences with Cursor


LM Studio ▷ #announcements (2 messages):

Hugging Face search functionality, Issue resolution


LM Studio ▷ #general (72 messages🔥🔥):

Voodoo Graphics Card Nostalgia, Qwen 2.5 versus QwQ Comparisons, Local LLM Recommendations, Generative AI and Game Development, LM Studio Chat vs Development Responses


LM Studio ▷ #hardware-discussion (77 messages🔥🔥):

RTX 5090 vs 4090 multi-GPU setups, GPU memory bandwidth impacts, Image generation GPU setups, Dual GPU setups for cost efficiency, Model training complexities


Eleuther ▷ #general (40 messages🔥):

PhD in Statistical Learning Theory, Orch-Or Theory and AI, AI Autocomplete Speed, Developer Opportunities, Off-Topic Channel Dynamics


Eleuther ▷ #research (73 messages🔥🔥):

Eukaryotic Cell Vaults, Research Channel Guidelines, Hyper-connections in Neural Networks, Process Reward Models, VinePPO and CoT Trajectories


Eleuther ▷ #scaling-laws (6 messages):

Induction Head Bumps, Loss vs Compute, Circuit Interoperability


Eleuther ▷ #interpretability-general (2 messages):

Neel Nanda podcast, Mechanistic interpretability, Neural network understanding

Link mentioned: Neel Nanda - Mechanistic Interpretability (Sparse Autoencoders): Machine Learning Street Talk (MLST) · Episode


Eleuther ▷ #lm-thunderdome (12 messages🔥):

Pre-commit mixed line ending issue, MLQA benchmark implementation PR, LM Evaluator's majority voting, Filters in LM Evaluation Harness


Eleuther ▷ #gpt-neox-dev (5 messages):

Growth of AINews Newsletter, Llama 2 Configuration Differences, Gradient Clipping Concerns, Tokenizer Padded Vocabulary Issues, Activation Function Discrepancies


Stackblitz (Bolt.new) ▷ #prompting (15 messages🔥):

Code Removal Issues, Report Issues in Editor, Enabling Diffs, Google Analytics 4 API Integration, Netlify Function Challenges


Stackblitz (Bolt.new) ▷ #discussions (108 messages🔥🔥):

Token Consumption Issues, Supabase Project Integration, Chatbot Development Strategies, Subscription Plan Confusions, Bug Reporting and Persistence


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Telegram AI Chatbot, DeVries AI Subscription Model, Multi-Model Access in Telegram

Link mentioned: devriesai: Your Telegram AI Agent


OpenRouter (Alex Atallah) ▷ #general (106 messages🔥🔥):

OpenRouter Provider Setup, Deepseek Performance Issues, OpenRouter Rate Limits, Anime Discussions, MiniMax Model Releases


Stability.ai (Stable Diffusion) ▷ #general-chat (104 messages🔥🔥):

Discord Bot Issues, Flux Sampling with DEIS BETA, Aesthetic Classifier Development, FP8 vs FP16 Precision, Performance of Intel B580 in Stable Diffusion


Interconnects (Nathan Lambert) ▷ #news (28 messages🔥):

Qwen2.5-Math-PRM, Claude Sonnet 3.5 performance, MiniCPM-o 2.6 introduction, Midwit model, Evaluation methods in models


Interconnects (Nathan Lambert) ▷ #ml-drama (1 messages):

420gunna: https://x.com/aidan_mclau/status/1878944278782890158


Interconnects (Nathan Lambert) ▷ #random (31 messages🔥):

AI in Higher Education, CIO Functions, Michigan's Chatbot Offerings, Limitations of LLMs, Stripe Tax Registration


Interconnects (Nathan Lambert) ▷ #cv (16 messages🔥):

Synthetic Chain-of-Thought Training, O1 Models Discussion, Tulu-R and Vision Generalists, Naming Conventions for Models, Molmo and Reinforcement Learning

Link mentioned: LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs: Reasoning is a fundamental capability for solving complex multi-step problems, particularly in visual contexts where sequential step-wise understanding is essential. Existing approaches lack a compreh...


Interconnects (Nathan Lambert) ▷ #reads (17 messages🔥):

Process Reward Models (PRMs), ProcessBench Benchmarking, Nathan Lambert's Post-Training Insights, LLaMA 3 Paper Discussion


Interconnects (Nathan Lambert) ▷ #policy (3 messages):

Economic Blueprint, Executive Order for Datacenters


Latent Space ▷ #ai-general-chat (82 messages🔥🔥):

ChatGPT Task Automation, Cursor Series B Funding, AI Email Assistants, Model Performance and Capacity Issues, GPU Investment Strategies


Latent Space ▷ #ai-announcements (2 messages):

Reading list on HN, User achievements


Notebook LM Discord ▷ #announcements (1 messages):

Audio Overviews Feedback Survey, User Research Participation

Link mentioned: Register your interest: Google feedback survey: Hello, We are looking for feedback on NotebookLM via a short survey. This will help the Google team better understand your needs in order to incorporate them into future product enhancements. To regist...


Notebook LM Discord ▷ #use-cases (29 messages🔥):

AI-generated podcasts, Organizing notebooks in Google Classroom, Length control for generated podcasts, Audio transcription options, Sharing permissions for NotebookLM


Notebook LM Discord ▷ #general (52 messages🔥):

NotebookLM functionalities, Paid version features, User feedback sessions, Audio summary issues, Sharing projects


Perplexity AI ▷ #general (74 messages🔥🔥):

Perplexity Pro Features, User Experience Issues, Coding Assistance Frustrations, API Access Limitations, Content Visibility Concerns


Perplexity AI ▷ #sharing (2 messages):

TikTok regulations, German summary requests


Nous Research AI ▷ #general (69 messages🔥🔥):

Claude's personality and training, Open source models limitations, Defensive practices in AI data, Quality of training data and its impact, Human-like interactions in models

Link mentioned: Never Give Up Fist Pump GIF - Never Give Up Fist Pump Motivation - Discover & Share GIFs: Click to view the GIF


Nous Research AI ▷ #ask-about-llms (3 messages):

Gemini for Data Extraction, 4o-mini and Llama-8B, Jina for Text Conversion


Nous Research AI ▷ #research-papers (1 messages):

real.azure: It seems to be the month of attention alternatives.

https://arxiv.org/abs/2501.06425


aider (Paul Gauthier) ▷ #general (57 messages🔥🔥):

.aider.conf.yml exclusion from .gitignore, DeepSeek v3 performance on local machines, API configurations for custom hosts, Recommendations for open-source models, Benchmarking individual models


aider (Paul Gauthier) ▷ #questions-and-tips (6 messages):

Aider Error Handling, Toggling Edit Formats, Copy Context Command Limitations

Link mentioned: File editing problems: aider is AI pair programming in your terminal


GPU MODE ▷ #general (5 messages):

Machete kernels across multiple GPUs, lm_head quantization issues, Bitpacking incompatibilities, Model monkey-patching


GPU MODE ▷ #triton (1 messages):

Blockwise Matrix Multiplication, Linear Layer Kernel


GPU MODE ▷ #cuda (3 messages):

FA3 Profiling, H200 vs H100 Performance


GPU MODE ▷ #torch (2 messages):

torch.cuda.max_memory_allocated, torch.cpu, Memory allocation functions


GPU MODE ▷ #algorithms (2 messages):

MiniMax-01 Open Source, Lightning Attention Architecture, Ultra-Long Context, Cost-Effectiveness of MiniMax Models

Link mentioned: Tweet from MiniMax (official) (@MiniMax__AI): MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era. We are thrilled to introduce our latest open-source models: the foundational language model MiniMax-Text-01 and the visua...


GPU MODE ▷ #cool-links (1 messages):

remek1972: New one: https://salykova.github.io/sgemm-gpu


GPU MODE ▷ #jobs (2 messages):

Kaiko AI job openings, Foundation Models at Prior Labs


GPU MODE ▷ #beginner (8 messages🔥):

CUDA atomic functions, CFD solver development

Link mentioned: How do I use atomicMax on floating-point values in CUDA?: I have used atomicMax() to find the maximum value in the CUDA kernel: global void global_max(float* values, float* gl_max) { int i=threadIdx.x + blockDim.x * blockIdx.x;&...


GPU MODE ▷ #pmpp-book (1 messages):

DeepSeek 2.5 reasoning, Image analysis


GPU MODE ▷ #torchao (6 messages):

int8_weight_only, torch.compile and optimization, torchao compatibility with TorchScript/ONNX

Link mentioned: ao/torchao/quantization at main · pytorch/ao: PyTorch native quantization and sparsity for training and inference - pytorch/ao


GPU MODE ▷ #off-topic (3 messages):

GTC Attendance, CUDA Talks, Networking at GTC


GPU MODE ▷ #metal (1 messages):

MPS Profiler, Kernel Profiling, GPU Trace Issues, PyTorch and MPS, MTL_CAPTURE_ENABLED


GPU MODE ▷ #self-promotion (1 messages):

Thunder Compute, Cloud GPU Pricing, CLI Instance Management

Link mentioned: Thunder Compute: Low-cost GPUS for anything AI/ML: Train, fine-tune, and deploy models on Thunder Compute. Get started with $20/month of free credit.


GPU MODE ▷ #thunderkittens (1 messages):

Onboarding Documentation

Link mentioned: TK onboarding: Summary This document specifies how to get started on programming kernels using TK. Please feel free to leave comments on this document for areas of improvement / missing information. Summary 1 Back...


GPU MODE ▷ #edge (4 messages):

Nvidia Cosmos, Jetson, Transformer Engine, Mamba, 3D Vision Stack


OpenAI ▷ #ai-discussions (20 messages🔥):

New codestral model, ChatGPT 4o with Canvas, OpenAI model changes, AI Misalignment Videos, App token issues


OpenAI ▷ #gpt-4-discussions (1 messages):

jlgri: Will there ever be a chatgpt app on apple watches?


OpenAI ▷ #prompt-engineering (8 messages🔥):

Data Format Discussions, PDF Limitations, Assistant Response Issues, Rephrasing Techniques


OpenAI ▷ #api-discussions (8 messages🔥):

Improving Data Formats, Reducing PDF Usage, Assistant Behavior Customization, User-Friendly Rephrasing


Cohere ▷ #discussions (9 messages🔥):

Konkani Language Preservation, AI Model for Konkani language


Cohere ▷ #questions (16 messages🔥):

Rerank fine-tuning pricing, AI memory limits, API key limits, Cohere pricing and documentation


Cohere ▷ #cmd-r-bot (11 messages🔥):

Bot Interaction, Alice in Wonderland Reference, Cohere Documentation Search


Modular (Mojo 🔥) ▷ #general (2 messages):

Meeting attendance, Update sharing, YouTube link


Modular (Mojo 🔥) ▷ #mojo (26 messages🔥):

Async Mojo Proposals, Zed Extension for Mojo, Mojodojo Int8 Conversion Issue


tinygrad (George Hotz) ▷ #general (4 messages):

Tinygrad Overview, Need for Tinygrad, Tiny Corp's Funding and Vision, AI Chip Companies

Link mentioned: the tiny corp raised $5.1M: Here we go again. I started another company. The money is in the bank.


tinygrad (George Hotz) ▷ #learn-tinygrad (16 messages🔥):

Learning Tinygrad, Tensor Stacking Issues, Recursion Error in Tinygrad, Getting Tensor Attributes, Using Tensor Functions

Link mentioned: Because Reading is Fundamental: Most discussions show a bit of information next to each user: What message does this send? * The only number you can control printed next to your name is post count. * Everyone who reads this will see ...


Torchtune ▷ #papers (13 messages🔥):

Inference-time scaling in LLMs, Jailbreaking O1 for model training, Automated evaluation methods in healthcare, False equivalence in medical training, Role of AI in medicine


OpenInterpreter ▷ #general (10 messages🔥):

Open Interpreter Setup, Video Editing Capabilities, Cmder Performance Issues


OpenInterpreter ▷ #O1 (2 messages):

Deepseek Model Name, API Key Variable for Deepseek


OpenInterpreter ▷ #ai-content (1 messages):

velinxs: how can you integrate deepspeek into oi?


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (7 messages):

2024 MOOC nostalgia, Beginner friendliness of the MOOC, Fall 2024 MOOC lectures, Certificate release for Fall 2024 MOOC


Nomic.ai (GPT4All) ▷ #general (5 messages):

NPU Support in GPT4All, VPN and Reverse Proxy for GPT4All Access, Model Versions on Hugging Face


LlamaIndex ▷ #blog (1 messages):

Agent vs. Naive RAG, Weaviate Integration, LlamaIndex Performance


LlamaIndex ▷ #general (3 messages):

QuestionsAnsweredExtractor customization, Prompt template function mappings, LlamaIndex package installation

Link mentioned: Advanced Prompt Techniques (Variable Mappings, Functions) - LlamaIndex: no description found


LAION ▷ #general (2 messages):

Meta JASCO, Joint Audio and Symbolic Conditioning, Music Modeling

Link mentioned: facebook/jasco-chords-drums-melody-1B · Hugging Face: no description found


DSPy ▷ #general (1 messages):

Ambient Agents, DSPy implementation examples


MLOps @Chipro ▷ #events (1 messages):

Healthcare and AI, Finance and AI, AI-enabled solutions

Link mentioned: Welcome! You are invited to join a meeting: Healthcare, Finance, and Artificial Intelligence. After registering, you will receive a confirmation email about joining the meeting.: Healthcare, Finance, and Artificial Intelligence (AI) are increasingly intertwined in today's world. The interconnectedness among these fields has also brought to the fore opportunities and challenges...






{% else %}

The full channel-by-channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share it with a friend! Thanks in advance!

{% endif %}