Frozen AI News archive

OpenAI Voice Mode Can See Now - After Gemini Does

**OpenAI** launched **Realtime Video** shortly after **Gemini**, but it made less of an impact because Gemini arrived first, at lower cost and with fewer rate limits. **Google DeepMind** released **Gemini 2.0 Flash**, featuring enhanced multimodal capabilities and real-time streaming. **Anthropic** introduced **Clio**, a system for analyzing real-world usage of **Claude** models. Together Computing acquired CodeSandbox to launch a code interpreter tool. Discussions highlighted **Meta's Llama 3.3-70B** for its roleplay and prompt handling abilities, where it was said to outperform models like **Mistral Large** and **GPT-4o** on expressiveness and censorship. The AI community also traded humorous takes on AI outages and model competition, and **ChatGPT** added a Santa mode for holiday interactions. *"Anthropic is capturing the developer ecosystem, Gemini has AI enthusiast mindshare, ChatGPT reigns over AI dabblers"* was a noted observation from the community.

AI News for 12/11/2024-12/12/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (207 channels, and 6137 messages) for you. Estimated reading time saved (at 200wpm): 616 minutes. You can now tag @smol_ai for AINews discussions!

OpenAI launched Realtime Video a day later than expected, but it made less of a splash because Gemini got there first, at lower cost and with less rate limiting.

The buzz is still solidly pro-Gemini, and we enjoy seeing some friendly sniping between undoubtedly SOTA, very hard-working teams.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here are the key topics organized from the Twitter discussions:

AI Model Releases & Updates

AI Infrastructure & Development

Industry & Market Updates

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Meta's Llama 3.3-70B: Roleplaying and Prompt Handling Excellence

Theme 2. Microsoft's Phi-4: Small Model, Big Benchmark Results, Skepticism Remains

Theme 3. OpenAI o1 vs Claude 3.5 Sonnet: Subscription Showdown

Theme 4. Gemini series shines in Math Benchmarks, Growing Cognitive Reputation

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. NeurIPS 2024 Sabotage Allegations Disrupt Research

Theme 2. Controversial 'Stop Hiring Humans' Campaign in SF

Theme 3. ChatGPT's Santa Voice: Seasonal Gimmick or Revolutionary?

Theme 4. OpenAI’s 12 Days of Releases: Video in AVM


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. AI Model Showdowns: Gemini vs. Claude

Theme 2. GPU Frenzy: New Launches and Scalping Wars

Theme 3. AI Tool Turbulence: Updates, Bugs, and Integrations

Theme 4. MLOps Marvels: Innovations in Training and Optimization

Theme 5. Community Catalysts: Hackathons, AMA Sessions, and Collaborative Tools


PART 1: High level Discord summaries

Codeium / Windsurf Discord


aider (Paul Gauthier) Discord


Cursor IDE Discord


OpenAI Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


LM Studio Discord


Bolt.new / Stackblitz Discord


Notebook LM Discord


Nous Research AI Discord


GPU MODE Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


Interconnects (Nathan Lambert) Discord


LlamaIndex Discord


Modular (Mojo 🔥) Discord


DSPy Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


Torchtune Discord


Gorilla LLM (Berkeley Function Calling) Discord


Axolotl AI Discord


Mozilla AI Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LAION Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium / Windsurf ▷ #announcements (1 message):

Windsurf Wave 1 Launch, Cascade Memories, Usage Transparency, Image Upload Capabilities, Improved Python Support

Links mentioned:


Codeium / Windsurf ▷ #discussion (135 messages🔥🔥):

Codeium Plugin Issues, Windsurf Features, Credit Management Concerns, User Experience with AI Integration, Comparisons to Other AI Tools

Links mentioned:


Codeium / Windsurf ▷ #windsurf (647 messages🔥🔥🔥):

Windsurf Performance Issues, Gemini Models vs Claude, Internal Errors in Cascade, User Feedback on Codeium, Support Ticket System

Links mentioned:


aider (Paul Gauthier) ▷ #general (1026 messages🔥🔥🔥):

O1 Pro Performance, Gemini Flash, DeepSeek, Devin AI, OpenHands

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (90 messages🔥🔥):

Aider Installation Issues, Aider and Rust Dependencies, Gemini Model Comparisons, Aider's Commenting Functionality, Aider User Experience and Feedback

Links mentioned:


Cursor IDE ▷ #general (620 messages🔥🔥🔥):

Gemini vs Claude, Cursor performance, AI tools and price, Web hosting solutions, Programming challenges

Links mentioned:


OpenAI ▷ #annnouncements (1 message):

12 Days of OpenAI, Santa Mode, Advanced Voice features

Link mentioned: Santa Mode & Video in Advanced Voice—12 Days of OpenAI: Day 6: Kevin Weil, Jackie Shannon, Michelle Qin, and Rowan Zellers introduce and demo the new Santa voice, as well as video and screensharing in Advanced Voice.


OpenAI ▷ #ai-discussions (417 messages🔥🔥🔥):

Project Astra, Gemini 2.0 vs. OpenAI, AI Image Generation, Voice AI Developments, AI Model Comparisons

Links mentioned:


OpenAI ▷ #gpt-4-discussions (21 messages🔥):

OpenAI service outage, Custom GPT file format, ChatGPT recovery updates, AI view feature release, User API call handling

Links mentioned:


OpenAI ▷ #prompt-engineering (5 messages):

Canmore Interaction Functions, Custom GPT for Presentation Slides, Formatting Evals


OpenAI ▷ #api-discussions (5 messages):

Canmore Tool Functions, Custom GPT for Lectures, Formatting Evals


Perplexity AI ▷ #general (433 messages🔥🔥🔥):

Gemini Performance Comparison, Perplexity App Issues, Pro Subscription Queries, O1 and Reasoning Model, LinkedIn Integration Announcement

Links mentioned:


Perplexity AI ▷ #sharing (6 messages):

B650E Taichi issues, Yong Yuan Niwan Cheng Sinaitoy, GPR devices and methodologies, Poetry requests, Advertising copywriting


Perplexity AI ▷ #pplx-api (6 messages):

Perplexity API card payment issues, 3D Secure transaction problems, Creation of Perplexity Pages


Unsloth AI (Daniel Han) ▷ #general (265 messages🔥🔥):

DPO with Llama 3.3, Merging models and quantization, LoRA adapters and fine-tuning, 4-bit vs 16-bit model merging, Unsloth license and compatibility

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (1 message):

SPDL: Faster AI Model Training, Thread-based Data Loading, Reality Labs AI Research


Unsloth AI (Daniel Han) ▷ #help (15 messages🔥):

Unsloth AI Installation Issues, Fine-Tuning Process, Multi-GPU Training Memory Error, Using Colab for Training, Ollama Setup

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (3 messages):

OpenPlatypus Dataset, Minimum Wage Debate in Stenland, QwQ Model Development, Mathematics Aptitude Test of Heuristics Dataset

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (208 messages🔥🔥):

Upcoming 5090 GPU release, Scalping practices for GPUs, Model recommendations for image generation, Challenges with Stable Diffusion models, Discord community for Local Video AI Models

Links mentioned:


Eleuther ▷ #announcements (1 message):

Training Jacobian Analysis, Parameter Space Dynamics, Neural Network Training, Impact of Data on Training, Upcoming Papers Series

Links mentioned:


Eleuther ▷ #general (70 messages🔥🔥):

RWKV model releases, AdamW weight decay significance, NeurIPS discussions, ARC prize controversies, Concerns over VAR paper misconduct

Links mentioned:


Eleuther ▷ #research (117 messages🔥🔥):

Muon Optimizer, Negative Attention Weights, Attention Mechanism Insights, Prepending Information in Models, Alternative Softmax Approaches

Links mentioned:


Eleuther ▷ #lm-thunderdome (2 messages):

Saving model outputs


LM Studio ▷ #general (152 messages🔥🔥):

LM Studio Model Support, Running LLMs on Mac, Uncensored AI Models, Fine-tuning LLMs, GPU Configuration for LLMs

Links mentioned:


LM Studio ▷ #hardware-discussion (29 messages🔥):

LM Studio GPU Usage, Intel GPU Purchasing Decisions, Gaming Laptop RAM Limitations, B580 GPU Reviews, RTX 3060 Comparisons

Links mentioned:


Bolt.new / Stackblitz ▷ #announcements (1 message):

Bolt beanie, 2024 Holiday Special

Link mentioned: 2024 Holiday Special 🎅: The official website and shop of StackBlitz. Find the latest merch.


Bolt.new / Stackblitz ▷ #prompting (1 message):

Deleting all chats, Chat history errors


Bolt.new / Stackblitz ▷ #discussions (174 messages🔥🔥):

Token Usage Issues, Debugging with Bolt, Integration of Supabase, Frontend vs Fullstack Development, Feature Requests and Improvements

Links mentioned:


Notebook LM Discord ▷ #use-cases (17 messages🔥):

Custom Voices in AI, Roleplaying in TTRPGs, Post-Apocalyptic Musicals, AI-generated Podcasts, Using NotebookLM for Literature Review

Links mentioned:


Notebook LM Discord ▷ #general (125 messages🔥🔥):

NotebookLM Updates, Podcast Customization, YouTube Video Summaries, Gemini 2.0, Source Management

Links mentioned:


Nous Research AI ▷ #general (84 messages🔥🔥):

Hermes Model Benchmarks, Qwen and Mistral Discussions, Event Registration Updates, Math Benchmark Insights, Model Running and Fine-tuning Tools

Links mentioned:


Nous Research AI ▷ #ask-about-llms (10 messages🔥):

Amnesia mode in 3B, iOS apps for LLMs, Unfiltered Hermes, Context length in Ollama, Continuous SFT for Llama Instruct


Nous Research AI ▷ #research-papers (5 messages):

QTIP model analysis, Signal processing resurgence, Communication theory papers

Link mentioned: GitHub - Cornell-RelaxML/qtip: Contribute to Cornell-RelaxML/qtip development by creating an account on GitHub.


Nous Research AI ▷ #research-papers (5 messages):

Qtip Model Discussion, Model Capacity Utilization, Communications Theory in AI

Link mentioned: GitHub - Cornell-RelaxML/qtip: Contribute to Cornell-RelaxML/qtip development by creating an account on GitHub.


GPU MODE ▷ #general (7 messages):

Intel CPUs, Intel Arc B580 vs RX 6750 XT, Torch Compile Performance, Dynamic Padding vs Performance Penalties


GPU MODE ▷ #triton (8 messages🔥):

Fused Kernel for Matmul and Softmax, Fused Attention and Flash Attention, Elementwise Operations on FP16 vs FP32, Floating Point Precision in Triton, Resource Requests for Triton Fused Attention


GPU MODE ▷ #cuda (9 messages🔥):

Best GEMM Implementations, Occupancy and Branching, CUDA Programming Techniques, GPU Glossary Release

Link mentioned: GPU Glossary: A glossary of terms related to GPUs.


GPU MODE ▷ #torch (2 messages):

Forward and Backward Hooks, Activation Checkpointing, Context Function in Checkpointing


GPU MODE ▷ #algorithms (1 message):

konakona666: https://arxiv.org/pdf/2302.02451 smth like that?


GPU MODE ▷ #cool-links (3 messages):

vLLM Office Hours, Machete GEMM Kernel, Trillium TPU, Gemini 2.0 AI Model, Building ML Systems

Links mentioned:


GPU MODE ▷ #torchao (4 messages):

Float8 Training Implementation, DDP vs FSDP in TorchAO


GPU MODE ▷ #off-topic (1 message):

Video Game Datasets, Keyboard/Mouse Inputs, Labeled Actions in Games


GPU MODE ▷ #rocm (1 message):

Instinct devices and XDP kernels, Gemms with CUDA core style MAC, GEMM DL examples in ROCm

Link mentioned: composable_kernel/example/01_gemm/gemm_dl_fp16.cpp at develop · ROCm/composable_kernel: Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators - ROCm/composable_kernel


GPU MODE ▷ #lecture-qa (1 message):

CUDA Performance Checklist, Kernel Occupancy vs Duration


GPU MODE ▷ #liger-kernel (1 message):

0x000ff4: there is an update about KTO on #410


GPU MODE ▷ #self-promotion (11 messages🔥):

Pruna AI guest appearance, GPU Glossary creation, H100 GPU thread count confusion, Tensor core functionality, Register limitations

Link mentioned: Device Hardware | GPU Glossary: no description found


GPU MODE ▷ #🍿 (1 message):

Markdown Blog Version, Eval Performance for Kernels


GPU MODE ▷ #arc-agi-2 (23 messages🔥):

ARC Riddles and LLMs, Transduction vs. Program Induction, Test Time Training Extensions, Image Segmentation and GNNs, ARC Augmentation Strategies

Links mentioned:


Cohere ▷ #discussions (26 messages🔥):

Cohere support issues, Timeout problems, System status updates, User interactions, Error messages

Links mentioned:


Cohere ▷ #questions (40 messages🔥):

Quantization Techniques in Model Training, Cohere Go SDK Update Needs, Aya Expanse Model Usage, Model Calibration Datasets, Licensing Issues and Compliance

Links mentioned:


Cohere ▷ #api-discussions (2 messages):

403 Error Response, VPN Usage, IP Information, Email Issues


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 message):

Hackathon submission deadline, Submission platform change, Evaluation criteria, Announcement timeline


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 message):

Advanced Large Language Model Agents MOOC, LLM technology advancements, Mathematics in AI, AI Code Generation, Program Verification


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (44 messages🔥):

Article Assignment Information, Quizzes and Certificate Requirements, Hackathon Participation, Course Feedback and Future Offerings, TA Assistance

Links mentioned:


Interconnects (Nathan Lambert) ▷ #events (3 messages):

Meeting Location, Event Details


Interconnects (Nathan Lambert) ▷ #news (4 messages):

Android XR, Gemini, Google's Augmented Reality, Live Translation, Smart Glasses

Link mentioned: I saw Google’s plan to put Android on your face: This is the closest I’ve ever been to being Tony Stark.


Interconnects (Nathan Lambert) ▷ #ml-drama (3 messages):

Content battles, Wholesome interactions

Link mentioned: Michelangelo D’Agostino (@mdagost.bsky.social): I have no inside information, only what @natolambert.bsky.social wrote in that post:


Interconnects (Nathan Lambert) ▷ #random (12 messages🔥):

Nous Dunks, Nextcloud Promotion, Frankle-signed Databrick Bricks, OpenAI vs Anthropic, Claude Pro Subscription

Link mentioned: Tweet from Tibor Blaho (@btibor91): The Information details rising tensions between OpenAI ($4B revenue in 2024, $157B valuation) and Anthropic ($1B in ARR by end of 2024, $18B valuation), highlighting Anthropic's growth in coding a...


Interconnects (Nathan Lambert) ▷ #cv (9 messages🔥):

MLLM developments sources, VLM insights by Hugging Face, MVLM posts request, University courses on MLLM

Links mentioned:


Interconnects (Nathan Lambert) ▷ #reads (8 messages🔥):

AI Model Creative Benchmarking, Algorithmic Responsibility, Claude's Spam Problem, Tulu 3 Post-Training Techniques

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (2 messages):

Discord critical mass, Technical issues


LlamaIndex ▷ #blog (3 messages):

CalPitch Tool, RAG Agents & SharePoint, Google Gemini 2.0 Launch, Llama Index Compatibility

Link mentioned: Gemini 2.0 Flash: An outstanding multi-modal LLM with a sci-fi streaming mode: Huge announcement from Google this morning: Introducing Gemini 2.0: our new AI model for the agentic era. There’s a ton of stuff in there (including updates on Project Astra and …


LlamaIndex ▷ #general (21 messages🔥):

Slack Bot Personalization, Techniques for Unstructured PDF, Function Calling Defaults, BGEM3 Model Integration, Setting System Prompts in FunctionCallingProgram

Link mentioned: LlamaParse - LlamaIndex: no description found


LlamaIndex ▷ #ai-discussion (1 message):

Athina AI, LLM Experiments, Prompt Engineering Techniques


Modular (Mojo 🔥) ▷ #general (8 messages🔥):

New Forum Experience, Excitement for Upcoming Events, User Level Advancement, Company Praise


Modular (Mojo 🔥) ▷ #announcements (1 message):

Swag challenge, Ask Me Anything sessions, Community packages early access, Async Mojo implementation, Mojo optimization pipeline


Modular (Mojo 🔥) ▷ #mojo (10 messages🔥):

Open sourcing Mojo, Mojo's mascot, Boitatá, Mojo character name

Link mentioned: Mojo mascot? Python+Mojo=Boitatá (brazilian mythological creature) · modularml/mojo · Discussion #941: Not a issue itself, but something curious for me being Brazilian which brings me an idea. Mojo being a superset of Python (or a "Python++"), reminds me of a famous monster in Brazilian folkl...


DSPy ▷ #show-and-tell (2 messages):

DSPy, Prompt Optimization, Categorization Tasks

Link mentioned: Pipelines & Prompt Optimization with DSPy: Writing about technology, culture, media, data, and all the ways they interact.


DSPy ▷ #general (12 messages🔥):

Video and Audio Input Discussion, LLM Agent Definition Debate, Use of Optimizers with Labeled Data, Impact of AI on Conventional Categories

Links mentioned:


OpenInterpreter ▷ #general (6 messages):

Spider verse glitch effect, Issues with OI in Docker, GitHub model i tutorial update, NVIDIA NIM base URL setup, Thoughts on WebVoyager


OpenInterpreter ▷ #ai-content (1 message):

zohebmalik: Video to advanced voice mode in ChatGPT announced for day 6


tinygrad (George Hotz) ▷ #general (6 messages):

Test Coverage Tools, Finding Dead Code, Coverage.py, gcov tool, Code Quality

Link mentioned: Coverage.py — Coverage.py 7.6.9 documentation: no description found


Torchtune ▷ #papers (3 messages):

QRWKV6-32B Model, Compute Efficiency, Training Innovations, RWKV-V6 Attention, Model Limitations

Link mentioned: Tweet from Rohan Paul (@rohanpaul_ai): New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B🚀 Recursal AI converted Qwen 32B Instruct model into QRWKV6 architecture, replacing transformer attentio...


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (2 messages):

finetuning Gorilla LLM, downloading GoEx model, implementing reversibility, training in Colab


Axolotl AI ▷ #general (1 message):

PYTORCH_TUNABLEOP_ENABLED, PyTorch tunable operations


Mozilla AI ▷ #announcements (1 message):

Mozilla Builders Demo Day, Community Engagement, Social Media Highlights

Link mentioned: Tweet from Mozilla Builders 🔧 (@mozillabuilders): We have chiseled ourselves out of our Demo Day cocoons just in time to write the world's most interesting recap. Seriously, it was spectacular — a confluence of amazing people and incredible techn...





{% else %}

The full channel-by-channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share it with a friend! Thanks in advance!

{% endif %}