Frozen AI News archive

Google wakes up: Gemini 2.0 et al

**Google DeepMind** launched **Gemini 2.0 Flash**, a new multimodal model outperforming Gemini 1.5 Pro and o1-preview, featuring vision and voice APIs, multilingual capabilities, and native tool use. It powers new AI agents like **Project Astra** and **Project Mariner**, with Project Mariner achieving state-of-the-art **83.5%** on the WebVoyager benchmark. **OpenAI** announced ChatGPT integration with **Apple** devices, enabling Siri access and visual intelligence features. **Claude 3.5 Sonnet** is noted as a distilled version of Opus. The AI community's response at **NeurIPS 2024** has been overwhelmingly positive, signaling a strong comeback for Google in AI innovation. Key topics include **multimodality**, **agent development**, **multilinguality**, **benchmarking**, and **model releases**.

Canonical issue URL

AI News for 12/10/2024-12/11/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (207 channels, and 6549 messages) for you. Estimated reading time saved (at 200wpm): 649 minutes. You can now tag @smol_ai for AINews discussions!

It is day 1 sessions at NeurIPS, and as teased with various Gemini-Exp versions, Sundar Pichai came out swinging with Google's first official Gemini 2 model - Gemini Flash. Nobody expected 2.0 Flash to beat 1.5 Pro but here we are:

image.png

It also beats o1-preview on LMArena (but is still behind Gemini-Exp-1206, the suspected 2.0 Pro model).

Pricing is "free" - while 2.0 Flash is still experimental. As if that weren't enough, 2.0 Flash launches with a Multimodal (Vision -AND- Voice) API, and Paige Bailey even stopped by today's Latent Space LIVE/Thrilla on Chinchilla event to show off how it does what OpenAI dared not ship today:

image.png

Image output is also trained and teased but not shipped, but it can draw the rest of the owl like you have never seen.

They also announced a bunch of features in limited preview:

Comments and impressions from everyone here at NeurIPS, and online on X/Reddit/Discord was overwhelmingly positive. Google is so back!


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here are the key discussions organized into relevant categories:

Major Model Releases & Updates

Industry Developments & Analysis

Research & Technical Developments

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Gemini 2.0 Flash Achievements and Comparisons

Theme 2. QRWKV6-32B and Finch-MoE-37B-A11B: Innovations in Linear Models

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Google's Gemini 2.0: Strategic Release Amidst OpenAI Announcements

Theme 2. Google GenCast: 15-Day AI Weather Forecast Spearheading Future Predictions

Theme 3. ChatGPT Outages: Troubles Improving Stability and User Dependency

Theme 4. Sora AI Criticisms: Inferior Outputs Against Rivals and Discontent Among Users


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. New AI Models and Significant Updates

Theme 2. AI Tool Performance and Comparative Analysis

Theme 3. Feature Integrations and Platform Enhancements

Theme 4. Pricing, Usage Transparency, and Subscription Models

Theme 5. Training, Fine-Tuning, and Cutting-Edge Research


PART 1: High level Discord summaries

Codeium / Windsurf Discord


OpenAI Discord


Cursor IDE Discord


Eleuther Discord


Bolt.new / Stackblitz Discord


Unsloth AI (Daniel Han) Discord


Nous Research AI Discord


Stability.ai (Stable Diffusion) Discord


Notebook LM Discord Discord


Interconnects (Nathan Lambert) Discord


LM Studio Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


OpenInterpreter Discord


Torchtune Discord


DSPy Discord


LAION Discord


Axolotl AI Discord


Mozilla AI Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium / Windsurf ▷ #announcements (1 messages):

Windsurf Wave 1 Launch, Cascade Memories and terminal automation, Updated pricing and usage transparency, Improved Language support for Python, Cascade image uploads

Links mentioned:


Codeium / Windsurf ▷ #content (1 messages):

Windsurf AI Twitter Giveaway

Link mentioned: Tweet from Windsurf (@windsurf_ai): Excited to announce our first merch giveaway 🏄Share what you've built with Windsurf for a chance to win a care package 🪂 #WindsurfGiveawayMust be following to qualify


Codeium / Windsurf ▷ #discussion (239 messages🔥🔥):

Credit Issues in Windsurf, Customer Support Concerns, Functionality of Cascade and Extensions, Community Discussions on Features, Product Updates and Feedback

Links mentioned:


Codeium / Windsurf ▷ #windsurf (580 messages🔥🔥🔥):

Windsurf Updates, Cascade Model Issues, User Experience with Pricing, Feature Requests for Windsurf, Community Feedback on AI Performance

Links mentioned:


OpenAI ▷ #annnouncements (1 messages):

ChatGPT integration with Apple, 12 Days of OpenAI, Holiday themed demo

Link mentioned: ChatGPT x Apple Intelligence—12 Days of OpenAI: Day 5: Sam Altman, Miqdad Jaffer, and Dave Cummings introduce and demo ChatGPT integration into iOS and macOS while wearing holiday sweaters.


OpenAI ▷ #ai-discussions (517 messages🔥🔥🔥):

Gemini 2.0 Flash, OpenAI services downtime, ChatGPT usage strategies, Image generation performance, API tool comparisons

Link mentioned: no title found: no description found


OpenAI ▷ #gpt-4-discussions (25 messages🔥):

Custom GPT Actions, API Error Handling, Platform Outages, File Formats for Scenarios

Link mentioned: API, ChatGPT & Sora Facing Issues: no description found


OpenAI ▷ #prompt-engineering (11 messages🔥):

Fine-tuning issues, Custom GPT instructions, Tool integration challenges, Canmore tool functions


OpenAI ▷ #api-discussions (11 messages🔥):

Fine-tuning OpenAI models, Custom GPT tool usage, Chaining multiple tools, Canmore tool functions, Protected chat policies


Cursor IDE ▷ #general (410 messages🔥🔥🔥):

Cursor performance issues, Agent mode functionality, Comparison with other AI tools, Gemini model capabilities, Windsurf communication and features

Links mentioned:


Eleuther ▷ #announcements (1 messages):

Neural network training, Training Jacobian analysis, Parameter dependence, Bulk and chaotic subspaces, Training dynamics

Links mentioned:


Eleuther ▷ #general (78 messages🔥🔥):

HumanEval evaluations, OpenAI employee insights on training data, RWKV architectures and models, AdamW weight decay

Links mentioned:


Eleuther ▷ #research (296 messages🔥🔥):

Muon optimizer performance, Understanding harmful biases in LLMs, Regularization techniques in neural networks, Effects of weight decay in transformers, Gradient orthogonalization benefits

Links mentioned:


Eleuther ▷ #lm-thunderdome (7 messages):

lm_eval_harness, Perplexity evaluation, Batch processing in inference frameworks, Token processing utility, AOTriton updates

Link mentioned: [ROCm] Update to AOTriton 0.8b (#140172) · pytorch/pytorch@424156c: Notable new features for SDPA operators on AMD systems from AOTriton 0.8b:1. Nestedtensor support;2. MQA/GQA support;3. Restore Efficient attention support for causal=True and seqlen_q != seqle...


Eleuther ▷ #multimodal-general (1 messages):

tensor_kelechi: https://machinelearning.apple.com/research/multimodal-autoregressive


Bolt.new / Stackblitz ▷ #announcements (3 messages):

OSS Bolt, YouTube Streams, Supabase Integration

Link mentioned: Bolt Office Hours: Week 8: 🔗 LinksBolt.diy: https://bolt.diyCole Medin YouTube: https://www.youtube.com/@ColeMedinBolt.diy announcement: https://twitter.com/stackblitz/status/18668673...


Bolt.new / Stackblitz ▷ #prompting (7 messages):

Web App Development, Shopify API Integration, Data Transformation Tools, Airtable Integration, Webhook Scenarios

Link mentioned: Shopify API, libraries, and tools: Learn about Shopify APIs, libraries, and tools, and select the right option for your use case.


Bolt.new / Stackblitz ▷ #discussions (235 messages🔥🔥):

Bolt AI performance, Firebase vs Supabase, Error handling in Bolt, Token usage concerns, Community support in Bolt


Unsloth AI (Daniel Han) ▷ #general (152 messages🔥🔥):

Qwen 2.5 Fine-tuning, Gemini Voice, AWQ and Adapter Models, Context Length Capabilities, Performance Evaluation of Voice Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (10 messages🔥):

Unsloth Merch, Dropshipping Challenges, Ecommerce Concerns


Unsloth AI (Daniel Han) ▷ #help (58 messages🔥🔥):

Learning CUDA and Triton, Fine-tuning with custom datasets, Memory management during training, Using llama.cpp for model conversion, Data quality for domain adaptation

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (5 messages):

Roles in AI systems, Constrained generation, Feature extraction, Moderation techniques


Unsloth AI (Daniel Han) ▷ #research (4 messages):

WizardLM Arena datasets, OpenPlatypus dataset, QwQ model conversion, MATH dataset

Links mentioned:


Nous Research AI ▷ #announcements (3 messages):

New Projects Channel, Forge Discord Bot Access, Hermes 3 LLM Release

Link mentioned: NousResearch/Hermes-3-Llama-3.2-3B · Hugging Face: no description found


Nous Research AI ▷ #general (90 messages🔥🔥):

Nous Forge Access, Quantum Computing Updates, Neurofeedback Research, AI Collaboration Proposals, Creative AI Simulation

Links mentioned:


Nous Research AI ▷ #ask-about-llms (62 messages🔥🔥):

Coconut model, KV-cache mechanisms, Thought tokens in LLMs, Amnesia mode in models, Custom LLMs on iOS

Links mentioned:


Nous Research AI ▷ #research-papers (8 messages🔥):

DNF for 4D Generation, Chain of Continuous Thought (COCONUT), Github Repository for qtip, Model Capacity Utilization Insights, Communication Theory in AI

Links mentioned:


Nous Research AI ▷ #interesting-links (16 messages🔥):

Gemini 2.0, Gemini Flash, Deep Research feature, Maya: Multilingual Vision-Language Model

Links mentioned:


Nous Research AI ▷ #research-papers (8 messages🔥):

4D Generative Modeling, Chain of Continuous Thought (COCONUT), Model Capacity Utilization, Signal Processing in AI, QTIP GitHub Repository

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (176 messages🔥🔥):

VRAM usage and management, AI model recommendations for image enhancement, Scalping GPUs, Using AI for classification and tagging images, Voice training AI programs

Links mentioned:


Notebook LM Discord ▷ #use-cases (13 messages🔥):

Discord Integration for Notebook, TTRPG Rule Book Utility, Experimenting with Output Styles, Podcast Enhancement Strategies, Solo Adventure Generation

Links mentioned:


Notebook LM Discord ▷ #general (93 messages🔥🔥):

Podcasting with NotebookLM, NotebookLM Features and Limits, Gemini 2.0 Integration, Input Methods for NotebookLM, User Experiences with AI Tools

Links mentioned:


Interconnects (Nathan Lambert) ▷ #events (13 messages🔥):

Microwave Gang, Discord Profile Naming, Open Hangouts Scheduling, Whova App Preferences

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (49 messages🔥):

Gemini 2.0 Flash, OpenAI Product Focus, Video Generation Models, Sora Sign-ups, Coding Capabilities

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (6 messages):

Scaling Laws Debate, Latent Space Podcast Live Event, Influencers and Scaling, Community Engagement, Wholesomeness in AI Discussions

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (11 messages🔥):

Joining Llama team, Gemini secrets, Presentation on reasoning, Major papers on reasoning, Nous Dunks compliment

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (8 messages🔥):

Scalability in AI, Criticism of GM, Responses to Online Discourse

Links mentioned:


Interconnects (Nathan Lambert) ▷ #cv (1 messages):

CV Channel Engagement, MLLMs, VLMs


Interconnects (Nathan Lambert) ▷ #reads (11 messages🔥):

AI Scaling Laws, LLM Creativity Benchmarking, Inference Time Compute, RL LLMs, Scaling LLM Test-Time Compute

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (2 messages):

``


LM Studio ▷ #general (81 messages🔥🔥):

Merging Documents with AI, Running Models on Multiple GPUs, Using LM Studio with Web Access, Handling Model Parameters, Updating LM Studio

Link mentioned: SicariusSicariiStuff/LLAMA-3_8B_Unaligned_BETA_GGUFs · Hugging Face: no description found


LM Studio ▷ #hardware-discussion (5 messages):

Alphacool D5 Pump Setup, LMStudio GPU Usage


Latent Space ▷ #ai-general-chat (52 messages🔥):

Nous Simulators Announcement, Hyperbolic Series A Funding, Gemini 2.0 Flash Launch, Stainless Series A Update, Realtime Multimodal API Introduction

Links mentioned:


Latent Space ▷ #ai-announcements (2 messages):

Latent Space Live 2024, NeurIPS Conference, AI Agents Debate, Bolt Success, YouTube Streaming Event

Links mentioned:


Latent Space ▷ #llm-paper-club-west (7 messages):

Zoom call arrangements, Thriaal on Chinchilla, YouTube live stream

Links mentioned:


Modular (Mojo 🔥) ▷ #general (7 messages):

C and C++ Standardization, Modular Website Forum Access

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (49 messages🔥):

Multi-Paxos Protocol Implementation, Mojo Struct Design, Named Results Performance, Programming Environment Preferences, Mojo Open Source Timeline


Cohere ▷ #discussions (13 messages🔥):

AI tools and chatbots, DM communication, Support requests

Link mentioned: Magic Eight GIF - Magic Eight Eightball - Discover & Share GIFs: Click to view the GIF


Cohere ▷ #questions (8 messages🔥):

Rerank 3.5 English Model, CmdR+Play Bot Status, Aya Expanse Performance, API Request 403 Error, Dataset Recommendations for Quantification

Link mentioned: GitHub - hiyouga/LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024): Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024) - hiyouga/LLaMA-Factory


Cohere ▷ #api-discussions (9 messages🔥):

API Response 403 Issues, VPN Connection Effects, Trial API Key Limitations


Cohere ▷ #projects (9 messages🔥):

Maya Multimodal Model, Open Source Development, Feedback and Support, Future Video Release, Culturally Aware VLM

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

MOOC Feedback, Hackathon Feedback


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (31 messages🔥):

Hackathon Submission Guidelines, Written Article Assignment, API Key Usage, Feedback Submission, Course Completion Requirements

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (2 messages):

Function Calling in AI, ToolBench Platform, Important Research Papers

Links mentioned:


OpenInterpreter ▷ #general (25 messages🔥):

Open Interpreter Desktop App, O1 Pro Capabilities, Website Design Feedback, Pricing for Pro Plan, Actions Beta App

Link mentioned: Open Interpreter: no description found


OpenInterpreter ▷ #O1 (1 messages):

pradipdutta9392: i Think This useful for researchers just they shown in the demo


Torchtune ▷ #dev (16 messages🔥):

DoraLinear Initialization, Module Device Handling, Gradient Management, Parameter Copying Techniques, Use of Optional in Method Signatures

Links mentioned:


Torchtune ▷ #papers (1 messages):

QRWKV6-32B, Finch-MoE-37B-A11B, Computational Efficiency Improvements, RWKV-V6 Attention Mechanism, Language Support Limitations

Link mentioned: Tweet from Rohan Paul (@rohanpaul_ai): New linear models: QRWKV6-32B (RWKV6 based on Qwen2.5-32B) & RWKV-based MoE: Finch-MoE-37B-A11B🚀 Recursal AI converted Qwen 32B Instruct model into QRWKV6 architecture, replacing transformer attentio...


DSPy ▷ #general (10 messages🔥):

O1 Series Impact on DSPy Workflows, Generic Optimization Errors, Backtrack_to Attribute Error, Async Usage Issues, Video and Audio Input Discussions


LAION ▷ #general (3 messages):

Grassroots Science Initiative, Optimizing Inference Throughput, Comparative Performance of Libraries, Knowledge Graphs from Research Papers

Links mentioned:


LAION ▷ #research (6 messages):

Non-LLMs Generalization, Sub-Billion Parameter Models, COCONUT Paradigm, Efficient Small Models

Link mentioned: Tweet from Tanishq Mathew Abraham, Ph.D. (@iScienceLuvr): Training Large Language Models to Reason in a Continuous Latent SpaceIntroduces a new paradigm for LLM reasoning called Chain of Continuous Thought (COCONUT)Extremely simple change: instead of mapping...


Axolotl AI ▷ #general (1 messages):

c.gato: That should be the default


Mozilla AI ▷ #announcements (1 messages):

Mozilla AI hiring, Community Engagement Head role, Lumigator product, Developer Hub, Blueprints initiative

Link mentioned: Head of Community Engagement: Remote






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}