Frozen AI News archive

Meta Apollo - Video Understanding up to 1 hour, SOTA Open Weights

**Meta** released **Apollo**, a new family of state-of-the-art video-language models available in **1B, 3B, and 7B** sizes, featuring "Scaling Consistency" for efficient scaling and introducing **ApolloBench**, which speeds up video understanding evaluation by **41×** across five temporal perception categories. **Google Deepmind** launched **Veo 2**, a 4K video generation model with improved physics and camera control, alongside an enhanced **Imagen 3** image model. **OpenAI** globally rolled out ChatGPT search with advanced voice and map features and discussed a potential $2,000/month "ChatGPT Max" tier. Research highlights include achieving **Llama 70B** performance using **Llama 3B** via test-time compute scaling and expanding **Command R7B** language support from 10 to 23 languages. Industry updates feature **Figure AI** delivering humanoid robots commercially and **Klarna** reducing workforce through AI. Notion integrated **Cohere Rerank** for better search. Studies reveal LLMs can recognize their own writing style and show self-preference bias. Discussions note video processing progress outpacing text due to better signal-per-compute and data evaluation.

Canonical issue URL

AI News for 12/13/2024-12/16/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (209 channels, and 11992 messages) for you. Estimated reading time saved (at 200wpm): 1365 minutes. You can now tag @smol_ai for AINews discussions!

Meta starts the week strong with an open model (1B, 3B, 7B) and paper release that you can use immediately: Apollo: An Exploration of Video Understanding in Large Multimodal Models.

While the paper is very tentatively titled, the Huggingface demo shows off how it works in practice, consuming a 24min sample video easily:

image.png

the authors credit their development of "Scaling Consistency" to their efficient scaling up of experiments.

image.png

image.png

They also introduce ApolloBench, a subset of existing benchmarks (e.g. Video-MME, MLVU, LongVideoBench) that cuts evaluation time by 41× (with high correlation) while offering detailed insights in five broad temporal perception categories: Temporal OCR, Egocentric, Spatial, Perception, and Reasoning.

Perhaps the most entertaining part of the paper was the passive aggressive abstract: "Despite the rapid integration of video perception capabilities into Large Multimodal Models (LMMs), the underlying mechanisms driving their video understanding remain poorly understood. Consequently, many design decisions in this domain are made without proper justification or analysis."

Well okay Meta, shots fired.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here are the key discussions organized by topic:

AI Model & Product Releases

Research & Technical Developments

Industry & Business Updates

AI Research Insights

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Meta's Apollo Multimodal Models: Local Execution and VRAM Efficiency

Theme 2. Criticism and Examination of Chain Of Thought Prompts

Theme 3. High Performance Benchmarks: Intel B580 and LLMs

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Claude 3.5's Edge Over OpenAI's O1

Theme 2. Criticism of Apple's LLM Reasoning Capabilities

Theme 3. Google's VEO 2: Advanced Video Creation

Theme 4. Eric Schmidt's Warning on AI Autonomy


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1. AI Models Battle: New Releases and Comparisons

Theme 2. AI Tools Throw Tantrums: Users Grapple with Bugs and Credits

Theme 3. AI Ethics Drama: Alignment and Whistleblower Woes

Theme 4. AI Gets Creative: From Erotic Roleplay to Customized Outputs

Theme 5. AI Research Breakthroughs: New Methods and Models Emerge


PART 1: High level Discord summaries

Codeium / Windsurf Discord


Notebook LM Discord Discord


Cursor IDE Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Bolt.new / Stackblitz Discord


Latent Space Discord


LM Studio Discord


Stability.ai (Stable Diffusion) Discord


Interconnects (Nathan Lambert) Discord


Perplexity AI Discord


Cohere Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


Torchtune Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


OpenInterpreter Discord


DSPy Discord


Axolotl AI Discord


LAION Discord


Mozilla AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium / Windsurf ▷ #announcements (1 messages):

Discord Challenge Winners, YouTube Video Submissions, Windsurf Pro Tier Rewards

Links mentioned:


Codeium / Windsurf ▷ #discussion (212 messages🔥🔥):

Windsurf Features and Issues, User Feedback on Flow Action Credits, Account Management and Support, AI Behavior and Code Changes, Integration with Other Tools

Links mentioned:


Codeium / Windsurf ▷ #windsurf (609 messages🔥🔥🔥):

Windsurf Issues, AI and Dependency, Codeium vs. Gemini, MCP and Function Calling, Ruff Linter and Formatter

Links mentioned:


Notebook LM Discord ▷ #use-cases (96 messages🔥🔥):

Notebook LM Podcast Features, Customizing AI Outputs, Using Different Languages in AI, Creating Engaging Content with AI, AI and the Turing Test

Links mentioned:


Notebook LM Discord ▷ #general (613 messages🔥🔥🔥):

NotebookLM new features, NotebookLM Plus, Interactive mode, Podcast generation, Language settings

Links mentioned:


Cursor IDE ▷ #general (884 messages🔥🔥🔥):

Cursor IDE performance, AI model comparisons, Social media project development, Cursor integrations, Chat management issues

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (544 messages🔥🔥🔥):

Unsloth Model Support, Dependencies and Installation Issues, Triton Installation, Long Context Models, Ilya Sutskever's Talk Insights

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (1 messages):

edd0302: https://main-horse.github.io/posts/visualizing-6d


Unsloth AI (Daniel Han) ▷ #help (236 messages🔥🔥):

Unsloth Training Issues, Model Compatibility with Streamlit, Dataset Loading Problems, Fine-tuning Techniques, Max Sequence Length for Llama 3.2

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (24 messages🔥):

Model Merging Techniques, AI Regulation and Politics, Impact of AI on Society, Nuclear Treaty Comparisons, Perceptions of AI Gains

Link mentioned: Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation: By merging models, AI systems can combine the distinct strengths of separate language models, achieving a balance between multiple capabilities without requiring substantial retraining. However, the i...


OpenAI ▷ #annnouncements (1 messages):

ChatGPT Search Day, 12 Days of OpenAI

Link mentioned: - YouTube: no description found


OpenAI ▷ #ai-discussions (614 messages🔥🔥🔥):

Character AI Performance, OpenAI and Alignment, New AI Models, Local LLMs, AI and Politics

Links mentioned:


OpenAI ▷ #gpt-4-discussions (24 messages🔥):

O1 Pro AI, OpenAI Subscription Discussions, Chess with GPT, LLMs and Calculations, GPT 4o vs GPT 4o-mini


OpenAI ▷ #prompt-engineering (67 messages🔥🔥):

Prompt Engineering Techniques, AI Model Capabilities in Coding, Learning Programming, Memory Management in AI, Creating a Curriculum for Prompt Engineering


OpenAI ▷ #api-discussions (67 messages🔥🔥):

Prompt Engineering, Using ChatGPT for Coding, Memory Management, Prompt Library Concept, Learning Programming Languages


Nous Research AI ▷ #general (327 messages🔥🔥):

AI Government Regulation, Apollo LMMs Release, Hermes 3 Key Access, Model Performance Issues, Community Involvement in AI

Links mentioned:


Nous Research AI ▷ #ask-about-llms (32 messages🔥):

Open-source coding LLMs, Fine-tuning local LLMs, Vector databases and embeddings, Model merging and souping, RNG algorithms in LLMs

Links mentioned:


Nous Research AI ▷ #research-papers (18 messages🔥):

Model Compression Techniques, Application of Communication Theory to AI, Lora Updates in Model Training, Trade-offs in Training Approaches, Position Invariance in MLPs

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): 🌟 Weekly Medical AI Research Roundup 🌟📅 December 7-14, 2024Here's your weekly digest of the most important medical AI papers! 🎉🤖 Medical LLM & Other Models- PediaBench: Chinese Pediatric LLM-...


Nous Research AI ▷ #interesting-links (3 messages):

Byte Latent Transformer, Dynamic Tokenization, Inference Efficiency, Llama 3 Benchmark, Byte-level Models

Links mentioned:


Nous Research AI ▷ #research-papers (18 messages🔥):

Decompression on GPU, Historical influence of Physics on AI, Trellis coding and its applications, Model compression and redundancy, Distributed training methods

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): 🌟 Weekly Medical AI Research Roundup 🌟📅 December 7-14, 2024Here's your weekly digest of the most important medical AI papers! 🎉🤖 Medical LLM & Other Models- PediaBench: Chinese Pediatric LLM-...


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

SF Compute launch, Qwen QwQ price cut, New Grok models from xAI

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

OpenRouter API wrapper, OpenRouter-client

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (372 messages🔥🔥):

Hermes 3 405B performance, Gemini Pro 2 capabilities, Image generation model updates, Prompt caching in LLM providers, Rate limits for Gemini models

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (3 messages):

OpenRouter launch, New feature integration


Eleuther ▷ #general (70 messages🔥🔥):

Grading Criteria for Student Projects, Non-Transformer Models Research, Byte vs Bit Encoding, Model Training Data Shuffling, JAX/Flax vs TensorFlow

Links mentioned:


Eleuther ▷ #research (249 messages🔥🔥):

Attention vs Kernel Methods, Constraint Satisfaction Problems, Reinforcement Learning and Memory, Iterative Reasoning in Neural Networks, Hybrid Architectures in Transformers

Links mentioned:


Eleuther ▷ #interpretability-general (8 messages🔥):

RASP Framework for Transformers, SAE Steering Applications, Contrastive Objectives in MCMC, Negative Results in SAE Research, Dense Probes and SAE Encodings

Links mentioned:


Eleuther ▷ #lm-thunderdome (12 messages🔥):

lm_eval harness with VLLM, Error Issues with VLLM API, VLLM Version Discussions

Links mentioned:


Bolt.new / Stackblitz ▷ #prompting (39 messages🔥):

Bolt Token Usage, Currency Update Issues, Bug Reports, Project Management with Bolt, Integration with Stripe and Supabase

Links mentioned:


Bolt.new / Stackblitz ▷ #discussions (237 messages🔥🔥):

Service Availability Issues, New Features and Integrations, Cost of Tokens and Subscriptions, React Native Development Guidance, Backup and Recovery Options

Links mentioned:


Latent Space ▷ #ai-general-chat (68 messages🔥🔥):

Grok-2 updates, NeurIPS 2024, Veo 2 and Imagen 3 announcements, Byte Latent Transformer, Search in Voice mode

Links mentioned:


Latent Space ▷ #ai-in-action-club (183 messages🔥🔥):

NeurIPS Webcrawl, Prompt Engineering, AI Functions with Marvin, SillyTavern, Entropix and Chat Bots

Links mentioned:


LM Studio ▷ #general (147 messages🔥🔥):

Multimodal Models, Model Fine-tuning, Uncensored Chatbots, RAG Implementation, Model Updates

Links mentioned:


LM Studio ▷ #hardware-discussion (80 messages🔥🔥):

Power Supply Unit (PSU) Ratings, AMD Radeon VII GPU Support, Choosing GPU for AI/ML tasks, Llama Model Usage and Context Limits, Efficient Prompt Strategies

Link mentioned: Git ingest: Replace 'hub' with 'ingest' in any Github Url for a prompt-friendly text


Stability.ai (Stable Diffusion) ▷ #general-chat (224 messages🔥🔥):

Image Manipulation with AI, Stable Diffusion Models, Extensions for Stable Diffusion, Upscaling Generated Images, Stock Market Discussions

Links mentioned:


Interconnects (Nathan Lambert) ▷ #events (1 messages):

natolambert: There are indeed many interconnects fans at neurips. My people 💙💙💙


Interconnects (Nathan Lambert) ▷ #news (67 messages🔥🔥):

LiquidAI funding, Search memory in ChatGPT, DeepMind's Veo 2 and Imagen 3, OpenAI API updates, Performance comparison of AI models

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (44 messages🔥):

NeurIPS Controversy, AI and Geopolitical Context, Implicit Bias in Academia, AI Companies and Cultural Sensitivity, Stupidity vs. Racism

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (52 messages🔥):

WebDev Arena Leaderboard, Hugging Face Account Compromise, OpenAI Whistleblower Incident, GPT-4o Update, Zebra Logic Bench Insights

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (8 messages🔥):

AI Influence in Politics, OpenAI's Sentient Model, Scaling Test-Time Compute, RL Discourse Resurgence

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rl (8 messages🔥):

David Silver sightings, RL Conf standout talks, Ani's Molmo talk, Barto retirement discussion

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rlhf (6 messages):

vLLM Runtime Weight Update API, John Schulman involvement, Anthropic and vLLM relationship, Technology in online RL training

Links mentioned:


Interconnects (Nathan Lambert) ▷ #cv (2 messages):

Apollo Video LLMs, Performance Comparison, Video Understanding in Multimodal Models, Qwen2.5 LLM Usage

Link mentioned: Apollo: Apollo: An Exploration of Video Understanding in Large Multimodal Models


Interconnects (Nathan Lambert) ▷ #reads (8 messages🔥):

Frontier language models sizes, GPT-4o and Claude 3.5 Sonnet parameters, Active vs Total Parameters, Flash models, MOEs with fewer active parameters

Link mentioned: Frontier language models have become much smaller: In this Gradient Updates weekly issue, Ege discusses how frontier language models have unexpectedly reversed course on scaling, with current models an order of magnitude smaller than GPT-4.


Perplexity AI ▷ #announcements (2 messages):

Campus Strategist program, Perplexity Pro gift subscriptions

Link mentioned: Perplexity Pro Subscription | Perplexity Supply: Perplexity Supply exists to explore the relationship between fashion and intellect with thoughtfully designed products to spark conversations and showcase your infinite pursuit of knowledge.


Perplexity AI ▷ #general (168 messages🔥🔥):

Custom Web Sources in Spaces, Support for Pro Users, Perplexity Pro Subscription Queries, Model Performance Issues, YouTube Videos Related to Perplexity

Links mentioned:


Perplexity AI ▷ #sharing (12 messages🔥):

Samsung's Project Moohan, One Hundred Years of Solitude HBO, Harvard AI Training Dataset, Gemini 2.0 Release, New Infinity Types

Links mentioned:


Perplexity AI ▷ #pplx-api (5 messages):

Perplexity API URL issues, Trouble accessing news via API, Model availability in API, Concerns over production API usage

Link mentioned: no title found: no description found


Cohere ▷ #discussions (65 messages🔥🔥):

Cohere Command Models, Runaway AIs Concerns, R7B Model Benchmarks, Upcoming Community Meeting, Code Wizard Hackathon

Links mentioned:


Cohere ▷ #announcements (1 messages):

Command R7B Office Hours


Cohere ▷ #questions (10 messages🔥):

Difference between Rerank and Embed, Performance of the new 7b model, AI in contract clause identification, Cohere's embedding models, Seeking help for code errors


Cohere ▷ #api-discussions (15 messages🔥):

API Access Issues, Using the Chat API, Dataset Upload Errors, Understanding Model Mapping, Rate Limiting Response Headers


Cohere ▷ #cmd-r-bot (62 messages🔥🔥):

Rerank vs Embed, Emotion-Concealing Robots, API Schema Changes, Cohere Agent Pricing, Today's Weather Forecast


Modular (Mojo 🔥) ▷ #general (13 messages🔥):

Mojo RSA Crypto, Prime Number Generation, Optimizations with SIMD Instructions, Zoom Call Recordings


Modular (Mojo 🔥) ▷ #mojo (67 messages🔥🔥):

Mojo and LLVM, Custom Mojo Kernels, Networking Performance, Nightly vs Stable Branches, Database Planning in MAX

Link mentioned: GitHub - cassioneri/teju_jagua: Teju Jagua: Teju Jagua. Contribute to cassioneri/teju_jagua development by creating an account on GitHub.


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

Hackathon Submission Deadline, Submission Process Change, Last Minute Help, Project Excitement


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (29 messages🔥):

Certificate Notifications, OpenAI Credit Issues, LLM Agents Course, Mobile Responsiveness, Resubmission of Assignments

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 messages):

Safety alignment in AI Research Agents, AI Research Resources

Link mentioned: - YouTube: no description found


Torchtune ▷ #general (6 messages):

Torchtune v3.9 updates, Ruff automatic type hinting, Fine-tuning projects, Torcheval syncing metrics issues


Torchtune ▷ #dev (13 messages🔥):

DTensor Construction, Gradient Normalization in FSDP, Scalar vs Scaler Confusion

Links mentioned:


Torchtune ▷ #papers (3 messages):

Generative Verifiers, Scaling Test Time Compute, LLM Performance Enhancement

Links mentioned:


tinygrad (George Hotz) ▷ #general (15 messages🔥):

BEAM Configuration, New Gradient API, Kernel Search Experience, Tinygrad Porting Projects, Backend Support

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

ShapeTracker Explainer, tinygrad Tutorials

Link mentioned: tinygrad-notes/20241217_st.md at main · mesozoic-egg/tinygrad-notes: Tutorials on tinygrad. Contribute to mesozoic-egg/tinygrad-notes development by creating an account on GitHub.


LlamaIndex ▷ #blog (3 messages):

LlamaIndex tutorial, Agentic workflow for contract compliance, Agentic workflow for patient case summaries


LlamaIndex ▷ #general (10 messages🔥):

Creating Query Engine with Vector Store, Handling PDF Errors, Custom Extractors in LlamaIndex, Implementing Contextual Retrieval, NVIDIA NV-Embed-v2 Availability

Link mentioned: GitHub - cklapperich/Eidetic: Contribute to cklapperich/Eidetic development by creating an account on GitHub.


LlamaIndex ▷ #ai-discussion (1 messages):

Langchain Integration, MegaParse Document Parsing

Link mentioned: Integrating Langchain with MegaParse: Unlocking Seamless Document Parsing: Ankush k Singal


OpenInterpreter ▷ #general (7 messages):

Folder creation issues, API response problems, Billing tracking for Litellm, Learning Japanese apps, Using OS locally


DSPy ▷ #examples (5 messages):

Optimization of Claude Sonnet prompt, DSpy outdated examples, Revamping VLM examples


DSPy ▷ #colbert (1 messages):

nsa7211: <@1149658946982916167> can colpali work with handwritten docs too?


Axolotl AI ▷ #general (2 messages):

APOLLO optimizer, LLM training memory efficiency, Multi-turn KTO

Links mentioned:


LAION ▷ #general (1 messages):

Progressive Tokenization, Zero-tree Ordering, DWT Coefficients, VAE Embedding


LAION ▷ #research (1 messages):

Byte Latent Transformer Patches, Large Concept Models, NLP advancements

Link mentioned: no title found: no description found


Mozilla AI ▷ #announcements (1 messages):

Retrieval Augmented Generation, Event Preparations, SQLite-Vec and LlamaFile, Python Development


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (1 messages):

huanzhimao: Update: They are here.




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}