Frozen AI News archive

Genesis: Generative Physics Engine for Robotics (o1-mini version)

**OpenAI** launched the **o1 model** API featuring function calling, structured outputs, vision support, and developer messages, achieving **60% fewer reasoning tokens** than its preview. The model excels in math and code with a **0.76 LiveBench Coding score**, outperforming Sonnet 3.5. Beta SDKs for Go and Java and WebRTC support with **60% lower prices** were also released. **Google Gemini 2.0 Pro (Gemini Exp 1206)** deployment accelerated, showing improved coding, math, and reasoning performance. Meta AI FAIR introduced research on training transformers directly on raw bytes using dynamic entropy-based patching. Commercial humanoid robots were successfully deployed by an industry player. **Hugging Face** researchers demonstrated that their **3B Llama model** can outperform the **70B Llama model** on MATH-500 accuracy using search techniques, highlighting efficiency gains with smaller models. Concerns about reproducibility and domain-specific limitations were noted.

Canonical issue URL

AI News for 12/17/2024-12/18/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (215 channels, and 4542 messages) for you. Estimated reading time saved (at 200wpm): 497 minutes. You can now tag @smol_ai for AINews discussions!

You are reading AINews generated by o1-mini-2024-09-12. As is tradition on new frontier model days, we try to publish multiple issues for A/B testing/self evaluation. Check our archives for the o1-2024-12-17 version. We are sorry for the repeat sends yesterday (platform bug) but today's is on purpose.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here are the key discussions organized by topic:

OpenAI o1 API Launch and Features

Google Gemini Updates

Model Development & Architecture

Industry & Business

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Hugging Face's 3B Llama Model: Outperforming the 70B with Search

Theme 2. Moonshine Web: Faster, More Accurate than Whisper

Theme 3. Granite 3.1 Language Models: 128k Context & Open License

Theme 4. Moxin LLM 7B: A Fully Open-Source AI Model

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. Imagen v2 Quality Elevates Image Generation Benchmark

Theme 2. NotebookLM's Conversational Podcast Revolution

Theme 3. Gemini 2.0 Surpass Others in Academic Writing

Theme 4. Veo 2 Challenges Sora with Realistic Video Generation


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. Challenges in AI Extensions and Projects

Theme 2. New and Upgraded Models

Theme 3. GPU & Inference Pitfalls

Theme 4. Advanced Fine-Tuning & RAG Techniques

Theme 5. NotebookLM and Agentic Workflows


PART 1: High level Discord summaries

Codeium (Windsurf) Discord


Cursor IDE Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Nous Research AI Discord


Notebook LM Discord Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Stability.ai (Stable Diffusion) Discord


Perplexity AI Discord


GPU MODE Discord


LM Studio Discord


Stackblitz (Bolt.new) Discord


Cohere Discord


Modular (Mojo 🔥) Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


Torchtune Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


LlamaIndex Discord


Gorilla LLM (Berkeley Function Calling) Discord


LLM Agents (Berkeley MOOC) Discord


Axolotl AI Discord


Mozilla AI Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium (Windsurf) ▷ #discussion (60 messages🔥🔥):

Codeium Extension Issues, Windsurf Performance Problems, Flex Credits Concerns, Connection to Codeium Server, Prompting with o1

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (678 messages🔥🔥🔥):

Windsurf vs Cursor, Model Performance Comparisons, Error Handling in Windsurf, AI Integration in Development, Coding Performance and Tools

Links mentioned:


Cursor IDE ▷ #general (707 messages🔥🔥🔥):

Cursor Update 0.44.2, Development tools in Cursor, PyQt and PySide6 issues, O1 Pro usage, Kepler Community browser

Links mentioned:


aider (Paul Gauthier) ▷ #general (264 messages🔥🔥):

o1 API access, Benchmark Performance, Refund and Support Experiences, Gemini vs. Sonnet, Aider Functionality

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

Aider Support for Gemini Flash 2, Working with /architect and /ask Modes, Managing Code Refactoring, File Upload Issues, Google Search Grounding in Gemini 2.0

Links mentioned:


aider (Paul Gauthier) ▷ #links (11 messages🔥):

Depth AI, LightRAG, Codebase Indexing, AI Assistant Deployment

Links mentioned:


OpenAI ▷ #annnouncements (1 messages):

12 Days of OpenAI, Stay Updated Role


OpenAI ▷ #ai-discussions (220 messages🔥🔥):

OpenAI vs Google AI advancements, Experiences with different AI models, AI and safety concerns, AI for personal assistance, DALL·E vs Midjourney for image generation

Link mentioned: GitHub - AlignAGI/Alignment: Promoting global awareness and action for ethical AI alignment and safeguarding humanity against AI self-replication risks. Includes research, frameworks, and open-source resources.: Promoting global awareness and action for ethical AI alignment and safeguarding humanity against AI self-replication risks. Includes research, frameworks, and open-source resources. - AlignAGI/Alig...


OpenAI ▷ #gpt-4-discussions (3 messages):

Custom GPTs functionality, Manager role in training


OpenAI ▷ #prompt-engineering (4 messages):

Channel Posting Etiquette, Seeking Help in Appropriate Channels


OpenAI ▷ #api-discussions (4 messages):

Channel Overposting, Seeking Help, Proper Channel Usage, Spam Concerns


Nous Research AI ▷ #general (210 messages🔥🔥):

Falcon Model Performance, Prompt Chaining Techniques, OpenAI Safety Discussions, Feedback and Evaluation Systems, API and Tool-Use Support in Models

Links mentioned:


Nous Research AI ▷ #ask-about-llms (13 messages🔥):

Function calling on local models, Bias in function fetching, Effectiveness of search engines, Hermes 3 405B model issues, Pink elephant problem in AI responses


Nous Research AI ▷ #research-papers (2 messages):

Signal and Noise in Inference, Consistency of LLM Output, Long Output Challenges


Nous Research AI ▷ #research-papers (2 messages):

Signal and Noise in Inference, Consistency of LLM Output


Notebook LM Discord ▷ #announcements (1 messages):

3-panel UI changes, Suggested actions removal, Workarounds for missing features


Notebook LM Discord ▷ #use-cases (27 messages🔥):

Multilingual Functionality, Podcast Length Customization, Interactive AI Use Cases, Knowledge Base Generation, Creative Podcast Production

Links mentioned:


Notebook LM Discord ▷ #general (194 messages🔥🔥):

NotebookLM Podcast Features, Interactive Mode Rollout, Audio Overview Functionality, Source Integration and Updates, Case Study Preparation Using NotebookLM

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (66 messages🔥🔥):

Fine-tuning Llama 3.2, Batch Size and Training, Function Calling in Models, Multi-GPU Support in Unsloth, Overfitting in Machine Learning Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (139 messages🔥🔥):

Open Source Reasoning Models, Unsloth Model Training, Fine-Tuning with QwQ, DiLoCo Presentation, LORA vs Model Architecture

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (15 messages🔥):

Llama 3.2 training loss, M4 MAX GPU compatibility, Unsloth support on Mac


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

OpenAI o1 model, Structured outputs, EVA Llama model, Price reductions, Provider pages

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (209 messages🔥🔥):

Exposed OpenRouter keys, Chat details in API, Using OpenRouter API keys with PKCE, OpenRouter pricing structure, Model performance comparisons

Links mentioned:


Eleuther ▷ #general (1 messages):

Retail/E-commerce ad models, Runway, OpenAI Sora, Veo 2


Eleuther ▷ #research (123 messages🔥🔥):

Warmup phase for learning rates, Meta-Learning to reduce overfitting, Compression methods in neural networks, Grokking in large models, Koopman operator theory in neural networks

Links mentioned:


Eleuther ▷ #lm-thunderdome (6 messages):

doc_to_text function arguments, Creating new configs, Overloading config fields


Eleuther ▷ #gpt-neox-dev (9 messages🔥):

WANDB logging, Configuring WANDB run names, Pull Requests on features

Link mentioned: gpt-neox/megatron/logging.py at f5325805678c2b9e35aae4528283e0132c5f5bbc · EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries - EleutherAI/gpt-neox


Stability.ai (Stable Diffusion) ▷ #general-chat (122 messages🔥🔥):

Lora Training Techniques, Current Models in Use, Running Stable Diffusion on Linux, Navigating Image Resolution and Performance, Understanding AI Generated Content and Models

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Custom Web Sources, Perplexity Spaces


Perplexity AI ▷ #general (108 messages🔥🔥):

Perplexity Pro Subscriptions, New Features and Updates, User Experience with AI Models, Rate Limits and Performance, User Interface Suggestions

Links mentioned:


Perplexity AI ▷ #sharing (4 messages):

Meta vs OpenAI Pro-Fit, Microbe Threat Warning, Plant Communication, Dopamine Precursors, Cell Revival

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (1 messages):

Perplexity API, Web Search Feature, Cost Overview


GPU MODE ▷ #general (41 messages🔥):

6D Parallelism Article, PC Troubleshooting, GPU Performance and Coil Whine, Multi-GPU Instances with NVLink, Coil Whine and Audio Experimentation

Links mentioned:


GPU MODE ▷ #triton (1 messages):

Kernel Computation Optimization, Memory Management in GPU, Output Concatenation Techniques


GPU MODE ▷ #cuda (8 messages🔥):

CUDA Memory Copy Issues, Comparing A100 and H100 GPUs, AMP Related Differences


GPU MODE ▷ #torch (33 messages🔥):

Megatron-LM efficiency, Torch.compile warnings handling, Distributed training community, FlexAttention development, Keras/PyTorch contributions

Links mentioned:


GPU MODE ▷ #cool-links (4 messages):

Raspberry Pi 5 Deployment, Edge Device Models, Esp32 / Xtensa LX7 Chips


GPU MODE ▷ #jobs (1 messages):

MatX LLM accelerator, Job openings in ML, ASIC roles

Link mentioned: Tweet from MatX | Jobs: no description found


GPU MODE ▷ #torchao (5 messages):

int4group scheme, Training process quantization, Tinygemm compute method


GPU MODE ▷ #rocm (1 messages):

MigraphX in MI300X, ONNX frontend support, Opset compatibility


GPU MODE ▷ #thunderkittens (1 messages):

kimishpatel: what i cam here for 🙂


GPU MODE ▷ #arc-agi-2 (18 messages🔥):

Custom Vision Encoder, Chain of Thought Generation, Axolotl Configurations, Efficient Sampling Processes, Experimenting with Finetuning

Links mentioned:


LM Studio ▷ #general (87 messages🔥🔥):

LM Studio setup, Qwen QwQ and roleplay LLMs, Model compatibility and errors, Using LM Studio on mobile, New developments in AI models

Links mentioned:


LM Studio ▷ #hardware-discussion (17 messages🔥):

3060ti confusion, AMD driver issues, Llama model performance, Inference hardware desires, RAM requirements for large models


Stackblitz (Bolt.new) ▷ #prompting (6 messages):

Migrating Firebase to Supabase, Using Bootstrap with create-mf-app, Google reCAPTCHA Issues, Testing ChatGPT Bolt Pilot, Vite Pre-Transform Errors


Stackblitz (Bolt.new) ▷ #discussions (97 messages🔥🔥):

Token Waste Issues, Project Collaboration, Bolt.diy Importing Projects, Payment Integration Discussion, User Experience with Bolt

Links mentioned:


Cohere ▷ #discussions (42 messages🔥):

Maya Tool Use, Model Integration Challenges, Sleep Importance, Image Tool Development, Local Model Usage


Cohere ▷ #announcements (1 messages):

Rate-limit increase for Multimodal Embed-v3 Images, Trial vs Production rate limits, API key options and pricing

Link mentioned: API Keys and Rate Limits — Cohere: This page describes Cohere API rate limits for production and evaluation keys.


Cohere ▷ #questions (51 messages🔥):

Cohere Reranker Issues, Using Different Embedding Models, Cohere and Nvidia Dependency, TPU in AI Systems, Vector Store for Different Dimensionality

Links mentioned:


Cohere ▷ #cmd-r-bot (1 messages):

setupisanoun: hey buddy


Cohere ▷ #projects (2 messages):

Product Hunt Launch, Findr App, Digital Memory

Link mentioned: Tweet from Nishkarsh (@Nish306): we’ve launched on Product Hunt. i would greatly appreciate your support https://www.producthunt.com/posts/findr-remember-everythingwe're giving humans infinite memory and a searchable digital brai...


Cohere ▷ #cohere-toolkit (3 messages):

Cohere Toolkit Deployment, AWS Stream Errors, Docker Logs Inspection


Modular (Mojo 🔥) ▷ #general (22 messages🔥):

Mojo on Archcraft Linux issues, Installation of Max and Magic, Using the Mojo REPL, Python requirements in magic environment


Modular (Mojo 🔥) ▷ #mojo (57 messages🔥🔥):

Mojo Documentation Updates, Mojo Kernel Terminology, Compute Kernels vs OS Kernels, Discussion on var Keyword, Argmax and Argmin Removal

Link mentioned: Mojo language basics | Modular Docs: Introduction to Mojo's basic language features.


Modular (Mojo 🔥) ▷ #max (13 messages🔥):

Custom ops in Mojo, Error handling and documentation, Feature request for custom op messages, Max GitHub repo issues, Session loading with custom ops

Link mentioned: [Feature Request] Single compilation unit kernels and/or improved error messages · Issue #269 · modularml/max: What is your request? This is a 2-part request, but bundled since they both address the same UX issue. Part one is to make the "custom op not found" error message direct users to documentati...


OpenInterpreter ▷ #general (28 messages🔥):

Open Interpreter Errors, Latest OI Version, AI Applications and Models, Truffle-1 Computing Stack, Long-Term Memory in OI

Links mentioned:


tinygrad (George Hotz) ▷ #general (27 messages🔥):

Benchmarks of Llama Models, Mergeability in ShapeTrackers, Layout Algebra in CuTe, Algorithm Complexity in Merging, Injectivity in Layout Algebra

Links mentioned:


Torchtune ▷ #dev (25 messages🔥):

FSDP normalization, Scaling factors in loss computation, Bug reports on trl and HF trainer, Optimizer behavior with weight decay, Updates to PRs

Links mentioned:


Torchtune ▷ #papers (2 messages):

Evolutionary Algorithms, Scale Up Evolution, Gradient Techniques


DSPy ▷ #show-and-tell (1 messages):

collabin: https://youtu.be/BrvVheleOqc


DSPy ▷ #papers (4 messages):

AI and Knowledge Economy, Coconut - Chain of Continuous Thought, Autonomous vs Non-Autonomous AI

Links mentioned:


DSPy ▷ #general (11 messages🔥):

TypedReAct integration, RouteLLM maintenance concerns, DSPy evolution with reasoning models

Link mentioned: Agents - DSPy: The framework for programming—rather than prompting—language models.


Nomic.ai (GPT4All) ▷ #general (12 messages🔥):

GPT4All issues, Jinja template functionality, Docker version of GPT4All, Command line interface concerns, Local documents in CLI


LlamaIndex ▷ #blog (2 messages):

AI SDR, Agent Building Crash Course, LlamaIndex Function Calling, Agentic RAG, ReAct

Link mentioned: composio/python/examples/quickstarters at master · ComposioHQ/composio: Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling - ComposioHQ/composio


LlamaIndex ▷ #general (4 messages):

OpenAIAgent concurrency, RAG evaluation discussions

Link mentioned: Single-Turn Multi-Function Calling OpenAI Agents - LlamaIndex: no description found


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (3 messages):

BFCL Leaderboard, Function Call Demo Issues, Gorilla Benchmark for Structured Outputs

Link mentioned: Berkeley Function Calling Leaderboard V3 (aka Berkeley Tool Calling Leaderboard V3) : no description found


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 messages):

kallemickelborg: Thank you for that!


Axolotl AI ▷ #general (1 messages):

New Engineer on Board, Reinforcement Learning Support


Mozilla AI ▷ #announcements (1 messages):

Developer Hub, Blueprints Initiative




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}