Frozen AI News archive

Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)

**Genesis** is a newly announced **universal physics engine** developed by a large-scale collaboration led by **CMU PhD student Zhou Xian**. It integrates multiple state-of-the-art physics solvers to simulate diverse materials and physical phenomena, targeting robotics applications with features like lightweight, ultra-fast simulation, photo-realistic rendering, and generative data capabilities. The engine is open source and designed for robotics simulation beyond just video generation. Additionally, **OpenAI** released the **o1** model to API with advanced features like function calling and vision support, showing strong math and coding performance. **Google** teased updates on **Gemini 2.0 Pro**, accelerating deployment for advanced users.

Canonical issue URL

AI News for 12/17/2024-12/18/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (215 channels, and 4542 messages) for you. Estimated reading time saved (at 200wpm): 497 minutes. You can now tag @smol_ai for AINews discussions!

You are reading AINews generated by o1-2024-12-17. As is tradition on new frontier model days, we try to publish multiple issues for A/B testing/self evaluation. Check our archives for the o1-mini version. We are sorry for the repeat sends yesterday (platform bug) but today's is on purpose.

December has been the month of Generative Video World Simulators apparently, with Sora Turbo going GA, both Genie 2 and Veo 2 getting teased by Google. Now, a group of academics led by CMU PhD student Zhou Xian have announced Genesis: A Generative and Universal Physics Engine for Robotics and Beyond, a 2 year large scale research collaboration involving over 20 labs, debuting with a drop of water rolling down a Heineken bottle:

image.png

Because it is a physics engine, it can render the same engine from different camera angles:

image.png

as well as expose the driving vectors:

image.png

The "unified physics engine" integrates various SOTA physics solvers (MPM, SPH, FEM, Rigid Body, PBD, etc.), supporting simulation of a wide range of materials: rigid body, articulated body, Cloth, Liquid, Smoke, Deformables, Thin-shell materials, Elastic/Plastic Body, Robot Muscles, etc.

Rendering consistent objects is immediately useful today, but does not sound like the "purist" bitter pilled approach taken by the big labs - being a pile of physics solvers manually put together rather than machine learned through data - but it does have the advantage of being open source and usable today (no paper yet).

If the purpose were video generation, this would already be impressive, but the real goal is robotics. Genesis is really a platform for 4 things:

  1. A universal physics engine re-built from the ground up, capable of simulating a wide range of materials and physical phenomena.
  2. A lightweight, ultra-fast, pythonic, and user-friendly robotics simulation platform.
  3. A powerful and fast photo-realistic rendering system.
  4. A generative data engine that transforms user-prompted natural language description into various modalities of data.

and it should be the robotics applications that should really shine.

image.png

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here are the key discussions organized by topic:

OpenAI o1 API Launch and Features

Google Gemini Updates

Model Development & Architecture

Industry & Business

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Hugging Face's 3B Llama Model: Outperforming the 70B with Search

Theme 2. Moonshine Web: Faster, More Accurate than Whisper

Theme 3. Granite 3.1 Language Models: 128k Context & Open License

Theme 4. Moxin LLM 7B: A Fully Open-Source AI Model

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. Imagen v2 Quality Elevates Image Generation Benchmark

Theme 2. NotebookLM's Conversational Podcast Revolution

Theme 3. Gemini 2.0 Surpass Others in Academic Writing

Theme 4. Veo 2 Challenges Sora with Realistic Video Generation


AI Discord Recap

A summary of Summaries of Summaries by o1-2024-12-17

Theme 1. Challenges in AI Extensions and Projects

Theme 2. New and Upgraded Models

Theme 3. GPU & Inference Pitfalls

Theme 4. Advanced Fine-Tuning & RAG Techniques

Theme 5. NotebookLM and Agentic Workflows


PART 1: High level Discord summaries

Codeium (Windsurf) Discord


Cursor IDE Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Nous Research AI Discord


Notebook LM Discord Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


Interconnects (Nathan Lambert) Discord


Eleuther Discord


Stability.ai (Stable Diffusion) Discord


Perplexity AI Discord


GPU MODE Discord


LM Studio Discord


Stackblitz (Bolt.new) Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


Torchtune Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


LlamaIndex Discord


Gorilla LLM (Berkeley Function Calling) Discord


LAION Discord


LLM Agents (Berkeley MOOC) Discord


Axolotl AI Discord


Mozilla AI Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium (Windsurf) ▷ #discussion (60 messages🔥🔥):

Codeium Issues, Windsurf Performance, Elementor and Codeium Integration, Support Ticket System, JetBrains Extension Issues

Links mentioned:


Codeium (Windsurf) ▷ #windsurf (678 messages🔥🔥🔥):

Windsurf Performance Issues, Comparison of Codeium and Copilot, Cascade Functionality, Llama Model Benchmarking, Free AI Tool Options

Links mentioned:


Cursor IDE ▷ #general (707 messages🔥🔥🔥):

Cursor IDE Updates, Kepler Browser Development, Python Environment Management, O1 Pro Performance, Galileo API Integration

Links mentioned:


aider (Paul Gauthier) ▷ #general (264 messages🔥🔥):

O1 API Release, Aider Benchmarking, Competition in AI Models, Support and Refunds, Using Gemini as Editor

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

Aider and Gemini Flash 2 Grounding, Project Management with Aider, Aider's File Handling Issues, Using Architect vs Ask Modes, Repo Map Concerns

Links mentioned:


aider (Paul Gauthier) ▷ #links (11 messages🔥):

Depth AI, LightRAG, Codebase indexing, AI assistants, Technical accuracy

Links mentioned:


OpenAI ▷ #annnouncements (1 messages):

12 Days of OpenAI, OpenAI Role Customization


OpenAI ▷ #ai-discussions (220 messages🔥🔥):

OpenAI ChatGPT developments, Gemini AI comparison, AI model safety concerns, Generative image differences, User experiences with AI models

Link mentioned: GitHub - AlignAGI/Alignment: Promoting global awareness and action for ethical AI alignment and safeguarding humanity against AI self-replication risks. Includes research, frameworks, and open-source resources.: Promoting global awareness and action for ethical AI alignment and safeguarding humanity against AI self-replication risks. Includes research, frameworks, and open-source resources. - AlignAGI/Alig...


OpenAI ▷ #gpt-4-discussions (3 messages):

GPT Management Training, Editing Custom GPTs


OpenAI ▷ #prompt-engineering (4 messages):

Channel appropriateness, Spam management, Seeking help


OpenAI ▷ #api-discussions (4 messages):

Channel confusion, Spam concerns


Nous Research AI ▷ #general (210 messages🔥🔥):

Prompt Chaining, Falcon Models, AI Tool Use, OpenAI Safety Discussions, Data Preprocessing Optimizations

Links mentioned:


Nous Research AI ▷ #ask-about-llms (13 messages🔥):

Function Calling Methods for Local Models, Language Model Data Recollection, Bias in AI Search Integration, Hermes 3 405B Model Responses, Search Engine Expectations


Nous Research AI ▷ #research-papers (2 messages):

Signal and Noise in Inference, LLM Output Consistency


Nous Research AI ▷ #research-papers (2 messages):

Signal and Noise in AI, Consistency of LLM Outputs


Notebook LM Discord ▷ #announcements (1 messages):

3-panel UI Changes, Removed Suggested Actions, Workarounds for Model Usage, Source-based Actions, Notes to Source Conversion


Notebook LM Discord ▷ #use-cases (27 messages🔥):

Interactive Language Function, Podcast Effectiveness, Using NotebookLM for Gaming, AI-generated Content Concerns, Multilingual Experimentation

Links mentioned:


Notebook LM Discord ▷ #general (194 messages🔥🔥):

NotebookLM Interaction Features, Audio Overview and Podcast Length Control, Notes and Citations, Sharing Notebooks Outside Organizations, Using Google Docs as Source

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (66 messages🔥🔥):

Fine-tuning Llama Models, Multi-GPU Support in Unsloth, Batch Size Considerations, Combining Datasets for Fine-tuning, Unsloth Contributions and Reviews

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (139 messages🔥🔥):

QwQ reasoning models, Unsloth usage and troubleshooting, Training models with LoRA, DiLoCo research and presentations, Installing llama.cpp

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (15 messages🔥):

Llama 3.2 Training Issues, M4 MAX GPUs Support, Community Contributions for Unsloth, Fast Fine-tuning Alternatives for Mac


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

OpenAI o1 model, EVA Llama model, Price drops on models, Provider Pages improvements, New reasoning parameters

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (209 messages🔥🔥):

OpenRouter keys exposure, API call metadata viewing, Using Google AI API with OpenRouter, Reasoning model instruction compliance, Model performance in coding assistance

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (36 messages🔥):

Google Shipmas Releases, Gemini 2.0 Updates, Deep Research in AI, GitHub Copilot Free Tier, Microsoft Investment in Anthropic

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (26 messages🔥):

Olmo pretokenized data, Public S3 bucket access, Cloudflare hosting issues, Hugging Face dataset workaround, AWS credits usage


Interconnects (Nathan Lambert) ▷ #ml-drama (15 messages🔥):

Video Understanding Models, Human Touch in AI, Meta's Legal Issues, Hugging Face Upload Challenges, Translation AI in Smart Glasses

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (21 messages🔥):

Model precision concerns, Coding improvements in models, Anthropic research findings, Emergence of cooperation in LLMs, Alignment faking in language models

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (20 messages🔥):

Trump's Art of the Deal, Meme Creation, AI Meme Applications, Superbowl Halftime References


Interconnects (Nathan Lambert) ▷ #rl (5 messages):

Self-correction behavior in RLVR, New o1 API parameters, Emergent properties in RL training


Interconnects (Nathan Lambert) ▷ #rlhf (14 messages🔥):

Qwen 2.5 7B Tulu 3, Reinforcement Learning (RL) Updates, RLVR Training Methodology, Crazy RL Success


Interconnects (Nathan Lambert) ▷ #posts (31 messages🔥):

AI agents definitions, LinkedIn misinformation, Interconnects business plans, Public engagement, Snail project 2025

Links mentioned:


Eleuther ▷ #general (1 messages):

Retail/E-commerce Ad Content Models, Runway, OpenAI Sora, Veo 2


Eleuther ▷ #research (123 messages🔥🔥):

Koopman Operator Theory in Neural Networks, Emergent Abilities of LLMs, Neural Network Compression Techniques, Iterated Function Composition, Training Efficiency in Generative Models

Links mentioned:


Eleuther ▷ #lm-thunderdome (6 messages):

Passing Extra Arguments to Functions, Task Configurations, Subtask Creation


Eleuther ▷ #gpt-neox-dev (9 messages🔥):

Logging to WANDB, WandB run names from configs, Non-parametric layernorm PR

Link mentioned: gpt-neox/megatron/logging.py at f5325805678c2b9e35aae4528283e0132c5f5bbc · EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries - EleutherAI/gpt-neox


Stability.ai (Stable Diffusion) ▷ #general-chat (122 messages🔥🔥):

Lora Training, Stable Diffusion Models, Quantum Computing, Web UI Recommendations, Video Generation Challenges

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Custom web sources, Perplexity Spaces, Tailored searches


Perplexity AI ▷ #general (108 messages🔥🔥):

Perplexity Pro subscriptions, User Interface feedback, Model performance comparisons, Rate limits and upgrades

Links mentioned:


Perplexity AI ▷ #sharing (4 messages):

Meta's Blocking of OpenAI for-Profit, Microbe Threat Warning, Cell Revitalization, Plant Responses, Dopamine Precursors


Perplexity AI ▷ #pplx-api (1 messages):

Perplexity web search feature, Perplexity API cost overview


GPU MODE ▷ #general (41 messages🔥):

6D Parallelism, PC Troubleshooting, GPU Performance Comparison, Coil Whine Issues

Links mentioned:


GPU MODE ▷ #triton (1 messages):

Kernel Computation Methods, Output Concatenation in Kernels


GPU MODE ▷ #cuda (8 messages🔥):

LLM Model Inference, Cuda Memory Operations, A100 vs H100 Training, CUDA Graphs and Async Operations, AMP Impact on Loss


GPU MODE ▷ #torch (33 messages🔥):

Megatron-LM for efficient training, Torch.compile warnings handling, Coreweave packaging challenges, Handling various shapes in image generation, Distributed training at Gensyn

Links mentioned:


GPU MODE ▷ #cool-links (4 messages):

Raspberry Pi 5 deployment, Edge device LLM models, Esp32 / Xtensa LX7 chips


GPU MODE ▷ #jobs (1 messages):

MatX LLM Accelerator, Hiring for roles, Low level compute kernel, Compiler engineering, ML performance engineering

Link mentioned: Tweet from MatX | Jobs: no description found


GPU MODE ▷ #torchao (5 messages):

int4group scheme, Tinygemm compute, Activation quantization, Matmul kernel processing


GPU MODE ▷ #rocm (1 messages):

MigraphX, ONNX Frontend, MI300X, Opset 11 Support


GPU MODE ▷ #thunderkittens (1 messages):

kimishpatel: what i cam here for 🙂


GPU MODE ▷ #arc-agi-2 (18 messages🔥):

Custom Vision Encoder Development, Chain of Thought Implementation, Experimenting with LLMs, Training Configurations for LLMs, Decentralized Sampling Processes for CoT Prompts

Links mentioned:


LM Studio ▷ #general (87 messages🔥🔥):

LM Studio beta features, Using Llama 3.2 model, Roleplay LLM setup, Connecting LM Studio to mobile, Hardware specifications for running models

Links mentioned:


LM Studio ▷ #hardware-discussion (17 messages🔥):

3060 Ti vs 3060, AMD Radeon VII driver issues, Llama model utilization, Inference performance with GPUs, M2 MacBook Air as a gateway


Stackblitz (Bolt.new) ▷ #prompting (6 messages):

Migrating from Firebase to Supabase, Using create-mf-app with Bootstrap, Google reCAPTCHA Issues, Testing Bolt Pilot GPTs, Vite Pre-Transform Errors


Stackblitz (Bolt.new) ▷ #discussions (97 messages🔥🔥):

Frustrations with Bolt, Seeking Help on Projects, Token Usage, Collaborative Projects, Technical Discussions

Links mentioned:


Cohere ▷ #discussions (42 messages🔥):

Tool use with Maya, Local model integration, Importance of sleep, Hugging Face UI update, Image analysis using VQA


Cohere ▷ #announcements (1 messages):

Rate-limit increase, Multimodal Image Embed endpoint, API key types, Rate limits details, Community engagement

Link mentioned: API Keys and Rate Limits — Cohere: This page describes Cohere API rate limits for production and evaluation keys.


Cohere ▷ #questions (51 messages🔥):

Cohere Structured Outputs Implementation, Embedding Dimensions Concerns, RAG-based PDF Answering System Issues, Cohere Reranker Functionality, Cohere and Nvidia Relationship

Links mentioned:


Cohere ▷ #cmd-r-bot (1 messages):

setupisanoun: hey buddy


Cohere ▷ #projects (2 messages):

Findr Launch, Infinite Memory, Product Hunt

Link mentioned: Tweet from Nishkarsh (@Nish306): we’ve launched on Product Hunt. i would greatly appreciate your support https://www.producthunt.com/posts/findr-remember-everythingwe're giving humans infinite memory and a searchable digital brai...


Cohere ▷ #cohere-toolkit (3 messages):

Cohere Toolkit Deployment, AWS Stream Errors, Docker Logs Analysis


Modular (Mojo 🔥) ▷ #general (22 messages🔥):

Mojo REPL Issues on Archcraft, Magic Environment Challenges, Stable Diffusion Example Reference, Max Installation Problems, Creating Threads for Problem Solving


Modular (Mojo 🔥) ▷ #mojo (57 messages🔥🔥):

Mojo Documentation Updates, Mojo Kernel Definition, Feature Development vs. Syntax, Revisiting Early Mojo Decisions, Compute Kernels in Mojo

Link mentioned: Mojo language basics | Modular Docs): Introduction to Mojo's basic language features.


Modular (Mojo 🔥) ▷ #max (13 messages🔥):

Custom Ops in Mojo, MOToMGP Pass Manager Errors, Documentation Issues, Feature Requests for Error Messages, Improving UX for Custom Ops

Link mentioned: [Feature Request] Single compilation unit kernels and/or improved error messages · Issue #269 · modularml/max: What is your request? This is a 2-part request, but bundled since they both address the same UX issue. Part one is to make the "custom op not found" error message direct users to documentati...


Latent Space ▷ #ai-general-chat (90 messages🔥🔥):

Nvidia Jetson Orin Nano, Github Copilot Free, 1-800-CHATGPT Experience, AI Video Trends, Google Whisk Tool

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

Sakana AI's EvoMerge, DynoSaur, LLM Paper Club Event

Link mentioned: LLM Paper Club (Sakana AI EvoMerge and DynoSaur) · Zoom · Luma: Ramon is joining us for the first time to present…


OpenInterpreter ▷ #general (28 messages🔥):

Open Interpreter Errors, OI 1.x Version, Cloudflare AI Gateway, Truffle AI Computer, Long Term Memory Integration

Links mentioned:


tinygrad (George Hotz) ▷ #general (27 messages🔥):

Benchmarks for LLaMA models, Mergeability of ShapeTrackers in Lean, Counterexamples in view merging, CuTe layout algebra, Challenges in proving layout injectivity

Links mentioned:


Torchtune ▷ #dev (25 messages🔥):

FSDP Adjustment, Bug Fix in TRL, Loss Scaling Discussion, Gradient Scaling, Optimizer in Backward Case

Links mentioned:


Torchtune ▷ #papers (2 messages):

Evolutionary Algorithms, Sakana Scaling, Gradient Techniques


DSPy ▷ #show-and-tell (1 messages):

collabin: https://youtu.be/BrvVheleOqc


DSPy ▷ #papers (4 messages):

AI's impact on knowledge economy, Chain of Continuous Thought

Links mentioned:


DSPy ▷ #general (11 messages🔥):

TypedReAct Class, RouteLLM Maintenance, DSPy Evolution with Reasoning Models

Link mentioned: Agents - DSPy: The framework for programming—rather than prompting—language models.


Nomic.ai (GPT4All) ▷ #general (12 messages🔥):

Jinja template issues, GPT4All CLI usage, Localdocs support in GPT4All, Docker container version


LlamaIndex ▷ #blog (2 messages):

Agentic AI SDR, Composio platform, Function calling in LlamaIndex, Agentic RAG, ReAct integration

Link mentioned: composio/python/examples/quickstarters at master · ComposioHQ/composio: Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling - ComposioHQ/composio


LlamaIndex ▷ #general (4 messages):

OpenAIAgent concurrency, RAG evaluation, Async function execution

Link mentioned: Single-Turn Multi-Function Calling OpenAI Agents - LlamaIndex: no description found


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (3 messages):

BFCL Leaderboard Issues, Gorilla Benchmark for Structured Outputs

Link mentioned:
Berkeley Function Calling Leaderboard V3 (aka Berkeley Tool Calling Leaderboard V3)
: no description found


LAION ▷ #general (1 messages):

GPT-O1 Reverse Engineering, Technical Reports, Twitter Updates on GPT-O1


LAION ▷ #research (1 messages):

GenAI Research Internship, Generative AI advancements, Monetization AI team

Link mentioned: Research Scientist Intern, Monetization AI (PhD): Meta's mission is to build the future of human connection and the technology that makes it possible.


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 messages):

kallemickelborg: Thank you for that!


Axolotl AI ▷ #general (1 messages):

New engineer for RL, KTO assistance


Mozilla AI ▷ #announcements (1 messages):

Developer Hub Announcement, Blueprints Initiative




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}