Frozen AI News archive

Olympus has dropped (aka, Amazon Nova Micro|Lite|Pro|Premier|Canvas|Reel)

**Amazon** announced the **Amazon Nova** family of multimodal foundation models at AWS Re:Invent, available immediately with no waitlist in configurations like Micro, Lite, Pro, Canvas, and Reel, with Premier and speech-to-speech coming next year. These models offer **2-4x faster token speeds** and are **25%-400% cheaper** than competitors like **Anthropic Claude** models, positioning Nova as a serious contender in AI engineering. Pricing undercuts models such as **Google DeepMind Gemini Flash 8B**, and some Nova models extend context length up to **300k tokens**. However, benchmarking controversy exists as some evaluations show Nova scoring below **Llama-3 70B** in **LiveBench AI** metrics. Separately, **CycleQD** was introduced by **Sakana AI Labs**, using evolutionary computation for population-based model merging to develop niche LLM agents.

Canonical issue URL

AI News for 12/2/2024-12/3/2024. We checked 7 subreddits, 433 Twitters and 29 Discords (198 channels, and 2914 messages) for you. Estimated reading time saved (at 200wpm): 340 minutes. You can now tag @smol_ai for AINews discussions!

we apologize for the repeated emails yesterday. It was a platform bug we had no control over but we will watch closely as obviously we have zero desire to spam you/harm our own deliverability. fortunately ainews is also founded on the idea that email length and quantity is near (but not quite) free.

As widely rumored (as Olympus) in the past year, AWS Re:invent (full stream here) kicked off, ex-AWS and now Amazon CEO Andy Jassy had quite a bombshell to drop: their own, for real, actually competitive, not screwing around, set of multimodal foundation models, Amazon Nova (report, blog):

image.png

As an incredible (for a large tech player keynote) bonus, there is NO WAITLIST - Micro/Lite/Pro/Canvas/Reel are immediately Generally Available, with Premier and Speech-to-Speech and "Any-to-Any" coming next year.

The LMArena elo is running now, but already this is a much more serious contender for real AI Engineer than the previous Titan generation. Not stressed in the keynote, but of high importance are both the high speed (2-4x faster tok/s vs Anthropic/OpenAI):

image.png

and low cost (25% - 400% cheaper than Claude equivalent):

image.png

Imputing their Arena scores with their nearest neighbor equivalents, this offers near-frontier price-intelligence performance:

image.png

Of course, everyone is making comments about how this lines up with Amazon also investing $4bn in Anthropic, to which, the Everything Store CEO has one answer:

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Theme 1. Amazon Nova Foundation Models: Release, Pricing, and Evaluation

Theme 2. CycleQD: Evolutionary Approach in Language Models

Theme 3. AI Humor and Memes

Theme 4. Hugging Face Concerns and Community Response

Theme 5. New and Noteworthy Model Innovations

Theme 6. AI Winter and Industry Outlook


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. HuggingFace Imposes 500GB Limit, Prioritizes Community Contributors

Theme 2. DeepSeek and Qwen Surpass Expectations, Challenge OpenAI's Position

Theme 3. National Security Concerns Used to Push AI Regulation

Theme 4. New Tools: Open-WebUI Enhanced with Advanced Features

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. ChatGPT Used to Win $1180 Small Claims Court Case Against Landlord

Theme 2. HunyuanVideo Claims State-of-Art Video Generation, Beats Gen3 & Luma

Theme 3. ChatGPT Parent OpenAI Considers Adding Advertisements

Theme 4. Vodafone's AI Commercial Shows New Benchmark in AI Video Production


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1: New Optimizers and Training Techniques Revolutionize AI

Theme 2: New AI Models Stir Excitement and Debate

Theme 3: AI Tools Face Performance and Update Challenges

Theme 4: Community Explores AI Methods and Frameworks

Theme 5: AI Community Engages in Opportunities and Events


PART 1: High level Discord summaries

Nous Research AI Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


aider (Paul Gauthier) Discord


Cursor IDE Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


Notebook LM Discord Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Latent Space Discord


Cohere Discord


GPU MODE Discord


LlamaIndex Discord


LM Studio Discord


LLM Agents (Berkeley MOOC) Discord


OpenInterpreter Discord


DSPy Discord


Torchtune Discord


MLOps @Chipro Discord


Axolotl AI Discord


tinygrad (George Hotz) Discord


LAION Discord


Mozilla AI Discord


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Nous Research AI ▷ #announcements (2 messages):

DeMo Optimizer Release, Nous DisTrO, Decentralized Pre-training, Distributed Training Research

Links mentioned:


Nous Research AI ▷ #general (426 messages🔥🔥🔥):

DisTrO Training Update, Using Smaller Models for Different Tasks, Function Calling vs. MCP in AI Models, Community Contributions to AI Training, Job Opportunities in AI Development

Links mentioned:


Nous Research AI ▷ #ask-about-llms (3 messages):

Techno-Socialism, Nous Research, XCLR8


Nous Research AI ▷ #reasoning-tasks (4 messages):

DisTro Issues, Flux Capacitor Reference, DeLorean Nostalgia

Link mentioned: no title found: no description found


Eleuther ▷ #general (181 messages🔥🔥):

Use of JAX vs. PyTorch, Vendor Lock-in Concerns, Performance Optimizations with Torch Compile, AI Lab Hiring Practices, Collaboration Between Universities and Tech Companies

Links mentioned:


Eleuther ▷ #research (151 messages🔥🔥):

DeMo Optimizer, Differential Attention, Second Order Optimization, NAS in ML, Moving Sofa Problem

Links mentioned:


Eleuther ▷ #interpretability-general (1 messages):

Common Methods Reference, Survey vs Textbook Distinction


Eleuther ▷ #lm-thunderdome (13 messages🔥):

VLLM Seed Configuration, QwQ Preview Leaderboard Status, External Loadable Evals, Versioning and Reproducibility Concerns

Link mentioned: lm-evaluation-harness/lm_eval/main.py at f49b0377bf559f5558e8cd9ebd1190218c7df2a4 · EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


Eleuther ▷ #gpt-neox-dev (2 messages):

Logging Configuration, Performance Breakdown


Modular (Mojo 🔥) ▷ #general (120 messages🔥🔥):

Socket Communication in Mojo, Mojo's SIMD Support, Networking API Design, High-Performance File Server Implementation, Custom Allocators

Links mentioned:


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Magic Package Distribution, Early Access Preview, Community Testing, Feature Iteration


Modular (Mojo 🔥) ▷ #mojo (133 messages🔥🔥):

Mojo Development Insights, Inline References Concept, Reference Trait Proposal, Current Python Support for Mojo, Compilation Structure Updates

Links mentioned:


aider (Paul Gauthier) ▷ #general (154 messages🔥🔥):

OpenRouter performance issues, Aider's new features, Amazon Foundation Models announcement, User experiences with Aider, Troubleshooting Repo-map

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (82 messages🔥🔥):

Using Aider with Docker, Updating Aider, Function Refactoring Challenges, Context Management in Aider, Scraping Documentation for Aider

Links mentioned:


aider (Paul Gauthier) ▷ #links (1 messages):

pierrunoyt: https://supabase.com/blog/supabase-ai-assistant-v2 Nice stuff


Cursor IDE ▷ #general (213 messages🔥🔥):

Cursor Lag Issues, Windsurf vs Cursor Performance, Agent Features and Limitations, Syntax Highlighting Concerns, Chat Functionality Problems after Update

Links mentioned:


Perplexity AI ▷ #general (188 messages🔥🔥):

Perplexity Pro subscription issues, Performance and speed problems, Image generation capabilities, Comparison of AI models, Amazon Nova foundation models

Links mentioned:


Perplexity AI ▷ #sharing (12 messages🔥):

Pending Searches, Partition in Linux, Parameter Counts, Purchasing Items, Vesuvius Challenge Progress

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (7 messages):

API Error Responses, Content Citation Issues


Unsloth AI (Daniel Han) ▷ #general (115 messages🔥🔥):

LoRA finetuning process, Model compatibility issues, Training Llama 3.2, xformers installation issues, Inference and tokenization with finetuned models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (4 messages):

Claude for Coding, Continued Pretraining, Citation of Founders, Understanding Numerical Data, Accounting Domain Tokens


Unsloth AI (Daniel Han) ▷ #help (48 messages🔥):

Unsloth Model Issues, Fine-tuning Challenges, Llama-3 Model Conversion to GGUF, Partially Trainable Embeddings, Model Sequence Length Concerns

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

QWen2 VL 7B finetuning, LLaVA-CoT dataset, Hugging Face model card

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

PhD students' working conditions, 996 work culture, Work output and relaxation, Tokenizers in research


Notebook LM Discord ▷ #announcements (1 messages):

Raiza's departure from Google, NotebookLM team achievements, New venture announcement

Link mentioned: We're Building: no description found


Notebook LM Discord ▷ #use-cases (28 messages🔥):

Notebook LM usage for scripting, OCR tools for PDF handling, Podcast creation, Fiction writing with AI, Multilingual capabilities in AI

Links mentioned:


Notebook LM Discord ▷ #general (140 messages🔥🔥):

NotebookLM Updates, Audio Overview Features, Language Support, User Experience Feedback, Google Drive Integration

Link mentioned: Tweet from Bryan Kerr (@BryanKerrEdTech): I figured out how to listen to NotebookLM Audio Overviews in my podcast app. Now I enjoy my walks and commutes with more purpose.You can do it too. All you need is a Dropbox or OneDrive account and Pu...


OpenAI ▷ #ai-discussions (122 messages🔥🔥):

Italy's AI Regulation, ChatGPT Feature Issues, Voting and Quantum Computing, Content Moderation Challenges, AI Translation Comparisons

Link mentioned: React App: no description found


OpenAI ▷ #gpt-4-discussions (6 messages):

GPT functionality issues, ChatGPT Plus Plan, Transcription models


OpenAI ▷ #prompt-engineering (9 messages🔥):

Custom Instructions in ChatGPT, Improving Prompt Engineering, Learning User Styles, ChatGPT Writing Styles


OpenAI ▷ #api-discussions (9 messages🔥):

Custom Instructions, Prompt Engineering, Storytelling Style Adaptation


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Model removals, Price reductions, Claude 3.5 Haiku discount


OpenRouter (Alex Atallah) ▷ #general (117 messages🔥🔥):

Hermes 405B Model Status, OpenRouter API Key Management, Gemini Flash Errors, New Amazon Nova Models, LLM Tokenization Insights

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (5 messages):

Custom provider keys, BYOK access, Gemini experimental model


Stability.ai (Stable Diffusion) ▷ #general-chat (100 messages🔥🔥):

LORA training, Stable Diffusion Guidance, Scammer Alerts, GPU Utilization, New Image Synthesis Model

Links mentioned:


Latent Space ▷ #ai-general-chat (84 messages🔥🔥):

Pydantic AI, NotebookLM team changes, Hunyuan Video release, Amazon Nova foundation model, ChatGPT's handling of names

Links mentioned:


Latent Space ▷ #ai-announcements (3 messages):

Bolt Launch, AI Agents Discussion, Open Source Strategies, Revenue Growth in AI, AI Interface Dynamics

Link mentioned: Tweet from Latent.Space (@latentspacepod): 🆕 Bolt, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapperwith @ericsimons40 and @itamar_mar!We are excited to catch up with @QodoAI and debut @stackblitz on the pod, w...


Cohere ▷ #discussions (53 messages🔥):

Manufacturing Discussions, New Rerank 3.5 Features, Colpali and Tewi References, Multilingual Support in Rerank, Community Engagement

Links mentioned:


Cohere ▷ #announcements (1 messages):

Rerank 3.5, API deprecations, Multilingual capabilities, Enhanced reasoning, Legacy model lifecycle

Links mentioned:


Cohere ▷ #questions (9 messages🔥):

TooManyRequestsError, Payment Issues with Card, API Key Setup Delay


Cohere ▷ #api-discussions (6 messages):

TooManyRequestsError, Production API Key Setup Delay


Cohere ▷ #projects (1 messages):

Harmony Project, Large Language Model Competition, Natural Language Processing in Questionnaire Harmonisation

Links mentioned:


Cohere ▷ #cohere-toolkit (1 messages):

mrdragonfox: - hey "new" - im mrdragonfox ^^


GPU MODE ▷ #general (7 messages):

Xmma Kernels Performance, Nvjet vs Cutlass Comparisons, GEMM Toolkit Updates, Runtime Error with Meta Tensors


GPU MODE ▷ #triton (3 messages):

Triton MLIR Dialects, Floating Point Representations in Triton, Documentation and Tutorials

Links mentioned:


GPU MODE ▷ #cuda (5 messages):

Warp Schedulers in GPU Architecture, Comparison of FP32 Cores in Different Models


GPU MODE ▷ #torch (1 messages):

bf16 training, debugging tips


GPU MODE ▷ #beginner (26 messages🔥):

MIT Efficient ML Course, Stanford CS 229S Course, Assignments for ML Courses, Machine Learning Optimization Techniques

Links mentioned:


GPU MODE ▷ #youtube-recordings (1 messages):

mobicham: Is the 3-bit version here symmetric or asymmetric ?


GPU MODE ▷ #off-topic (6 messages):

Mastodon for AI/ML, HPC Community on Mastodon, Mastodon Overview

Link mentioned: Mastodon: no description found


GPU MODE ▷ #arm (3 messages):

Low Bit ARM kernels, Low-bit operations, LUT techniques, Bitnet.cpp

Link mentioned: Lecture 38: Low Bit ARM kernels: Speaker: Scott RoySlides: TBD


GPU MODE ▷ #webgpu (1 messages):

Performance Optimizations, TFLOP/s Metrics


GPU MODE ▷ #self-promotion (6 messages):

CUDARC Project, Luminal Framework, Talk Invitation


GPU MODE ▷ #🍿 (3 messages):

KernelBench introduction, Kernel performance evaluation, Leaderboard concerns

Link mentioned: GitHub - ScalingIntelligence/KernelBench: Contribute to ScalingIntelligence/KernelBench development by creating an account on GitHub.


GPU MODE ▷ #thunderkittens (1 messages):

WGMMA+TMA custom kernel, Race Condition in Kernel, Mask Implementation, Shared Memory Issues, Latest Fork Updates


LlamaIndex ▷ #blog (6 messages):

NVIDIA Financial Analysis, LlamaCloud Pipeline with Google Drive, Multi-Agent Meetup at GitHub, AI Apps on Vercel, Amazon's Nova Foundation Models


LlamaIndex ▷ #general (43 messages🔥):

Embedding model limitations, Quarterly report generation, RAG implementations, Structured output from multimodal models, Workflow management for chat history

Links mentioned:


LM Studio ▷ #general (20 messages🔥):

LM Studio Windows Download Issues, LM Studio Performance on Windows, Community Support and Attitudes, Qwen LV 7B Model Functionality


LM Studio ▷ #hardware-discussion (15 messages🔥):

Docker Containers for HF Spaces, Optimal GPU Configurations, FP8 Quantization in Models, Changing LLaMA.cpp Version, Intel Arc Battlemage Cards

Link mentioned: Reddit - Dive into anything: no description found


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (2 messages):

Sierra AI Info Session, Recruitment Opportunities at Sierra

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (2 messages):

Final Lecture & Presentation, Course Completion Certificate, Quizzes Reminder, Course Website Resources

Link mentioned: - YouTube: no description found


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (18 messages🔥):

LLM Agents Learning Course, Post-mortem assignment, Lab assignments requirements, Written article assignment, Social media sharing guidelines

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

GPT-4 PII leaks, AOL search log release

Link mentioned: AOL search log release - Wikipedia: no description found


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (5 messages):

ReAct Paradigm, Implementation Quality, Benchmark Evaluations


OpenInterpreter ▷ #general (21 messages🔥):

Development Branch Updates, OpenAI Compatibility, Usage Issues with Anthropic, Testing Requests, Linux OS Compatibility

Link mentioned: So Close GIF - So Close This - Discover & Share GIFs: Click to view the GIF


OpenInterpreter ▷ #O1 (1 messages):

LiveKit Connection, Device Interaction, Local OpenInterpreter Operations


DSPy ▷ #show-and-tell (3 messages):

Pydantic AI, DSLModel, AI Development, Pydantic Logfi, Live Demos

Links mentioned:


DSPy ▷ #general (7 messages):

Optimization Duration, DSPy on AWS Lambda, Program Of Thought Deprecation


DSPy ▷ #examples (9 messages🔥):

Agentic examples in DSPy, RAG Example in DSPy, Codetree quick version, DSPy Module Class

Links mentioned:


Torchtune ▷ #general (4 messages):

Image Generation in Torchtune, T5 Integration, Fine-tuning Models

Links mentioned:


Torchtune ▷ #papers (1 messages):

pjbontrager: This would be a fun recipe: https://sakana.ai/cycleqd/


MLOps @Chipro ▷ #events (4 messages):

Event Attendance, Registration Process, India Visit


Axolotl AI ▷ #announcements (1 messages):

Office Hours Announcement, Axolotl Survey, Swag Giveaway

Link mentioned: Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.: A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team


Axolotl AI ▷ #general (3 messages):

ADOPT optimizer updates, Axolotl codebase


tinygrad (George Hotz) ▷ #general (1 messages):

jewnex: PR#7987 worth a tweet, run some benchmarks, no gpu hang with beam this time 🚀


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

Thread group/grid sizes in graph rewrites, Optimizations in uopgraph.py


LAION ▷ #general (1 messages):

bio-ML advancements, Gene Diffusion model, mechanistic interpretability, protein sequencing modeling, self-supervised learning

Link mentioned: Through a Glass Darkly | Markov Bio: What does the path toward end-to-end biology look like and what role does human understanding play in it?


Mozilla AI ▷ #announcements (1 messages):

December schedule events, Next Gen Llamafile Hackathon, Introducing Web Applets, Theia IDE demonstration, Llamafile update




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}