Frozen AI News archive

$200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews

**OpenAI** launched the **o1** model with multimodal capabilities, faster reasoning, and image input support, marking it as a state-of-the-art model despite some bugs and mixed community reviews. The new **o1-pro** tier offers unlimited access for $200/month with notable benchmark improvements but some performance trade-offs compared to **claude-3.5-sonnet**. **Google** released the **PaliGemma 2** vision-language model family in sizes **3B, 10B, and 28B**, excelling in visual question answering, image segmentation, and OCR, with day-0 support for fine-tuning. **LlamaIndex** announced discounts and feature updates for large-scale document processing. The AI community also reacted humorously to the new pricing tiers and model comparisons. *"o1 can see now, which makes it the SOTA multimodal model"* and *"most users will be best served by free/Plus tiers"* were notable sentiments.

AI News for 12/4/2024-12/5/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (206 channels, and 6267 messages) for you. Estimated reading time saved (at 200wpm): 627 minutes. You can now tag @smol_ai for AINews discussions!

As Sama teased, OpenAI's 12 days of shipmas (which perhaps includes the Sora API and perhaps GPT4.5) kicked off with the full o1 launch:

https://www.youtube.com/watch?v=iBfQTnA2n2s

The clearest win is that o1 can now see, which Hyungwon notes makes it the SOTA multimodal model, although it still has embarrassing bugs.

As with all frontier reasoning models, we have to resort to new reasoning and instruction-following evals.

o1 has also been shown doing protein search.

As for the new o1 pro, available through the $200/mo unlimited ChatGPT Pro tier, it is unclear just how different a model o1-pro is from o1-full, but the benchmark jumps are not trivial.

Tool use, system messages and API access are on their way.

The community reviews have been mixed, focusing on the obligatory system card detailing safety assessments (with standard alarmism) and mitigations, because the mitigations did appreciably 'nerf' the base o1-full, and the nerfed model also under-performs 3.5 Sonnet.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

All recaps done by Claude 3.5 Sonnet, best of 4 runs.

OpenAI o1 Release and Reactions

PaliGemma 2 Release from Google

LlamaParse Updates and Document Processing

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Google's PaliGemma 2: Major New Vision-Language Models

Theme 2. Visual Model Race: SAM 2 vs SAMURAI Performance

Theme 3. O1's Emergent Behaviors: System Card Revelations

Theme 4. Democratizing AI: New Open-Source Model Breakthroughs

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. OpenAI Pro Launches at $200/mo - Includes o1 Pro Mode & Unlimited Access

Theme 2. Security Alert: Malicious Mining Attack via ComfyUI Package Dependencies

Theme 3. Post-LLM Crisis: Traditional ML Engineers Face Industry Shift

Theme 4. Breakthrough: Fast Video Generation on Consumer GPUs


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. OpenAI's o1 Model: Hype and Hiccups

Theme 2. AI Tools in Turmoil: Windsurf and Cursor IDE Struggles

Theme 3. Model Magic: Unsloth AI's Quantization Quest

Theme 4. New Kids on the Block: Fresh Models and Fierce Competitions


PART 1: High level Discord summaries

Codeium / Windsurf Discord


aider (Paul Gauthier) Discord


Unsloth AI (Daniel Han) Discord


Cursor IDE Discord


Bolt.new / Stackblitz Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


OpenAI Discord


Interconnects (Nathan Lambert) Discord


Notebook LM Discord Discord


Cohere Discord


Nous Research AI Discord


Stability.ai (Stable Diffusion) Discord


Latent Space Discord


Perplexity AI Discord


LM Studio Discord


GPU MODE Discord


Torchtune Discord


OpenInterpreter Discord


LLM Agents (Berkeley MOOC) Discord


Axolotl AI Discord


DSPy Discord


MLOps @Chipro Discord


LAION Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium / Windsurf ▷ #announcements (1 message):

Cascade Resource Exhaustion, Windsurf Load Issues, Premium Model Rate Limiting, Pro/Teams Access Priority


Codeium / Windsurf ▷ #discussion (432 messages🔥🔥🔥):

Windsurf access issues, Pro and Free Trial differences, Claude Sonnet and GPT model availability, User experiences with billing and subscription, Game development projects


Codeium / Windsurf ▷ #windsurf (930 messages🔥🔥🔥):

Claude 3.5 Sonnet Issues, Pro Plan Subscriptions, User Experiences with Windsurf, Monthly Step Limits, User Innovations and Workarounds


aider (Paul Gauthier) ▷ #general (471 messages🔥🔥🔥):

O1 Model Announcements, Aider Multi-Model Functionality, User Experiences with Aider Pro, Rust Project Structure Discussion, New Features in Aider


aider (Paul Gauthier) ▷ #questions-and-tips (50 messages🔥):

Using Aider Architect Mode, Managing API Keys for Hyperbolic Direct, Aider Composer Integration, Commit Message Generation Failure, Documentation Feeding Tools

Link mentioned: OpenAI compatible APIs: aider is AI pair programming in your terminal


Unsloth AI (Daniel Han) ▷ #general (258 messages🔥🔥):

Qwen2-VL Model Fine-tuning, PaliGemma 2 Introduction, WandB Tracking Issues, Multi-GPU Support in GA, Memory Issues and Solutions


Unsloth AI (Daniel Han) ▷ #off-topic (13 messages🔥):

Fimbul's Reddit Experience, Merging Qwen Models, Machine Learning Certifications

Link mentioned: Qwen2-VL CAN merge with qwen2.5 finetunes.: I've been wanting an RP vision model for a long time now. It wasn't supported by mergekit. Nobody has really tuned qwen2-vl, but plenty have tuned...


Unsloth AI (Daniel Han) ▷ #help (67 messages🔥🔥):

Onboarding Assistant Development, Sparse Training of Embeddings, RAG vs. Fine-tuning for Chatbots, Training Speed Estimation for Unsloth, Conversation Script Implementation


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

theyruinedelise: oh congrats i love this


Unsloth AI (Daniel Han) ▷ #research (7 messages):

DeepThought-8B, Llama 3.2 Vision Fine-Tuning, Dynamic 4-bit Quantization, Florence-2 for Fine-Tuning, Model Compression Techniques


Cursor IDE ▷ #general (333 messages🔥🔥):

Cursor IDE functionality, Comparison between Cursor and Windsurf, O1 model and Pro mode, User experiences with Cursor, Issues with code generation


Bolt.new / Stackblitz ▷ #prompting (17 messages🔥):

Database Sync Issues, UI Tweaks with Bolt, Firebase for Game Development, Responsive Design Testing, Feature Request Management

Link mentioned: Tweet from Tomek Sułkowski (@sulco): 💡 Bolt․new tip:With the just introduced "fullscreen" and "responsive" buttons, you can easily test the layout of your app for different screens — even if you're working on a small...


Bolt.new / Stackblitz ▷ #discussions (273 messages🔥🔥):

Token Usage Issues, Mobile Preview Feature, GitHub Repo Integration, CORS Issues with Firebase, Error Handling in Bolt


OpenRouter (Alex Atallah) ▷ #announcements (5 messages):

OpenRouter token generation, Lambda model price reductions, Author Pages feature launch, Google AI Studio models outage, Amazon Nova model family release


OpenRouter (Alex Atallah) ▷ #general (232 messages🔥🔥):

OpenRouter outages, Amazon Nova models, OpenAI O1 updates, Claude's correction behavior, Elon Musk and Sam Altman podcast


OpenRouter (Alex Atallah) ▷ #beta-feedback (4 messages):

Custom Beta Keys Access


Modular (Mojo 🔥) ▷ #mojo (205 messages🔥🔥):

C++ Learning Challenges, Job Acquisition in Programming, Mojo Language Features, User-Defined Dialects in Mojo


Eleuther ▷ #general (36 messages🔥):

Muon Optimizer, Open Source LLMs, Heavyball Implementation of SOAP, AGPL Licensing Discussions, AR Decoders and Codebook Codes


Eleuther ▷ #research (161 messages🔥🔥):

Eval-harness questions, Modded-nanoGPT record, MuP and token-based approaches, Low precision training concepts, Token-dependent mechanisms in RWKV

Link mentioned: Tweet from Braden Koszarsky (@KoszarskyB): New NanoGPT training speed record: 3.28 FineWeb val loss in 4.41 minutesPrevious record: 4.66 minutes Changelog: - Layerwise Token Value Embeddings- hyperparameter tweaks


Eleuther ▷ #interpretability-general (2 messages):

David Bau's Seminar, Interpretability Papers


Eleuther ▷ #lm-thunderdome (4 messages):

MCQ dataset evaluation, Prompting techniques, MMLU template, arc_easy template, eval-harness framework


Eleuther ▷ #gpt-neox-dev (2 messages):

Non-parametric LayerNorm in NeoX, LayerNorm Parameters, Layer Normalization Paper

Link mentioned: LayerNorm — PyTorch 2.5 documentation: no description found


OpenAI ▷ #annnouncements (1 messages):

Exciting new product development, 12 Days of OpenAI


OpenAI ▷ #ai-discussions (112 messages🔥🔥):

ChatGPT's Features and Limitations, User Experiences with ChatGPT Pro, Issues with ChatGPT Accessibility, Pricing Concerns for Pro Models, Online Discussions about AI Capabilities

Link mentioned: I put ChatGPT on a Robot and let it explore the world: The first 500 people to use my link https://skl.sh/nikodembartnik10241 will get a 1 month free trial of Skillshare premium!My tools: https://indystry.cc/my-t...


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

Advanced Voice Programming, GPT Functionality Issues, Image Feature Problems, TranslateGPT Capabilities, Comparing GPT Models


OpenAI ▷ #prompt-engineering (30 messages🔥):

Exploring Reasoning in Models, Prompt Engineering Resources, Markdown Rendering Issues, Using LaTeX for Academic Work, Language Requirements in Servers


OpenAI ▷ #api-discussions (30 messages🔥):

OpenAI Prompt Engineering, Markdown Rendering Issues, LaTeX Rendering in OpenAI, Searching for Communities, API Automation Test Cases


Interconnects (Nathan Lambert) ▷ #events (1 message):

natolambert: will put in email next wednesday


Interconnects (Nathan Lambert) ▷ #news (142 messages🔥🔥):

OpenAI Pro Pricing, Decentralized Training with DeMo, Tsunami Warning in California, o1 Performance vs. Preview, Community Reactions to AI Models


Interconnects (Nathan Lambert) ▷ #memes (15 messages🔥):

o1 Pro performance, LLM reasoning capabilities, OpenAI competition, Community reactions, Simple-evals repository


Interconnects (Nathan Lambert) ▷ #nlp (2 messages):

Price Concerns, Message Reference


Interconnects (Nathan Lambert) ▷ #posts (6 messages):

Model Variance, Response Quality, Replication Attempts


Notebook LM Discord ▷ #use-cases (69 messages🔥🔥):

Privacy Law in NotebookLM, AI-Powered Panel Discussions, Large Language Models' Multilingual Capabilities, Project Odyssey AI Film Maker Contest, NotebookLM Use Cases for Project Managers


Notebook LM Discord ▷ #general (96 messages🔥🔥):

Notebook LM Podcast Feature, Language Support in Notebook LM, Using PDF Sources and Equations, Generating Longer Audio Overviews, Sharing Files in Notebook LM

Link mentioned: ANOTHER Laser Engraver! ...oh, and this thing called Bitcoin?!?: DISCLAIMERThis is NOT financial advice and I am NOT a financial advisor. Some of these geek projects are expensive and can be risky. Crypto Currency is...


Cohere ▷ #discussions (60 messages🔥🔥):

Cohere Theme, Token Prediction Issues, RAG Implementation, Rerank 3.5 Launch, Masked Diffusion in LLMs

Link mentioned: Introducing Rerank 3.5: Precise AI Search: Rerank 3.5 delivers improved reasoning and multilingual capabilities to search complex enterprise data with greater accuracy. 


Cohere ▷ #questions (2 messages):

Connector ID usage, Command R model updates


Cohere ▷ #api-discussions (82 messages🔥🔥):

Rerank 3.5 Model, Cohere API Usage, Integration Challenges, Strict Tools Parameter, Performance Comparisons

Link mentioned: Rerank — Cohere: This endpoint takes in a query and a list of texts and produces an ordered array with each text assigned a relevance score.


Nous Research AI ▷ #general (115 messages🔥🔥):

Model Training and Efficiency, Nous Research Token speculation, Optimizers and Model Quantization, Disruption in LLM Performance, Continuous Learning Opportunities


Nous Research AI ▷ #ask-about-llms (9 messages🔥):

Lingering Sampling, Embedding and Logit Relationships, Auto-looping in LLMs, Token Embedding Experiments


Nous Research AI ▷ #reasoning-tasks (1 message):

AI Engineers recruitment, Multi-model integration


Stability.ai (Stable Diffusion) ▷ #general-chat (116 messages🔥🔥):

Image Generation Issues, flux and comfortable usage, Color Control in Image Editing, Model Testing and Variability, Community Resources for AI Tools

Link mentioned: THE OTHER LoRA TRAINING RENTRY: Stable Diffusion LoRA training science and notesBy yours truly, The Other LoRA Rentry Guy.This is not a how to install guide, it is a guide about how to improve your results, describe what options do,...


Latent Space ▷ #ai-general-chat (102 messages🔥🔥):

OpenAI o1 Release, ElevenLabs AI Agents, Anduril OpenAI Partnership, PaliGemma 2 Launch, New AI Models and Innovations


Latent Space ▷ #ai-announcements (1 message):

swyxio: announced next week's monster paper club https://x.com/swyx/status/1864423257266639166


Perplexity AI ▷ #general (94 messages🔥🔥):

o1 Pro Model Availability, Fake Perplexity App, Complexity Extension, Issues with Image Generation, Language Interpretation Problems


Perplexity AI ▷ #sharing (5 messages):

C, Drug Discovery Pipeline Tools, Prompt Writing Techniques, Web Design Practices, Oldest Alphabetic Writing


Perplexity AI ▷ #pplx-api (2 messages):

Limiting search results, Prompt engineering techniques


LM Studio ▷ #general (78 messages🔥🔥):

LM Studio API Features, Installing LM Studio on Linux, Uninstalling LM Studio, Client-specific LLM Setup, Running Large Models with Limited RAM


LM Studio ▷ #hardware-discussion (3 messages):

ASUS TUF Gaming X570-Plus, Multiple GPUs Performance, Flash Attention Limit on Apple Silicon


GPU MODE ▷ #general (18 messages🔥):

XMMA vs WMMA usage, NVIDIA GPU Emulator Inquiry, Vulkan Discussions, FP8 Benchmarking vs INT8, NVIDIA H100 Access for Experimentation


GPU MODE ▷ #triton (12 messages🔥):

Triton confusion, 3D indexing, TMA load limitations, LLVM errors and GitHub issues, Profiling kernel performance


GPU MODE ▷ #cool-links (17 messages🔥):

Dynamic 4-bit Quantization, HQQ-mix Algorithm, Model Quantization Techniques, Mixtral Model Updates, HQQ Integration for Unsloth


GPU MODE ▷ #jobs (1 message):

Replicate Job Opening, Open Source ML Performance, Company Culture at Replicate

Link mentioned: Machine Learning Engineer - Media Models - Replicate: no description found


GPU MODE ▷ #beginner (3 messages):

Programming languages and frameworks, Triton vs CUDA, Triton IDs


GPU MODE ▷ #pmpp-book (5 messages):

CUDA Warps Scheduling, GPU Core Execution Units, Lecture 37 on GPU Microarchitecture, NVIDIA A100 Documentation


GPU MODE ▷ #off-topic (4 messages):

Environmental Impact of Technology, Knowledge Barrier in Kernel Writing, Jevons Paradox


GPU MODE ▷ #sparsity-pruning (1 message):

Weight Pruning Techniques


GPU MODE ▷ #liger-kernel (1 message):

0x000ff4: okay I have updated my PR about the kto loss


GPU MODE ▷ #self-promotion (1 message):

gemlite updates, matmul kernels, Triton performance enhancements

Link mentioned: GitHub - mobiusml/gemlite: Fast low-bit matmul kernels in Triton: Fast low-bit matmul kernels in Triton. Contribute to mobiusml/gemlite development by creating an account on GitHub.


GPU MODE ▷ #🍿 (2 messages):

Security concerns in submissions, Malicious behavior in competitions, Compute resource management


Torchtune ▷ #general (30 messages🔥):

Merging checkpoints, Model parallel vs tensor parallel, LoRA training changes, Using PyTorch's distributed checkpoint, Megatron model features


Torchtune ▷ #dev (2 messages):

Weight Release Speculation


Torchtune ▷ #papers (9 messages🔥):

Meta's technology, Federated Learning, Community GPU contributions, Block validation metrics, Crypto lottery with LLM


OpenInterpreter ▷ #general (23 messages🔥):

Early Access Notifications, Open Interpreter in VM, Gemini 1.5 Flash Usage, Model I Vision Support, Community Discussions

Link mentioned: Minecraft Dead Chat GIF - Minecraft Dead Chat Dead Chat Xd - Discover & Share GIFs: Click to view the GIF


OpenInterpreter ▷ #O1 (16 messages🔥):

01 Light App Usage, 01 Pro Mode Launch

Link mentioned: Android & iOS - 01: no description found


OpenInterpreter ▷ #ai-content (1 message):

zohebmalik: https://x.com/openai/status/1864729936847868192?s=46&t=G6jp7iOBtkVuyhaYmaDb0w


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

RAG based approach, Spring term 2025 MOOC


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (6 messages):

Closed Captioning for Lectures, Last Lecture Slides


Axolotl AI ▷ #announcements (1 message):

Axolotl swag, Survey respondents rewards


Axolotl AI ▷ #general (4 messages):

Sticker Giveaway, Sticker Survey


DSPy ▷ #general (1 message):

DSPy framework, Text summarization prompts, Initializing DSPy, New user orientation


MLOps @Chipro ▷ #events (1 message):

Live Webinar on AI Success, JFrog's 2024 State of AI & LLMs Report, MLOps and DevOps Integration, AI Deployment Challenges, Featured Speakers

Link mentioned: State of AI Webinar: LIVE WEBINAR | From Challenges to Strategy: Preparing for AI Success in 2025 | December 10, 2024 - 11:00 AM EST


LAION ▷ #research (1 message):

Data-Mixing in LLMs, Decentralized Pre-Training Competition, Subnet 9 Rewards System, Hugging Face FineWeb Edu Dataset, Daily Perplexity and SOTA Benchmarks

Link mentioned: Macrocosmos.ai: no description found






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}