Frozen AI News archive

ChatGPT Canvas GA

**OpenAI** launched **ChatGPT Canvas** to all users, featuring **code execution** and **GPT integration**, effectively replacing Code Interpreter with a Google Docs-like interface. **Deepseek AI** announced their **V2.5-1210** update improving performance on **MATH-500 (82.8%)** and LiveCodebench. **Meta AI Fair** introduced **COCONUT**, a new continuous latent space reasoning paradigm. **Huggingface** released **TGI v3**, processing **3x more tokens** and running **13x faster** than vLLM on long prompts. **Cognition Labs** released **Devin**, an AI developer building Kubernetes operators. **Hyperbolic** raised **$12M Series A** to build an open AI platform with an **H100 GPU marketplace**. Discussions included **AI capabilities and employment impact**, and **NeurIPS 2024** announcements with **Google DeepMind** demos and a debate on AI scaling. On Reddit, **Llama 3.3-70B** supports **90K context length** finetuning using **Unsloth** with **gradient checkpointing** and Apple's **Cut Cross Entropy (CCE)** algorithm, fitting on **41GB VRAM**. **Llama 3.1-8B** reaches **342K context lengths** with Unsloth, surpassing native limits.

Canonical issue URL

AI News for 12/9/2024-12/10/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (206 channels, and 5518 messages) for you. Estimated reading time saved (at 200wpm): 644 minutes. You can now tag @smol_ai for AINews discussions!

It's still early innings but already we are ready to call OpenAI's 12 Days of Shipmas a hit. While yesterday's Sora launch is still (as of today) plagued with gated signups to deal with overwhelming demand, ChatGPT Canvas needs no extra GPUs and launched to all free and paid users today with no hiccup.

image.png

Canvas now effectively supercedes Code Interpreter and is also remarkably Google Docs-like, which further demonstrates the tendency of OpenAI to build Google features faster than Google can build OpenAI.

There's a theory that the jokes ending each episode are a preview of the next one. If this is true, tomorrow's ship will be a doozy.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Here's a categorized summary of the key Twitter discussions:

AI Model & Research Updates

Product Launches & Updates

Industry & Market Analysis

NeurIPS Conference

Memes & Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Llama 3.3-70B Finetuning: 90K Context on <41GB VRAM

Theme 2. DeepSeek V2.5-1210: Final Version and What Next

Theme 3. InternVL2.5 Released: Top Performance in Vision BM

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Google Willow: Quantum Computing's Gargantuan Leap

Theme 2. Gemini 1.5 Outperforms Llama 2 70B: Industry Reactions

Theme 3. Sora Video Generator: Redefining AI Creativity


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1: AI Model Advancements and New Releases

Theme 2: AI Tools and User Experience Challenges

Theme 3: AI Integration in Software Development

Theme 4: Community and Open Source Initiatives in AI

Theme 5: AI in Creative Content and User Interaction


PART 1: High level Discord summaries

Codeium / Windsurf Discord


Eleuther Discord


Cursor IDE Discord


aider (Paul Gauthier) Discord


Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


Perplexity AI Discord


OpenAI Discord


Bolt.new / Stackblitz Discord


Modular (Mojo đŸ”„) Discord


Notebook LM Discord Discord


LM Studio Discord


Nous Research AI Discord


Interconnects (Nathan Lambert) Discord


Latent Space Discord


Axolotl AI Discord


LLM Agents (Berkeley MOOC) Discord


LlamaIndex Discord


Cohere Discord


Torchtune Discord


DSPy Discord


LAION Discord


OpenInterpreter Discord


Mozilla AI Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Codeium / Windsurf ▷ #content (1 messages):

Windsurf AI giveaway, User engagement on Twitter

Link mentioned: Tweet from Windsurf (@windsurf_ai): Excited to announce our first merch giveaway 🏄Share what you've built with Windsurf for a chance to win a care package đŸȘ‚ #WindsurfGiveawayMust be following to qualify


Codeium / Windsurf ▷ #discussion (384 messagesđŸ”„đŸ”„):

Credit System Issues, Windsurf IDE Performance, Cline vs Windsurf, Codeium Plugin Functionality, Support and Outage Communication

Links mentioned:


Codeium / Windsurf ▷ #windsurf (715 messagesđŸ”„đŸ”„đŸ”„):

Windsurf Pricing Model, Flow Credits Issues, AI Capabilities in Development, Devin vs. Windsurf, AI Collaboration Limitations

Links mentioned:


Eleuther ▷ #general (43 messagesđŸ”„):

Draft PR for ML Systems, Reproducibility Concerns in LLMs, HumanEval Benchmark PR, Importance of Training Data, OLMs Hallucination Considerations

Links mentioned:


Eleuther ▷ #research (257 messagesđŸ”„đŸ”„):

Coconut Architecture, Universal Transformers, Gated DeltaNet, EOT Token Handling, Linear Transformers

Links mentioned:


Eleuther ▷ #lm-thunderdome (231 messagesđŸ”„đŸ”„):

GSM8k Evaluation Metrics, Arc Challenge Configurations, Batch Size Effects on Model Performance, RWKV Model Implementation Concerns, Attention Masking Issues in Transformers

Link mentioned: llama3/eval_details.md at main · meta-llama/llama3: The official Meta Llama 3 GitHub site. Contribute to meta-llama/llama3 development by creating an account on GitHub.


Eleuther ▷ #multimodal-general (1 messages):

tensor_kelechi: https://machinelearning.apple.com/research/multimodal-autoregressive


Cursor IDE ▷ #general (331 messagesđŸ”„đŸ”„):

Slow Requests with Cursor, Comparison of AI Models, Issues with Cursor and Agents, User Experiences and Feedback, Code Evaluation with AI

Links mentioned:


aider (Paul Gauthier) ▷ #announcements (1 messages):

Aider v0.68.0 features, API key management, Enhanced shell command support, Experimental Gemini models, Error messaging improvements

Link mentioned: YAML config file: How to configure aider with a yaml config file.


aider (Paul Gauthier) ▷ #general (281 messagesđŸ”„đŸ”„):

Aider Features and Improvements, Gemini Model Performance, Integration of Aider with LangChain, Using Multiple Aider Instances, Aider Tutorials and Resources

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (29 messagesđŸ”„):

Best Practices for Large Codebases, Integration of Aider with Language Servers, Using Aider Outside Command Line, Handling System Prompts in Aider, Differences in Claude Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (161 messagesđŸ”„đŸ”„):

Llama 3.3 ultra long context, Sora model discussion, Fine-tuning Qwen models, Performance of quantized models, Educational access for students

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (85 messagesđŸ”„đŸ”„):

Unsloth Model Installation, Finetuning Gemma 2, CUDA/Triton Kernel Development, Long Text Generation Issues, Using Guidance AI for Non-Conversational Tasks

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (6 messages):

Awesome RAG project, Deep dive on roles and cards, Constrained generation techniques

Link mentioned: GitHub - lucifertrj/Awesome-RAG: RAG-VectorDB-Embedings-LlamaIndex-Langchain: RAG-VectorDB-Embedings-LlamaIndex-Langchain. Contribute to lucifertrj/Awesome-RAG development by creating an account on GitHub.


Unsloth AI (Daniel Han) ▷ #research (4 messages):

APOLLO optimizer for LLMs, QTIP quantization method, Dataset repository for WizardLM Arena paper

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (238 messagesđŸ”„đŸ”„):

Image Enhancement Techniques, Use of Stable Diffusion, Llama 3.2-Vision Model, Memory Management in WebUI, Image Metadata and Tagging

Links mentioned:


Perplexity AI ▷ #general (219 messagesđŸ”„đŸ”„):

Perplexity AI Image Generation, Claude and GPT Models, Custom GPTs Functionality, Perplexity Pro Subscription, AI Tools and Resources

Links mentioned:


Perplexity AI ▷ #sharing (8 messagesđŸ”„):

OpenAI's Sora release, Bitcoin reaching $100K, World's largest gold deposit, Perplexity AI updates, AI and monitoring in 2025

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (2 messages):

Service Restoration


OpenAI ▷ #annnouncements (1 messages):

Canvas updates, 12 Days of OpenAI

Link mentioned: Canvas—12 Days of OpenAI: Day 4: Kevin Weil, Lee Byron, and Alexi Christakis introduce and demo updates to canvas.


OpenAI ▷ #ai-discussions (149 messagesđŸ”„đŸ”„):

Sora Generation Speculations, AI Model Comparisons, LLM Capabilities, User Experience with New Features


OpenAI ▷ #gpt-4-discussions (23 messagesđŸ”„):

Sora account issues, GPT-3.5 development struggles, Domain verification problems, Refunding clients, Communication with moderators


OpenAI ▷ #prompt-engineering (12 messagesđŸ”„):

Custom GPTs continuity, Nested code blocks, Translation effectiveness, OpenAI API model fine-tuning


OpenAI ▷ #api-discussions (12 messagesđŸ”„):

Custom GPTs updates, Nested Code Blocks, Fine-tuning models, Translation quality, Synthesis of continuity


Bolt.new / Stackblitz ▷ #prompting (27 messagesđŸ”„):

Prompting conventions, Subscription management with Bolt, Shopify automation, Document scanning issues, Integration with Airtable

Links mentioned:


Bolt.new / Stackblitz ▷ #discussions (114 messagesđŸ”„đŸ”„):

Token Issues and Subscription Support, Integrations with Payment Gateways, Using Multiple LLMs, Image Upload Problems, Troubleshooting No Preview Available

Links mentioned:


Modular (Mojo đŸ”„) ▷ #general (45 messagesđŸ”„):

Swag Challenge Winners, User Engagement on the Forum, Network Interrupt Issues, Mojo Language Typing, Hugging Face Integration

Links mentioned:


Modular (Mojo đŸ”„) ▷ #mojo (83 messagesđŸ”„đŸ”„):

Destroy keyword in Mojo, Memory management in Multi-Paxos, Ownership semantics, Pros and cons of struct destructors, Implementation challenges in Multi-Paxos

Links mentioned:


Notebook LM Discord ▷ #use-cases (43 messagesđŸ”„):

Podcast Content Creation, Source Utilization Challenges, Language Settings, AI Podcast Generation, Community Engagement in AI

Links mentioned:


Notebook LM Discord ▷ #general (51 messagesđŸ”„):

NotebookLM Features, Podcast Functionality, User Experience Issues, Customization Queries, Language Support

Links mentioned:


LM Studio ▷ #general (78 messagesđŸ”„đŸ”„):

LM Studio Updates, Tailscale Configuration, Model Compatibility Issues, RAG Techniques, Performance Optimization

Links mentioned:


LM Studio ▷ #hardware-discussion (9 messagesđŸ”„):

Cooling Solutions, Reservoirs and Pumps, Alphacool Products, GPU Cooling Setup

Link mentioned: no title found: no description found


Nous Research AI ▷ #announcements (1 messages):

New Channel for Collaborations


Nous Research AI ▷ #general (61 messagesđŸ”„đŸ”„):

Idefics Model Insights, Collaborations in Research, Long-term Memory Pathways, VLM Model Fine-tuning, Forum Creation for Project Discussion

Links mentioned:


Nous Research AI ▷ #ask-about-llms (17 messagesđŸ”„):

Building a Security Agent, ReAct Agent Examples, Observability in RAG Systems, Generating O1-type Synthetic Data, Thinking LLMs Paper from Meta

Link mentioned: Thinking LLMs: General Instruction Following with Thought Generation | Oxen.ai: The release of OpenAI-O1 has motivated a lot of people to think deeply about
thoughts 💭. Thinking before you speak is a skill that some people have better than others 😉, but a skill that LLMs have c...


Nous Research AI ▷ #interesting-links (1 messages):

deki04: https://x.com/omarsar0/status/1866143542726340890?s=46


Nous Research AI ▷ #reasoning-tasks (5 messages):

Scratchpad Feedback, Visual Representation of Outputs, Core Reasoning Task Insights


Interconnects (Nathan Lambert) ▷ #events (12 messagesđŸ”„):

Conference Profile Publicity, Microwave Gang, Discord Profile Names

Link mentioned: Reddit - Dive into anything: no description found


Interconnects (Nathan Lambert) ▷ #news (12 messagesđŸ”„):

DeepSeek V2.5 Launch, Internet Search Feature, DeepSeek License Discussion

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (4 messages):

Sam confirmed as CEO, User identity confusion, Footwear preferences


Interconnects (Nathan Lambert) ▷ #random (11 messagesđŸ”„):

vLLM Project Joins PyTorch, Expectations on Model Capabilities, Conference Experiences

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (7 messages):

xAI and Pepes, Fchollet's Scaling Law Discussion, Twitter Dynamics

Link mentioned: Tweet from François Chollet (@fchollet): Dude, what are you talking about?1. I have no idea who you are, so I don't "think" anything about you.2. I have never bet against scaling laws. Rather, I have pushed back against the idea ...


Latent Space ▷ #ai-general-chat (44 messagesđŸ”„):

WaveForms AI Launch, vLLM Joins PyTorch, Devin Generally Available, Molmo Full Recipe Release, State of AI Agents 2024 Report

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

Sora Launch, Generative Video WorldSim, DeepMind Genie, VideoPoet, DeCAF Test of Time Winner

Link mentioned: Tweet from Latent.Space @NeurIPSConf Live! (@latentspacepod): 🆕 Generative Video WorldSim, Diffusion, Vision, Reinforcement Learning and RoboticsOur longest episode ever! https://latent.space/p/icml-2024-video-robotsa deep dive into- @OpenAI Sora (with @billpe...


Axolotl AI ▷ #general (44 messagesđŸ”„):

Torch Compile Usage, Reward Models in RL, KTO Model Benefits, Dataset Limitations, Quantitative Research in Fine-tuning


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (25 messagesđŸ”„):

Quizzes Access, Article Submission Guidelines, Hackathon Write-Up Requirements, Social Media Posting for Articles, Course Completion Requirements

Link mentioned: Large Language Model Agents MOOC: MOOC, Fall 2024


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (3 messages):

Function Calling in LLMs, Important Papers in Tool Learning

Links mentioned:


LlamaIndex ▷ #blog (5 messages):

LlamaParse Auto Mode, LlamaParse Webinar, Document Agent Workflows, LlamaParse JSON Mode, Invoice Processing Agent

Link mentioned: no title found: no description found


LlamaIndex ▷ #general (21 messagesđŸ”„):

Running Local Agent Examples, Issues with Document Retrieval, ColPali Reranking Feature, Cohere Rerank Postprocessor, Using Smaller Models

Links mentioned:


Cohere ▷ #discussions (10 messagesđŸ”„):

Cohere's business context, Irrelevant humor in discussions


Cohere ▷ #questions (9 messagesđŸ”„):

Rerank 3.5 English model plans, CmdR+Play Bot status, Aya-expanse performance, API request 403 error

Link mentioned: Careers: Our team of ML/AI experts is passionate about helping developers solve real-world problems. From our offices in Toronto, London, and Palo Alto, we work at the cutting edge of machine learning to unloc...


Cohere ▷ #api-discussions (2 messages):

API request errors, Trial key limitations


Torchtune ▷ #dev (17 messagesđŸ”„):

Merging Config Files, TorchTune PR Discussion, DoraLinear and LoraLinear Initialization, Tensor Device Handling, Magnitude Calculation

Links mentioned:


DSPy ▷ #show-and-tell (1 messages):

LangWatch Optimization Studio, DSPy programs, Low-code tools, Open source release

Link mentioned: GitHub - langwatch/langwatch: Source available LLM Ops platform and LLM Optimization Studio powered by DSPy.: Source available LLM Ops platform and LLM Optimization Studio powered by DSPy. - langwatch/langwatch


DSPy ▷ #general (13 messagesđŸ”„):

DSPy documentation access, API reference location, O1 series model impact, Error during optimization

Link mentioned: DSPy Documentation: The framework for programming—rather than prompting—language models.


LAION ▷ #general (2 messages):

Awareness of AI capabilities, Grassroots Science Initiative, Multilingual LLMs, Risks of AI-generated content

Links mentioned:


LAION ▷ #research (9 messagesđŸ”„):

Training 7B on 12GB, Hyperefficient Small Models, Scale vs Efficiency in Models


OpenInterpreter ▷ #general (10 messagesđŸ”„):

01 Voice-Enabled App, Controlling GPT o1 Pro, Beta Access for Mac Users, Website Issues

Links mentioned:


Mozilla AI ▷ #announcements (1 messages):

Web Applets, Theia-ide, Programming Interviews, Integration with IDEs

Link mentioned: Tweet from Robert Scoble (@Scobleizer): Back in the day if you were interviewing for a programming job at Microsoft they might have you write a bubble sort on the white board to make sure you knew how to program that.Now?Just tell your IDE ...






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}