Frozen AI News archive

not much happened to end the week

**AI News for 11/29/2024-11/30/2024** covers key updates including the **Gemini multimodal model** advancing in musical structure understanding, a new **quantized SWE-Bench** for benchmarking at **1.3 bits per task**, and the launch of the **DeepSeek-R1 model** focusing on transparent reasoning as an alternative to **o1**. The establishment of the **1st International Network of AI Safety Institutes** highlights global collaboration on AI safety. Industry updates feature **Amazon's Olympus AI model**, **Tesla's Optimus**, and experiments with **ChatGPT** as a universal translator. Community reflections emphasize the impact of large language models on daily life and medical AI applications. Discussions include scaling sparse autoencoders to **gpt-4** and the need for transparency in reasoning LLMs. The report also notes humor around **ChatGPT**'s French nickname.


AI News for 11/29/2024-11/30/2024. We checked 7 subreddits, 433 Twitters and 29 Discords (198 channels, and 1195 messages) for you. Estimated reading time saved (at 200wpm): 142 minutes. You can now tag @smol_ai for AINews discussions!

Happy holidays. Lots of QwQ discussion in the reddits, but the latest from the Qwen team is that the tech report will take a month or so.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

1. Advances and Trends in AI: Notable Releases and Tools

2. AI Safety and Ethical Initiatives

3. AI in Practice: Industry Updates and Applications

4. Thanksgiving Reflections and Community Engagement

5. AI Critiques and Discussions

6. Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Alibaba's QwQ 32B Model Release and Reception

Theme 2. Janus: New Browser-Based Multimodal AI from Deepseek

Theme 3. Innovative LLM Tools: V0, Memoripy, and Steel Browser

Theme 4. Local LLM Hardware & Benchmarks: M3/M4 vs NVIDIA GPUs

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Claude Performance Concerns and Anthropic's Response

Theme 2. Chinese AI Models Challenging Western Dominance (Alibaba QwQ-32B)

Theme 3. AI Video Generation Breakthroughs and Comparisons

Theme 4. Model Compression and Efficiency Advances


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1: Cursor IDE Update Sparks Developer Frustration


Theme 2: Anthropic’s MCP Framework Supercharges Claude


Theme 3: Low-Bit Quantization Shakes Up AI Training


Theme 4: AI Powers Rapid Creative Content Creation


Theme 5: AI Trends and Investments Make Waves



PART 1: High level Discord summaries

Cursor IDE Discord


OpenAI Discord


aider (Paul Gauthier) Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LM Studio Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


GPU MODE Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Notebook LM Discord


Latent Space Discord


Stability.ai (Stable Diffusion) Discord


tinygrad (George Hotz) Discord


Cohere Discord


DSPy Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


OpenInterpreter Discord


Interconnects (Nathan Lambert) Discord


LlamaIndex Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Axolotl AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LAION Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The HuggingFace Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (237 messages🔥🔥):

Cursor IDE Updates, Composer vs Chat Mode, Windsurf Advantages, API Key Usage, Context Management



OpenAI ▷ #ai-discussions (91 messages🔥🔥):

Gemini's Moral Constraints, Anthropic MCP Framework, ChatGPT's Capabilities, Speech to Text on Windows, AI Models for Coding

Link mentioned: Tweet from Pietro Schirano (@skirano): Today @Anthropic is releasing MCP, a framework that allows Claude to run servers, giving it superpowers and effectively turning the Claude app into an API. We created some server that I think you'l...


OpenAI ▷ #gpt-4-discussions (7 messages):

App vs. Browser Performance, Issues with Customized GPTs, Loading Errors for Files and Photos


OpenAI ▷ #prompt-engineering (32 messages🔥):

Image Captioning Issues, Structured Outputs Problems, Model Recommendations, User Experience with OpenAI Support


OpenAI ▷ #api-discussions (32 messages🔥):

Issues with Image Uploads, Vision Model Malfunctions, Structured Output Errors, Switching Models to Claude, Debugging Best Practices


aider (Paul Gauthier) ▷ #general (83 messages🔥🔥):

QwQ model configurations, DeepSeek model performance, Using Aider for local models, Issues with OpenRouter, Ensemble frameworks for reasoning



aider (Paul Gauthier) ▷ #questions-and-tips (46 messages🔥):

Aider file management, QwQ model from Qwen, Monorepo settings, OpenAI API instances, Repository map experiences



Unsloth AI (Daniel Han) ▷ #general (53 messages🔥):

Instruct vs Non-instruct Fine-tuning, Fine-tuning Dataset Formatting, Alternative GPU Recommendations, Creating Custom Datasets, Support for Schedule Free Optimizers



Unsloth AI (Daniel Han) ▷ #off-topic (4 messages):

RAG usage, Training models, OOM errors

Link mentioned: Chuckles Im In Danger GIF - Chuckles Im In Danger Ralph Wiggum - Discover & Share GIFs: Click to view the GIF


Unsloth AI (Daniel Han) ▷ #help (48 messages🔥):

Unsloth Fine-tuning Models, Using Unsloth for Private Data, Grad Norm Fluctuations, Llama 3.1 OOM Errors, SyntaxWarnings in Unsloth



Unsloth AI (Daniel Han) ▷ #research (4 messages):

Unsloth fine-tuning, RAG costs, Latent paraphraser, Fine-Tuning or Retrieval paper, Custom tokenizers



Perplexity AI ▷ #announcements (1 message):

Perplexity Pro Discount, AI Models Access, One-Click Shopping


Perplexity AI ▷ #general (74 messages🔥🔥):

Perplexity Pro Subscription Features, User Experiences with Claude, Image Generation Queries, Customer Support for Subscription Issues, Black Friday Discounts



Perplexity AI ▷ #sharing (9 messages🔥):

Hard problem of consciousness, Factoids overheard, Cloud water content, RBAC vs ABA, Battery optimization


Perplexity AI ▷ #pplx-api (5 messages):

Perplexity in Claude, Claude Project Knowledge, Perplexity's text file reading issue, Custom instructions for spaces


LM Studio ▷ #announcements (1 message):

HF Search Issue, Image Analysis


LM Studio ▷ #general (56 messages🔥🔥):

LM Studio AIDE Integration, Llama 3.1 Models in LM Studio, LM Studio Network Issues, Document Interaction in LM Studio, GUI Access Issues in Mac



LM Studio ▷ #hardware-discussion (17 messages🔥):

Seasonic PSU longevity, a770 performance comparison, PC build recommendations, Intel vs AMD processors, Performance of Qwen2.5-14b


Eleuther ▷ #general (29 messages🔥):

De-escalation of resource contention, GPU job submission management, SLURM and Kubernetes usage, AI and Crypto intersection, Open access to academic resources


Eleuther ▷ #research (23 messages🔥):

Poincare Ball Embedding, Hyperbolic Geometry, Graph Distortion, Embedding Trees, HyperE

Link mentioned: HyperE: no description found


Eleuther ▷ #scaling-laws (5 messages):

AutoML Challenges, TaskSet Dataset, Neural Architecture Design, Equivariant vs Non-equivariant Networks



Eleuther ▷ #lm-thunderdome (17 messages🔥):

HF Tokenizer Handling, Custom Tokenizer Considerations, Evaluation Harness Model Functions, Generation Parameters in Models



OpenRouter (Alex Atallah) ▷ #announcements (1 message):

Feature Requests Voting, Channel for Additional Requests


OpenRouter (Alex Atallah) ▷ #general (57 messages🔥🔥):

Pixtral Large's Capabilities, Concerns about Model Responses, Provider-Specific Features, Image Generation in OpenRouter, Structured Outputs from Llama 3.2



OpenRouter (Alex Atallah) ▷ #beta-feedback (5 messages):

Custom Provider Keys


GPU MODE ▷ #general (2 messages):

Parallel processing on NVIDIA GPU, Posting in beginner section


GPU MODE ▷ #triton (9 messages🔥):

Metaprogramming Proposal, Building Triton from Source, Offline Compilation Dependencies



GPU MODE ▷ #cuda (1 message):

cuBLAS async loads, Custom kernels performance, SASS instructions, CuTe templates, Throughput considerations


GPU MODE ▷ #torch (26 messages🔥):

Triton performance, Fusion strategies, Memory usage in PyTorch, Max autotune settings, nanoGPT integration


GPU MODE ▷ #algorithms (1 message):

melanimahes: https://arxiv.org/pdf/2411.17116


GPU MODE ▷ #cool-links (1 message):

Diffusion Models Overview, Classifier-free Diffusion Guidance, Perspectives on Diffusion Models, Noise Schedules in Diffusion Models



GPU MODE ▷ #off-topic (2 messages):

Series A Docs Process, HR Reporting Protocols

Link mentioned: Tweet from Nathan Benaich (@nathanbenaich): 12 hours and counting - notary reads every single word of Series A docs in Germany out loud in front of founders. In person. Guys, we have GDP to grow here. Pure prehistoric madness.


GPU MODE ▷ #bitnet (2 messages):

BitNet b1.58, 1-bit LLMs, Open-Source Models, RedPajama Dataset, Dolma Dataset



GPU MODE ▷ #self-promotion (1 message):

Japanese LLM evaluation, Open Japanese LLM Leaderboard, Hugging Face collaboration

Link mentioned: Introducing the Open Leaderboard for Japanese LLMs!: no description found


GPU MODE ▷ #thunderkittens (14 messages🔥):

Non-Warp Specialized Implementations, Unification of ThunderKittens and ThunderMittens, API Contracts between TK and TM, Auto Optimizer for TK, Triton vs ThunderKittens Features

Link mentioned: ThunderKittens/kernels/layernorm/non_pc at main · HazyResearch/ThunderKittens: Tile primitives for speedy kernels. Contribute to HazyResearch/ThunderKittens development by creating an account on GitHub.


Nous Research AI ▷ #general (42 messages🔥):

Hermes 3 updates, Mistral Platform issues, Truth Terminal in Crypto & AI, Job hunting in Discord, AI and Crypto community crossover

Link mentioned: GOAT: The Gospel of Goatse: Why Truth Terminal is an asymmetric bet on society's growing fascination with autonomous AI agents


Nous Research AI ▷ #research-papers (5 messages):

Low-bit quantization effects, Precision-aware scaling laws, Ternary quantization vs undertrained models, FP4 efficiency



Nous Research AI ▷ #interesting-links (3 messages):

Filter issues, Content policy, User experience


Nous Research AI ▷ #research-papers (5 messages):

Low-bit quantization effects, Precision-aware scaling laws, Ternary quantization usefulness, FP4 as efficient representation, QaT for smaller models



Modular (Mojo 🔥) ▷ #mojo (35 messages🔥):

Mojo Origins, Rust Lifetimes, Compiler Behavior, Destructor Calls, Variable Naming


Notebook LM Discord ▷ #use-cases (6 messages):

Notebook LM Podcast Feature, Worldbuilding with NotebookLM, RAX Times Square Takeover, FDP Breakup of German Government, Use Case Examples of NotebookLM



Notebook LM Discord ▷ #general (17 messages🔥):

GenFM competition with NotebookLM, Changing language settings in NotebookLM, Using NotebookLM for gaming and worldbuilding, Social psychology inquiries

Link mentioned: GenFM, Now Playing on ElevenReader: Smart Podcasts Produced by Generative AI: We’re making the ElevenReader app even more powerful. You can now generate smart personal podcasts from any of your PDFs, articles, ebooks, docs or imported ...


Latent Space ▷ #ai-general-chat (18 messages🔥):

Perplexity Black Friday Deals, AI to Human Comparison, Generative AI in Enterprises, Freysa AI Agent Challenge, Technology Adoption Trends



Stability.ai (Stable Diffusion) ▷ #general-chat (18 messages🔥):

AI Model Performance, Stable Diffusion Hardware Questions, ControlNet for SD 3.5 Feedback, Content Creation Queries, LoRA Model Request


tinygrad (George Hotz) ▷ #general (14 messages🔥):

TinyFPGA Memory Hierarchy, Memory Utilization Techniques, Exa Laboratories, Tenstorrent Training Algorithm, Brain-like Processing Models

Link mentioned: Exa Laboratories: no description found


tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

VIZ tool, VIZ vs LLVM/MLIR, tinygrad tutorials

Link mentioned: tinygrad-notes/20241129_viz.md at main · mesozoic-egg/tinygrad-notes: Tutorials on tinygrad. Contribute to mesozoic-egg/tinygrad-notes development by creating an account on GitHub.


Cohere ▷ #discussions (12 messages🔥):

Thanksgiving celebrations, Aya project contributions, Healthy meal choices, Food sharing, Dungeness crab


DSPy ▷ #general (8 messages🔥):

dspy.asyncify, dspy demo behavior, New Member Introduction


Torchtune ▷ #dev (5 messages):

DPO Fine-tuning, Full-parameter DPO, DPO vs LoRA-DPO, Full-finetuning DPO



LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Quiz 11 availability, OpenAI credits inquiry, MOOC certificate eligibility


OpenInterpreter ▷ #ai-content (2 messages):

Open Interpreter inspired project, Open-source dashboard


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

OLMo 2, Weight Watcher AI, Model Performance Comparison

Link mentioned: WeightWatcher: Data-Free Diagnostics for Deep Learning: no description found


LlamaIndex ▷ #general (1 message):

Web Development, JavaScript Frameworks, Testing Tools, API Integrations, Cloud Services








{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}