Frozen AI News archive

not much happened today

This week in AI news, **Anthropic** launched **Claude Sonnet 3.5**, enabling desktop app control via natural language. **Microsoft** introduced **Magentic-One**, a multi-agent system built on the **AutoGen framework**. **OpenCoder** was unveiled as an AI-powered code cookbook for large language models. **SambaNova** is sponsoring a hackathon with prizes up to **$5000** for building real-time AI agents. **Sophiamyang** announced new **Batch and Moderation APIs** with **50% lower cost** and multi-dimensional harmful text detection. Open-source tools like **Infisical** for secret management, **CrewAI** for autonomous agent orchestration, and **Crawlee** for web scraping were released. Research highlights include **SCIPE** for error analysis in LLM chains, **Context Refinement Agent** for improved retrieval-augmented generation, and **MemGPT** for managing LLM memory. The week also saw a legal win for **OpenAI** in the RawStory copyright case, affirming that facts used in LLM training are not copyrightable.

Canonical issue URL

a quiet week is all you need.

AI News for 11/7/2024-11/8/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (217 channels, and 2343 messages) for you. Estimated reading time saved (at 200wpm): 248 minutes. You can now tag @smol_ai for AINews discussions!

It seems that big launches were appropriately muted this whole week. We're celebrating the RawStory vs OpenAI dismissal, stating that facts used in LLM training are not copyrightable, and enjoying gorgeous images from the closed model Flux 1.1 [pro] Ultra and Raw launch.

Time to build, with this week's sponsor!


[Sponsored by SambaNova] SambaNova’s Lightning Fast AI Hackathon is here! Give yourself about 4 hours to build a cool AI agent that responds in real time using super-speedy models on SambaNova’s Cloud. Are there prizes? Yes. Up to $5000, plus it’s a chance to connect with other AI devs. Deadline is November 22, so get started now


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Models and APIs

AI Engineering and Infrastructure

AI Research and Techniques

AI Safety and Ethics

Company and Product Updates

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Qwen2.5 Series Shows Strong Performance Across Sizes

Theme 2. New Llama.cpp Server UI Released with Vue.js & DaisyUI

Theme 3. Training Speed Records: 3.28 Hours for NanoGPT Training

Theme 4. Open Source Models Show Near-Zero Refusal Rates vs Proprietary LLMs

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT

Theme 1. AI Companies Embrace Military Contracts: Palantir, Anthropic, OpenAI Remove Restrictions

Theme 2. CogVideoX 5B Released: Major Open Source Text-to-Video Progress

Theme 3. OpenAI's O1 Preview Shows Advanced Reasoning Capabilities

Theme 4. SVDQuant Claims 3x Speedup Over NF4 for Stable Diffusion


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1: New AI Models and Releases Making Waves

Theme 2: Optimizations and Training Strategies in AI Models

Theme 3: AI Tools and Frameworks Enhancing Development

Theme 4: AI Ethics, Legal Issues, and Monetization Strategies

Theme 5: Community Engagement and Career Discussions in AI


PART 1: High level Discord summaries

HuggingFace Discord


OpenRouter (Alex Atallah) Discord


Perplexity AI Discord


Eleuther Discord


Stability.ai (Stable Diffusion) Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


Unsloth AI (Daniel Han) Discord


Latent Space Discord


Notebook LM Discord Discord


Modular (Mojo 🔥) Discord


OpenAI Discord


LM Studio Discord


Interconnects (Nathan Lambert) Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


DSPy Discord


Cohere Discord


OpenAccess AI Collective (axolotl) Discord


LlamaIndex Discord


LAION Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


AI21 Labs (Jamba) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #general (410 messages🔥🔥🔥):

  • Microsoft LightBGM
  • Hugging Face models for IFC files
  • Suno song creating tools
  • AI game development and scripting
  • Maintaining context in LLM conversations

Links mentioned:


HuggingFace ▷ #today-im-learning (7 messages):

  • SFT Learning
  • Machine Learning Resources
  • BPE Tokenizer Visualization
  • FastBert Usage

Links mentioned:


HuggingFace ▷ #cool-finds (14 messages🔥):

  • User Spamming Concerns
  • Token Project Discussion
  • Scam Alert
  • Data Council '25 Call for Proposals

HuggingFace ▷ #i-made-this (12 messages🔥):

  • AI-generated programmatic ads
  • AI/ML workflow platform
  • PostgreSQL text optimization
  • HF Space for OS-ATLAS

Links mentioned:


HuggingFace ▷ #reading-group (1 messages):

noaroggendorff: yet


HuggingFace ▷ #NLP (1 messages):

  • Fine Tuning Vocabulary Size

HuggingFace ▷ #diffusion-discussions (1 messages):

  • ComfyUI usage
  • Hacking Forge
  • SD3.5 Feature Request

Links mentioned:


HuggingFace ▷ #gradio-announcements (2 messages):

  • Cinnamon AI
  • Kotaemon RAG tool
  • Live broadcast on X

OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

  • New Rankings Page
  • MythoMax Performance

OpenRouter (Alex Atallah) ▷ #general (303 messages🔥🔥):

  • OpenRouter performance
  • Rate limits
  • Model comparisons
  • API issues
  • Command R+ alternatives

Links mentioned:

: no description foundMistral Moderation API: We are introducing our new moderation service enabling our users to detect undesirable text content along several policy dimensions.Mistral Batch API: Lower cost API for AI builders.Mistral AI API | Mistral AI Large Language Models: Our Chat Completion and Embeddings APIs specification. Create your account on La Plateforme to get access and read the docs to learn how to use...


OpenRouter (Alex Atallah) ▷ #beta-feedback (4 messages):

  • Integration Beta Feature
  • OpenRouter Monetization

Perplexity AI ▷ #general (241 messages🔥🔥):

  • Subscription Prices Discussion
  • Mobile Device Specs
  • Comparison of AI Models
  • Discount Code Issues
  • App Functionality Problems

Links mentioned:


Perplexity AI ▷ #sharing (8 messages🔥):

  • Gladia Functionality
  • Neohtop Insights
  • Studies in Psychology
  • AI with Infinite Memory
  • USA Election Discussion

Perplexity AI ▷ #pplx-api (10 messages🔥):

  • Citations in Perplexity API
  • Default rate limits increase
  • Citations visibility issues
  • API discussions on GitHub

Links mentioned:


Eleuther ▷ #general (203 messages🔥🔥):

  • Job fulfillment and career moves
  • Experiences transitioning between companies
  • Team dynamics and internal transfers
  • Background checks and employment history
  • Hiring processes at major tech companies

Links mentioned:


Eleuther ▷ #research (16 messages🔥):

  • Forward Gradient for Flash Attention
  • Papers and Their Importance
  • Curvature and Tangent Spaces from Diffusion Geometry

Links mentioned:


Eleuther ▷ #interpretability-general (4 messages):

  • Inverse Interpretability
  • Refusal in LLMs
  • Symbolic Representation of Neural Networks
  • Behavioral Changes in AI Models

Link mentioned: Refusal in LLMs is mediated by a single direction — LessWrong: This work was produced as part of Neel Nanda's stream in the ML Alignment & Theory Scholars Program - Winter 2023-24 Cohort, with co-supervision from…


Eleuther ▷ #lm-thunderdome (2 messages):

  • nlls with <bos> token
  • Meta Llama 3.1
  • Salamandra model card
  • Saving results issues

Links mentioned:


Eleuther ▷ #gpt-neox-dev (21 messages🔥):

  • Benchmarking NeoX vs LitGPT
  • LitGPT usage in projects
  • FSDP vs ZeRO
  • GPT-NeoX features
  • Training with limited resources

Stability.ai (Stable Diffusion) ▷ #general-chat (175 messages🔥🔥):

  • ComfyUI connection issues
  • Image generation concerns with inpainting
  • Model recommendations for ComfyUI
  • Flux model advantages
  • Baseline models vs. merged models

Links mentioned:


Nous Research AI ▷ #general (120 messages🔥🔥):

  • TEE HEE He bot functionality
  • Nous community engagement
  • VLM for handwriting to LaTeX conversion
  • Abliteration concept
  • Anthropic and Palantir deal

Links mentioned:


Nous Research AI ▷ #ask-about-llms (5 messages):

  • llm-evaluation-harness
  • PyTorch model evaluation
  • HellaSwag dataset

Nous Research AI ▷ #research-papers (1 messages):

  • Ferret-UI
  • Multimodal LLMs
  • Apple's UI comprehension

Links mentioned:


Nous Research AI ▷ #research-papers (1 messages):

  • Ferret-UI
  • Gemma-2B
  • Llama-3-8B
  • Multimodal LLMs
  • User Interface Comprehension

Links mentioned:


Nous Research AI ▷ #rag-dataset (2 messages):

  • RAG usage in chat sessions

aider (Paul Gauthier) ▷ #general (63 messages🔥🔥):

  • Aider Usage Tips
  • Exponent AI Pair Programmer
  • Gemini 2.0 Launch
  • Model Comparison
  • Funding and Donations for Aider

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (51 messages🔥):

  • Aider Model Architecture
  • RAG Integration with Qdrant
  • Emacs Keybindings for Aider
  • Exploring Aider Features
  • Using Aichat for RAG

Links mentioned:


aider (Paul Gauthier) ▷ #links (3 messages):

  • Aider
  • Developed Approaches
  • Hacker News Discussion

Unsloth AI (Daniel Han) ▷ #general (61 messages🔥🔥):

  • LoRA vs Full Fine-tuning
  • 4bit improvements
  • Multi-GPU Training
  • Feature Contribution Analysis Tools
  • Curriculum Learning in Fine-tuning

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

  • Maths Exam
  • Excellence
  • Celebration

Unsloth AI (Daniel Han) ▷ #help (38 messages🔥):

  • Concurrent Setup with Unsloth Inference
  • Integration of Transformers-Interpret
  • Fine-tuning Models for Text Classification
  • Ollama and Streamlit Integration
  • Language Adaptation for LLMs

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

  • Avian inference approach
  • Inference speed comparison

Unsloth AI (Daniel Han) ▷ #research (5 messages):

  • Errors in AI/ML Research Papers
  • Reproducibility Issues
  • Preprint Papers and Peer Review

Latent Space ▷ #ai-general-chat (26 messages🔥):

  • Claude limitations
  • Codebuff versus Aider comparison
  • Mistral's new APIs
  • FLUX1.1 Ultra features
  • Gemini API release

Links mentioned:


Latent Space ▷ #ai-in-action-club (80 messages🔥🔥):

  • Tech Issues in Recording
  • Code Generation and Correction
  • Daily Use Cases
  • Open Interpreter Features
  • Community Feedback

Links mentioned:


Notebook LM Discord ▷ #announcements (1 messages):

  • Audio Overviews Feedback Survey
  • NotebookLM User Feedback

Link mentioned: Register your interest: Google feedback survey: Hello, We are looking for feedback on NotebookLM via a short survey. This will help the Google team better understand your needs in order to incorporate them into future product enhancements. To regi...


Notebook LM Discord ▷ #use-cases (34 messages🔥):

  • NotebookLM for Exam Preparation
  • Importing Google Recordings
  • AI Language Models and Bias
  • Using NotebookLM for Technical Job Prep
  • Mock Interviews with AI

Links mentioned:


Notebook LM Discord ▷ #general (52 messages🔥):

  • NotebookLM features
  • Sharing Notebooks
  • Using AI in education
  • Podcast generation
  • YouTube Terms of Service

Links mentioned:


Modular (Mojo 🔥) ▷ #announcements (1 messages):

  • ModCon 2024
  • Advancements in MAX and Mojo
  • ModCon 2025 Updates

Modular (Mojo 🔥) ▷ #mojo (73 messages🔥🔥):

  • Mojo Interoperability
  • OpenSSL Layer Discussions
  • Cryptography in Mojo
  • JSON Support in Mojo
  • MLIR Reflection API

Links mentioned:


OpenAI ▷ #ai-discussions (39 messages🔥):

  • AI Sales Agents
  • Quantum Networking
  • Benevolent AI Development
  • AI's Impact on Society
  • Training Data Usage

OpenAI ▷ #gpt-4-discussions (5 messages):

  • Connecting GPT chat bot to Firestore
  • GPT model updates
  • User interface concerns
  • Guessing GPT model name

LM Studio ▷ #general (29 messages🔥):

  • Llama 3.2 Vision
  • LM Studio prompts and features
  • LLM web searching capability
  • GPU usage in LM Studio
  • Beta tools release

Link mentioned: Llama 3.2 Vision · Ollama Blog: Llama 3.2 Vision


LM Studio ▷ #hardware-discussion (8 messages🔥):

  • Qwen 2.5 memory calculation
  • Gemma2 27B model on Mac Mini
  • Mac Mini recommendations

Interconnects (Nathan Lambert) ▷ #news (6 messages):

  • Court ruling on OpenAI
  • Google Gemini model
  • Amazon's investment in Anthropic

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (3 messages):

  • Model Token Limits
  • Instruction Data Usage
  • Text Rewriting Techniques

Interconnects (Nathan Lambert) ▷ #random (19 messages🔥):

  • Sasha Rush livestream
  • PRMs and value models
  • O1 blog post
  • Speculations on Test-Time Scaling
  • Awesome O1 repository

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (5 messages):

  • Logan's favorites
  • Podcast ideas
  • Discussion on Julia

Link mentioned: Tweet from Logan Kilpatrick (@OfficialLoganK): My favorite Friday ship 🚢 in a while : ) will be continuing to remove some of the rough edges here over the next few weeks.


OpenInterpreter ▷ #general (24 messages🔥):

  • Streaming viewer limits
  • OmniParser details
  • Running LLMs on servers
  • User experience feedback
  • Access to desktop app

Links mentioned:


tinygrad (George Hotz) ▷ #general (12 messages🔥):

  • Nvidia hardware optimization
  • Groq hardware benefits
  • ASIC advantages
  • Operations on ASICs
  • Demand for compiler improvements

tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

  • x.shard function
  • Model Sharding Strategies
  • Optimizing CPU Pipeline

DSPy ▷ #papers (2 messages):

  • OptoPrime
  • Stanford Optimizer Naming

DSPy ▷ #general (12 messages🔥):

  • Self Consistency Module Caching
  • Dynamic Few-Shot Examples
  • Predict Module with N Outputs
  • MIPRO Optimizer for Question Generation

Cohere ▷ #discussions (3 messages):

  • Tavily for AI Searches
  • Testing with API Calls
  • Comparison with Search Engines

Cohere ▷ #questions (8 messages🔥):

  • Cohere API trial key
  • Embedding errors
  • Implementation challenges
  • Support resources

OpenAccess AI Collective (axolotl) ▷ #general (8 messages🔥):

  • Metaparameter Tuning
  • Memory Requirements of Optimizers
  • Training on AMD GPUs
  • Preliminary Research Discussions

Link mentioned: Understanding Optimization in Deep Learning with Central Flows: Optimization in deep learning remains poorly understood, even in the simple setting of deterministic (i.e. full-batch) training. A key difficulty is that much of an optimizer's behavior is implici...


LlamaIndex ▷ #blog (2 messages):

  • Context Refinement Agent
  • Agentic RAG Query Engine
  • NVIDIA NIM
  • Open-source Model Inference

LlamaIndex ▷ #general (5 messages):

  • LlamaIndex workflow
  • AI NLP Engineer position
  • Open source LLM resources

Link mentioned: Workflows - LlamaIndex: no description found


LAION ▷ #general (2 messages):

  • MicroDiT model
  • LOST SOUNDTRACK - BONNIE AND CLYDE

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

  • Computing Resources Deadline

MLOps @Chipro ▷ #events (1 messages):

  • Data Council '25
  • CFP Opening

AI21 Labs (Jamba) ▷ #general-chat (1 messages):

  • Jurassic endpoint deprecation
  • Transition to Jamba model





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}