Frozen AI News archive

not much happened today

**ChatGPT Search** was launched, and **Sam Altman** called it his favorite feature since ChatGPT's original launch, saying it had doubled his usage. Comparisons were made between ChatGPT Search and **Perplexity**, with improvements noted in Perplexity's web navigation. **Google** introduced a "Grounding" feature in the Gemini API & AI Studio, enabling Gemini models to access real-time web information. Despite Gemini's leaderboard performance, its developer adoption lags behind **OpenAI** and **Anthropic**. **SmolLM2**, a new small, powerful on-device language model, outperforms **Meta's Llama 3.2 1B**. A **Claude** desktop app was released for Mac and Windows. **Meta AI** announced robotics advancements including Meta Sparsh, Meta Digit 360, and Meta Digit Plexus. **Stable Diffusion 3.5 Medium**, a 2B parameter model with a permissive license, was released. Insights on AGI development suggest that it will initially be inferior but improve rapidly. **Anthropic** advocates for early, targeted AI regulation. Discussions on ML specialization predict that training will concentrate among a few companies, while inference becomes commoditized. New AI tools include **Suno AI Personas** for music creation, **PromptQL** for natural language querying over data, and **Agent S** for desktop task automation. Humor was shared about Python environment upgrades.

AI News for 10/31/2024-11/1/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (231 channels, and 2436 messages) for you. Estimated reading time saved (at 200wpm): 254 minutes. You can now tag @smol_ai for AINews discussions!

Not much happened today, but a month's worth of launches happened in the past two days that you may want to keep up on.

Alternatively you may wish to tune in to the latest LS pod on LMSys/Chatbot Arena!

https://www.youtube.com/watch?v=vBlhoAIb0iE


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

ChatGPT Search and AI-Powered Search

AI Model Releases and Updates

AI Research and Insights

AI Tools and Applications

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. AI Real-Time Game Generation Breakthrough

Theme 2. Ollama Framework Security: Multiple CVEs Discovered

Theme 3. Meta's MobileLLM: 125M Model Matches 500M Performance

Theme 4. QTIP: Next-Gen 2-bit Quantization for 405B Models

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Development and Research

AI Gaming and Graphics

Product Updates and Announcements

Memes and Humor


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. AI Model Performance and Optimization


Theme 2. AI Deployment, APIs, and Cost Efficiency


Theme 3. AI Frameworks, Finetuning, and Tool Development


Theme 4. Research Innovations in AI


Theme 5. Community Events, Announcements, and Giveaways


PART 1: High level Discord summaries

Nous Research AI Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


OpenAI Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Notebook LM Discord


aider (Paul Gauthier) Discord


Stability.ai (Stable Diffusion) Discord


Eleuther Discord


Latent Space Discord


GPU MODE Discord


Interconnects (Nathan Lambert) Discord


Torchtune Discord


DSPy Discord


OpenInterpreter Discord


LAION Discord


LlamaIndex Discord


Cohere Discord


Modular (Mojo 🔥) Discord


OpenAccess AI Collective (axolotl) Discord


Alignment Lab AI Discord


LLM Agents (Berkeley MOOC) Discord


tinygrad (George Hotz) Discord


Mozilla AI Discord


The LangChain AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Nous Research AI ▷ #general (379 messages🔥🔥):

  • Running AI Models on Clusters
  • Encoder-Decoder vs. Decoder-Only Models
  • Creative Writing with LLMs
  • Goliath 120B Model Insights
  • Networking Considerations for Clusters


Nous Research AI ▷ #ask-about-llms (4 messages):

  • Gemma2B Tokenizer Vocabulary
  • Open-source Vector to Language Models
  • Hermes 3 Serverless Deployment

Nous Research AI ▷ #research-papers (1 message):

trre: https://openreview.net/forum?id=q2Lnyegkr8


Nous Research AI ▷ #interesting-links (6 messages):

  • SmolLM2 family
  • Tiny LLMs by Meta
  • BART model optimization



Unsloth AI (Daniel Han) ▷ #general (249 messages🔥🔥):

  • Unsloth Finetuning Framework
  • Continual Pre-Training
  • Dataset Size for Training
  • Gradient Accumulation
  • Instruction Tuning


Unsloth AI (Daniel Han) ▷ #help (25 messages🔥):

  • CUDA Version Support
  • Deprecated Tokenizer Warning
  • RAG vs Fine-Tuning for Chatbots
  • Graph RAG vs Light RAG
  • Llama 3.1 Notebook Errors

Perplexity AI ▷ #general (196 messages🔥🔥):

  • Perplexity Pro Cancellation
  • Comparisons with ChatGPT
  • Model Switching Benefits
  • Claude Sonnet Performance
  • Real-time Data and API Usage

Link mentioned: Tweet from Aravind Srinivas (@AravSrinivas): Been enjoying using the Grok 2 model. Now on Perplexity iOS app too for Pro users. (Restart app if you don’t see it on “Settings->AI Model”)


Perplexity AI ▷ #sharing (7 messages):

  • Google's Decillion Fine
  • Toxic Black Plastic
  • Ecuador's Forest Song
  • Volume, Form, and Mass
  • Aluminium Price Predictions

Link mentioned: YouTube


Perplexity AI ▷ #pplx-api (6 messages):

  • Perplexity API Features
  • Implementing RAG Functionality
  • Cost Comparison
  • Chatbot Functionality


OpenAI ▷ #ai-discussions (165 messages🔥🔥):

  • ChatGPT Search
  • Image Generation Models
  • AI and Human Interaction
  • Community Contributions
  • AI Impact on Employment

Link mentioned: LiveBench


OpenAI ▷ #gpt-4-discussions (1 message):

  • Nouswise Multi-File Connection

OpenAI ▷ #prompt-engineering (13 messages🔥):

  • D&D GPT limitations
  • Context windows and system prompts
  • Tokenization in LLMs
  • Message history importance
  • Weighting in model responses

OpenAI ▷ #api-discussions (13 messages🔥):

  • D&D GPT Interaction
  • Context Windows and Tokenization
  • Message History vs. Prompting
  • Weight of System Prompts
  • Python Tool Returns

LM Studio ▷ #general (142 messages🔥🔥):

  • LM Studio context issues
  • Open WebUI for LM Studio
  • HTML rendering in models
  • IBM's Granite model requirements

Link mentioned: GitHub - open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...): User-friendly AI Interface (Supports Ollama, OpenAI API, ...) - open-webui/open-webui


LM Studio ▷ #hardware-discussion (17 messages🔥):

  • LM Studio limitations
  • CPU performance issues
  • Multi-GPU support
  • Inference speed on cards

OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • Hermes 3 405B removal
  • /api/v1/models API speedup

Link mentioned: Hermes 3 405B Instruct - API, Providers, Stats: Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coheren...


OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

  • Rubik's AI search interface
  • Beta testing opportunity
  • Promotional offer for premium access

Link mentioned: Rubik's AI - AI Research Assistant & Search Engine: Access powerful AI models for NLP, computer vision, and more. Get instant answers from Groq, Claude-3.5 Sonnet, and GPT-4o.


OpenRouter (Alex Atallah) ▷ #general (137 messages🔥🔥):

  • Hermes 3 Issues
  • OpenRouter Setup
  • Alternatives to Hermes 3
  • ChatGPT Model Changes
  • Novel Writing Tools


OpenRouter (Alex Atallah) ▷ #beta-feedback (6 messages):

  • Access to Integrations
  • Beta Access Requests

Notebook LM Discord ▷ #use-cases (31 messages🔥):

  • Podcast Source Errors
  • Python Utility for Audio Processing
  • Voice to Avatar Integration
  • Google TTS Voice Quality
  • Long Audio Synthesis with Google Cloud


Notebook LM Discord ▷ #general (60 messages🔥🔥):

  • NotebookLM Podcast Features
  • User Feedback on NotebookLM
  • Language Support for Podcasts
  • CSV Upload Functionality
  • Technical Limitations and User Inquiries

Link mentioned: June 6 2024 - Help


aider (Paul Gauthier) ▷ #announcements (2 messages):

  • Aider v0.61.0 features
  • Aider's code contributions
  • Anonymous analytics
  • Model support enhancements
  • Launch command options

Link mentioned: Release history: Release notes and stats on aider writing its own code.


aider (Paul Gauthier) ▷ #general (53 messages🔥):

  • Aider analytics
  • Customizable AI workflows
  • Continue VS Code Alternative
  • GitHub Copilot
  • Image processing errors


aider (Paul Gauthier) ▷ #questions-and-tips (22 messages🔥):

  • Aider Documentation
  • Sonnet File Handling
  • Aider UX Limitations
  • Read-Only Context in Aider
  • Test Command Differences

Link mentioned: REQ: Ability to toggle --verbose function on and off from within aider, perhaps /verbose-debug ? · Issue #1870 · Aider-AI/aider: Issue It would be really useful, to aid in debugging issues and understanding the "behind the curtain" work aider is doing, to be able to toggle the verbose output seen with --verbose, on an...


aider (Paul Gauthier) ▷ #links (2 messages):

  • Electron App
  • Browser Functionality

Stability.ai (Stable Diffusion) ▷ #general-chat (55 messages🔥🔥):

  • ComfyUI Setup for SD
  • FP16 Model Performance
  • Lora Trigger Word Access
  • Video Generation Models
  • Lora Training Methods


Eleuther ▷ #general (9 messages🔥):

  • Attention Weights in Inference
  • Post Removal Discussion
  • User Ban Considerations

Eleuther ▷ #research (29 messages🔥):

  • DEQ model challenges
  • Hypernetworks discussion
  • Forgetting Transformer
  • Training dynamics
  • Innovative classification methods


Latent Space ▷ #ai-general-chat (34 messages🔥):

  • SmolLM2 Release
  • AI Regulation Discussion
  • Claude 3.5 Sonnet Benchmarks
  • AI Tool Announcements
  • OpenAI AMA Highlights


Latent Space ▷ #ai-announcements (1 message):

  • LM Arena Podcast
  • Audio Quality Challenges
  • Statistics of Subjectivity
  • ELO Tracking
  • 4o-mini Ranking Drama

Link mentioned: Tweet from Alessio Fanelli (@FanaHOVA): In the arena, generating tokens 🏟️ @infwinston and @ml_angelopoulos came on the pod to talk about the history and future of LM Arena: - The Statistics of Subjectivity - Using ELO to track the Paret...


GPU MODE ▷ #general (2 messages):

  • CUDA Optimization for Matrix Multiplication
  • GPU Scheduling with Kubernetes
  • NVIDIA Device Plugin
  • Deep Learning Performance
  • GPU Pod Resource Recognition

Link mentioned: How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog: In this post, I’ll iteratively optimize an implementation of matrix multiplication written in CUDA. My goal is not to build a cuBLAS replacement, but to deepl...


GPU MODE ▷ #triton (1 message):

  • Triton casting strategies
  • Rescale Kernel Implementation
  • vLLM Baseline Comparison
  • FP8 Quantization
  • Triton vs vLLM outputs

Link mentioned: vllm/csrc/quantization/fp8/common.cu at 55650c83a0c386526ed04912a0c60eccca202f3e · vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm


GPU MODE ▷ #beginner (3 messages):

  • Colpali model usage
  • Model quantization
  • Inference speed with LoRas
  • Performance at different bit widths

GPU MODE ▷ #off-topic (2 messages):

  • Asking Dumb Questions
  • Advanced Topics
  • Google Search Answers

GPU MODE ▷ #irl-meetup (1 message):

lavawave03: I would be so down!


GPU MODE ▷ #triton-puzzles (1 message):

  • Triton Learning
  • Triton Puzzle Visualization

GPU MODE ▷ #rocm (3 messages):

  • Composable Kernel Performance
  • XOR based Permutation Strategy

Link mentioned: composable_kernel/include/ck/tensor_operation/gpu/grid/gridwise_gemm_xdl_cshuffle_v3.hpp at 03c6448ba3c854195c61c817036b66af1fa0e844 · ROCm/composable_kernel: Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators - ROCm/composable_kernel


GPU MODE ▷ #liger-kernel (3 messages):

  • Learning Triton
  • Accessing GPU Resources
  • Cloud Services for GPU
  • AI Development Environments

GPU MODE ▷ #self-promotion (2 messages):

  • FlashAttention
  • FlashAttention-2
  • GPU Memory Optimization

Link mentioned: FlashAttention-2 | DigitalOcean


GPU MODE ▷ #🍿 (4 messages):

  • Triton Kernels Dataset
  • Github Repository Scanning
  • Cudabench Schema Definition
  • Submission Scoring Criteria


GPU MODE ▷ #thunderkittens (3 messages):

  • Session Start Delay
  • Prerequisite Stream

Interconnects (Nathan Lambert) ▷ #news (14 messages🔥):

  • SmolLM2 launch
  • Traditional NLP evaluations
  • Changes in model expectations
  • Outdated evaluation metrics
  • New evaluation rubric development

Link mentioned: Tweet from Loubna Ben Allal (@LoubnaBenAllal1): Introducing SmolLM2: the new, best, and open 1B-parameter language model. We trained smol models on up to 11T tokens of meticulously curated datasets. Fully open-source Apache 2.0 and we will release...


Interconnects (Nathan Lambert) ▷ #ml-questions (4 messages):

  • Diffusion and Robotics
  • Style Transfer Techniques

Interconnects (Nathan Lambert) ▷ #ml-drama (1 message):

xeophon.: https://x.com/sahir2k/status/1852064158830989757


Interconnects (Nathan Lambert) ▷ #reads (3 messages):

  • OpenAI o1-preview
  • Reasoning in Language Models
  • Token Billing and Latency
  • Search Algorithms in AI

Link mentioned: Reasoning Series, Part 1: Understanding OpenAI o1: OpenAI's o1-preview, released in September 2024, introduces "reasoning tokens" to enhance complex problem-solving capabilities. This post explores the model's reasoning process, debu...


Interconnects (Nathan Lambert) ▷ #posts (3 messages):

  • Discord Community History
  • Friendship Origins
  • Engagement Expressions

Torchtune ▷ #general (3 messages):

  • Llama 4 Training
  • Meta's unexpected projects

Torchtune ▷ #dev (20 messages🔥):

  • Graph Breaks and Activation Offloading
  • PPO Performance Issues
  • Profiling Techniques
  • Checkpoints and Activation State

DSPy ▷ #show-and-tell (6 messages):

  • DSPy Signatures
  • Typed Outputs
  • Server Generation with vLLM
  • Constrained Generation

Link mentioned: Tweet from Omar Khattab (@lateinteraction): @karthikkalyan90 @dottxtai Hey Karthik! Super cool! But btw you can just ask for type bool directly and use a server like vLLM that uses Outlines for constrained generation — and you’ll get scheme adh...


DSPy ▷ #papers (1 message):

js7772219: <@738704828494118953> good feedback!


DSPy ▷ #general (14 messages🔥):

  • Streaming DSPy completions
  • Synthetic data generation with pre-trained models
  • Textgrad integration
  • User feedback on streaming needs


OpenInterpreter ▷ #general (17 messages🔥):

  • Anthropic API Support Issues
  • Beta Testing Opportunities
  • Invite Link Problems
  • Open Interpreter Desktop Subscription Upgrade
  • Request Size Error

OpenInterpreter ▷ #O1 (1 message):

  • ngrok issues
  • busy harvest season

OpenInterpreter ▷ #ai-content (2 messages):

  • Meta FAIR robotics developments
  • Meta Sparsh
  • Meta Digit 360
  • Meta Digit Plexus
  • Open source community impact

Link mentioned: Tweet from AI at Meta (@AIatMeta): Today at Meta FAIR we’re announcing three new cutting-edge developments in robotics and touch perception — and releasing a collection of artifacts to empower the community to build on this work. Deta...


LAION ▷ #general (16 messages🔥):

  • Autoregressive Image Generation
  • Patch Artifacts in Image Generation
  • MAR Model
  • VAE Usage
  • Meta's New Video Technology

LAION ▷ #research (2 messages):

  • TokenFormer architecture
  • Sparse Autoencoders (SAEs)
  • SDXL Turbo
  • Text-to-image models


LlamaIndex ▷ #blog (3 messages):

  • Open Telemetry
  • Llama Impact Hackathon
  • LlamaParse new features

LlamaIndex ▷ #general (13 messages🔥):

  • Neo4j PropertyGraphIndex Issues
  • Changelog Location
  • Workflows to Tools Conversion
  • Workflow Queries

Cohere ▷ #discussions (3 messages):

  • LLM Framework
  • Component Building
  • Tailwind Support
  • Output Issues

Cohere ▷ #questions (4 messages):

  • Master Thesis Collaboration
  • Expediting Applications

Cohere ▷ #api-discussions (4 messages):

  • Command R Reliability Scores
  • Command R Fine-Tuning
  • Benchmarks for AI Models

Link mentioned: Introducing Command R Fine-Tuning: Industry-Leading Performance at a Fraction of the Cost: Command R fine-tuning offers superior performance on enterprise use cases and costs up to 15x less than the largest models on the market.


Cohere ▷ #projects (1 message):

  • Agent Building Experience
  • Application Review Process

Modular (Mojo 🔥) ▷ #general (2 messages):

  • Mojmelo Project
  • Level Advancement

Link mentioned: Mojmelo/tests/LogisR_test.mojo at main · yetalit/Mojmelo: Machine Learning algorithms in pure Mojo 🔥. Contribute to yetalit/Mojmelo development by creating an account on GitHub.


Modular (Mojo 🔥) ▷ #mojo (7 messages):

  • Mojo parametric capability
  • mojo test issues in GitHub Actions
  • Syntactic macros concerns
  • Support for custom decorators
  • Malloc fault issues

Link mentioned: Issues · modularml/mojo: The Mojo Programming Language. Contribute to modularml/mojo development by creating an account on GitHub.


OpenAccess AI Collective (axolotl) ▷ #general (5 messages):

  • Axolotl Docker Image Release Strategy
  • Stable Release Plans
  • Previous Release Information
  • Testing Procedures

Alignment Lab AI ▷ #ai-and-ml-discussion (1 message):

tpojd: steam gift 50$ - steamcommunity.com/gift-card/pay/50
@everyone


Alignment Lab AI ▷ #general (1 message):

tpojd: steam gift 50$ - steamcommunity.com/gift-card/pay/50
@everyone


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

  • Course Guidance
  • Website Navigation

tinygrad (George Hotz) ▷ #learn-tinygrad (1 message):

  • Device Driver Methods
  • Hailo Python Wrapper
  • Proprietary Compilation Process

Mozilla AI ▷ #announcements (1 message):

  • Mozilla Builders Demo Day
  • Builders Accelerator program
  • Event Registration
  • Open Source AI Projects

Link mentioned: San Francisco Builders Demo Day Event Application: We have limited space available to attend Mozilla Builders Demo Day on December 5th. The provided email, name, and GitHub profile will be submitted with this form. We use your information only to pr...







{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}