Frozen AI News archive

Not much happened today

**Grok Beta** surpasses **Llama 3.1 70B** in intelligence but is less competitive due to its pricing at **$5/1M input tokens** and **$15/1M output tokens**. **Defense Llama**, developed with **Meta AI** and **Scale AI**, targets American national security applications. **SWE-Kit**, an open-source framework, supports building customizable AI software engineers compatible with **Llama 3**, **ChatGPT**, and **Claude**. **LangChainAI** and **Weights & Biases** integrate to improve retrievers and reduce hallucinations in **RAG applications** using **Gemini**. **Perplexity AI** offers enhanced election tracking tools for the **2024 elections**, including live state results and support for **Claude 3.5 Haiku**. **AI Talk** launched featuring discussions on Chinese AI labs with guests from **Qwen**. Memes highlight **Elon Musk** and humorous AI coding mishaps.
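
As a rough illustration of the pricing point above, the quoted per-million-token rates can be turned into a per-request cost. This is a minimal sketch: the function name `request_cost` and the sample workload are hypothetical, and only the two prices come from the summary.

```python
# Prices quoted in the summary above for Grok Beta (USD per 1M tokens).
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 15.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the quoted per-million-token rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload (an assumption, not from the newsletter):
# 2,000 input tokens and 500 output tokens per request.
cost = request_cost(2_000, 500)
print(f"${cost:.4f} per request")
```

Because output tokens are billed at three times the input rate, workloads with long completions are where the price gap would likely matter most.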

AI News for 11/5/2024-11/6/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (217 channels, and 1685 messages) for you. Estimated reading time saved (at 200wpm): 200 minutes. You can now tag @smol_ai for AINews discussions!

For some reason, nobody scheduled big AI releases today. We can't imagine why.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Models and Benchmarking

AI Tools and Development

Political Discussions and Elections

Product Announcements and Integrations

Memes / Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Microsoft's Magentic-One: Open-Source Multi-Agent System Released

Theme 2. Ollama Expands Vision Capabilities with Llama 3.2

Theme 3. Wave Networks: An Innovative Approach Using Complex Vectors

Theme 4. Llama 3.1's Struggles: Tool Usage Failures

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Claude 3.5 Haiku Underperformance and Pricing Issues

Theme 2. PromptGen v2.0 Released: Enhanced Image Captioning and Analysis

Theme 3. Prompt Optimization Tools for Better LoRA Integration


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1. New AI Model Releases and Comparisons

Theme 2. AI Performance Issues and Limitations

Theme 3. AI Hardware and Optimization

Theme 4. AI Tools and Platform Updates

Theme 5. Funding Frenzies and Business Moves in AI


PART 1: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Eleuther Discord


Stability.ai (Stable Diffusion) Discord


LM Studio Discord


Notebook LM Discord Discord


GPU MODE Discord


LlamaIndex Discord


Latent Space Discord


Interconnects (Nathan Lambert) Discord


Cohere Discord


OpenAI Discord


tinygrad (George Hotz) Discord


OpenInterpreter Discord


Modular (Mojo 🔥) Discord


OpenAccess AI Collective (axolotl) Discord


LAION Discord


DSPy Discord


Torchtune Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Perplexity AI ▷ #announcements (1 message):

  • U.S. Presidential Race
  • Election Hub

Perplexity AI ▷ #general (253 messages🔥🔥):

  • Perplexity Pro Features
  • Claude Model Comparison
  • Subscription Activation Issues
  • AI Model Performance
  • User Experience Feedback

Perplexity AI ▷ #sharing (13 messages🔥):

  • Powershell Coding
  • Distinguishing Techniques
  • Differences Explained
  • Utilizing Resources Effectively
  • Creator Economy Size

Perplexity AI ▷ #pplx-api (6 messages):

  • Llama 3.1 API Pricing
  • Return Citations Functionality
  • Haiku 3.5 Limits
  • Translation Requests

Unsloth AI (Daniel Han) ▷ #general (101 messages🔥🔥):

  • Unsloth updates
  • SFT and DPO fine-tuning
  • Model training issues
  • NVIDIA feedback request
  • ECommerce app ideas

Unsloth AI (Daniel Han) ▷ #off-topic (27 messages🔥):

  • NVIDIA GeForce RTX Community Input
  • NIM API Feedback
  • Model Performance for Finnish Language
  • Scam Alerts in Discord
  • Experience with AI Tools

Unsloth AI (Daniel Han) ▷ #help (41 messages🔥):

  • Language Translation Data Formatting
  • Model Performance Issues
  • Fine-tuning Llama Models
  • Integration of Fine-tuned Models
  • Output Handling in Model Generation

HuggingFace ▷ #general (147 messages🔥🔥):

  • Speculative Decoding
  • AWS Q for Fine-tuning
  • Song Generators
  • Gradio Interface
  • Image Similarity with CLIP

HuggingFace ▷ #today-im-learning (5 messages):

  • Building a GPT model
  • Top-p sampling
  • Transformer architecture
  • BERT model plans

HuggingFace ▷ #cool-finds (1 message):

gimmyalex3089: https://cohere.com/research/aya/aya-23-technical-report.pdf


HuggingFace ▷ #i-made-this (7 messages):

  • YOLOv5n6 Real-Time Object Detection
  • Upstage AI Hackathon
  • Contrastive Learning
  • JAX Implementation of Flux.1
  • Formula 1 Telemetry Analysis

HuggingFace ▷ #reading-group (1 message):

west_ryder: 😝


HuggingFace ▷ #computer-vision (3 messages):

  • HuggingMod rate limiting
  • New Microsoft models

HuggingFace ▷ #NLP (3 messages):

  • Function Default Arguments
  • MaskGCT and F5-TTS Streaming Capabilities

OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

  • API Migration
  • Latency Optimization
  • Completion API Updates

OpenRouter (Alex Atallah) ▷ #general (161 messages🔥🔥):

  • Hermes 3 performance
  • Claude API updates
  • Llama model comparisons
  • Rate limits and errors
  • PDF support for Claude

OpenRouter (Alex Atallah) ▷ #beta-feedback (3 messages):

  • Custom Provider Keys
  • Beta Feature Access

aider (Paul Gauthier) ▷ #general (108 messages🔥🔥):

  • Aider 0.62 Version Features
  • Model Selection and Performance
  • Handling File Formats in Aider
  • Configuration Options in Aider
  • Temperature Settings in LLM Models

aider (Paul Gauthier) ▷ #questions-and-tips (47 messages🔥):

  • Benchmarking Aider Requests
  • DeepSeek and Llama.cpp Integration
  • Aider Configuration Issues
  • Using LLM Models with Aider
  • Aider Command Issues

Nous Research AI ▷ #general (98 messages🔥🔥):

  • TEE_HEE_HE Twitter Account
  • Hermes 405B Performance
  • Funding Resources for ML Projects
  • Venice AI Launch
  • OpenAI Eval Feature

Nous Research AI ▷ #ask-about-llms (19 messages🔥):

  • Cursor limitations on codebase
  • Accessing Sonnet 3.5 via web UI
  • Opinions on Haiku 3.5
  • Comparing AI user interfaces

Nous Research AI ▷ #research-papers (1 message):

detailoriented: Has anyone looked at federated learning in any great detail?


Eleuther ▷ #general (15 messages🔥):

  • lm-eval error
  • Autoencoders research
  • Research opportunities in Switzerland
  • NLP faculty at ETH/EPFL
  • Job application advice

Eleuther ▷ #research (89 messages🔥🔥):

  • Hardware-aware algebraic rewrites
  • Flash attention development
  • Attention mechanism visualization
  • Memory access in matrix operations
  • XLA and cuDNN integration

Eleuther ▷ #lm-thunderdome (2 messages):

  • lm_eval performance
  • NCCL warnings
  • Batch size settings

Stability.ai (Stable Diffusion) ▷ #general-chat (103 messages🔥🔥):

  • Stable Diffusion Installation
  • Image Generation Issues
  • Outpainting Techniques
  • ControlNet Models
  • AI Image Expansion

LM Studio ▷ #general (56 messages🔥🔥):

  • LM Studio Portable Version
  • Intel E-Cores Utilization
  • Auto Load Models in LM Studio
  • Llama 3.2 Vision Support
  • Context Window Limitations

LM Studio ▷ #hardware-discussion (37 messages🔥):

  • LLM Benchmarking
  • Windows Scheduler Performance
  • Memory Overclocking Challenges
  • AMD vs Intel Memory Management
  • Single Slot RTX 4090

Link mentioned: RIP Intel: AMD Ryzen 7 9800X3D CPU Review & Benchmarks vs. 7800X3D, 285K, 14900K, & More


Notebook LM Discord ▷ #use-cases (17 messages🔥):

  • NotebookLM integration with Google Drive
  • Use of Diarization in podcasts
  • Deepfake technology discussions
  • Avatars in video podcasts
  • Podcast reuse policies

Notebook LM Discord ▷ #general (64 messages🔥🔥):

  • Podcast Generation from Notes
  • AI Pronunciation Issues
  • Sharing Links
  • Language Settings
  • AI Interaction Glitches

Link mentioned: BYD's Denza: Can It Topple Mercedes & BMW in Luxury EVs? - Unveiling the D9, N9 & More! #suv #MBG.DE: Dive into the world of BYD's luxury EV sub-brand, Denza, as we explore its bold strategy to rival the titans of luxury automotive, Mercedes and BMW. This vid...


GPU MODE ▷ #general (10 messages🔥):

  • NVIDIA AI Dev Tech internship
  • FP8 quantization and computation
  • Dynamic vs Static quantization performance
  • Triton-compiled PTX with CUDA
  • Batch size effects on performance

GPU MODE ▷ #triton (15 messages🔥):

  • GPU Performance Issues
  • Triton Kernel Optimization
  • Quantization Techniques
  • Calling Triton PTX Outside Python
  • Kernel Launch Parameters

GPU MODE ▷ #torch (6 messages):

  • PyTorch Tensor Iteration Performance
  • Numpy Iteration Speed
  • Torch Script Debugging
  • Overhead of tolist()

GPU MODE ▷ #cool-links (2 messages):

  • UV Run in Shebang
  • PlayCanvas SuperSplat
  • Self-contained Python Scripts

GPU MODE ▷ #jobs (2 messages):

  • Internship Opportunities
  • Community Engagement
  • Job Posting Etiquette

GPU MODE ▷ #beginner (2 messages):

  • Caffe2 File Removal
  • PR Series Impact
  • GitHub Contributions

GPU MODE ▷ #jax (2 messages):

  • Learning JAX
  • JAX Implementation of Flux.1

Link mentioned: GitHub - ml-gde/jflux: JAX Implementation of Black Forest Labs' Flux.1 family of models


GPU MODE ▷ #torchao (4 messages):

  • Model Size Reduction
  • PyTorch-Quantization Library Disappearance
  • Page Cleanup Efforts
  • FP32 Computation for Mobile Apps

GPU MODE ▷ #liger-kernel (3 messages):

  • Liger Kernel Release v0.4.0
  • Efficient RMSNorm aggregation
  • GroupNorm Kernel implementation

GPU MODE ▷ #self-promotion (3 messages):

  • Nebius Explorer Tier
  • GPU pricing for researchers
  • Availability of H100 GPUs
  • Self-service platform advantages

GPU MODE ▷ #🍿 (10 messages🔥):

  • Maintainer Access
  • GitHub Access Request
  • Popcorn Project Opportunity
  • Automated Deployments on Heroku
  • Deployment Strategies

GPU MODE ▷ #thunderkittens (7 messages):

  • Desirable Kernels
  • Beginner Contributions to ThunderKittens
  • Preliminary List of Features
  • Long Convolution Example

GPU MODE ▷ #edge (1 message):

  • Training and inference on edge devices
  • Hardware evolution for mobile/embedded
  • Challenges in production environments
  • Consumer and commercial use-cases
  • Hardware heterogeneity

LlamaIndex ▷ #blog (3 messages):

  • NVIDIA competition
  • Automated resume insights agent
  • AI in recruiting

Link mentioned: NVIDIA and LlamaIndex Developer Contest: Stand a chance to win cash prizes, a GeForce RTX GPU, and more.


LlamaIndex ▷ #general (40 messages🔥):

  • Handling ChatMessage input
  • Issues with Anthropic tools
  • Citations in Llama Index
  • Pull Request Guidance
  • Parsing Excel Files

Latent Space ▷ #ai-general-chat (36 messages🔥):

  • Hunyuan-Large MoE Model
  • Integuru AI Agent
  • Chat.com Domain Sale
  • Scale AI's Defense Llama
  • Perplexity's Funding Round

Interconnects (Nathan Lambert) ▷ #events (1 message):

swyxio: mee


Interconnects (Nathan Lambert) ▷ #news (8 messages🔥):

  • Swedish law in German
  • AI search startup Perplexity
  • Google AI reveal
  • Intersection of languages and domains

Interconnects (Nathan Lambert) ▷ #ml-questions (4 messages):

  • Drift in prompts
  • ChatGPT performance tracking
  • Data quality for applications

Interconnects (Nathan Lambert) ▷ #ml-drama (3 messages):

  • GPU Drama
  • V100 SSH Access

Interconnects (Nathan Lambert) ▷ #random (7 messages):

  • Discord app for email verification
  • Manual email collection process
  • External database synchronization

Interconnects (Nathan Lambert) ▷ #memes (2 messages):

  • OpenAI CEO petition
  • Biden not running

Cohere ▷ #discussions (17 messages🔥):

  • Cohere Search Process
  • Embed3 Multimodal Embeddings
  • Parsing Techniques Comparison
  • Cohere Reranker API Availability

Cohere ▷ #questions (4 messages):

  • Cohere billing process
  • Contacting sales

OpenAI ▷ #ai-discussions (9 messages🔥):

  • AI Storytelling Improvements
  • Interactive Prompting Techniques
  • GitHub Copilot Updates

OpenAI ▷ #gpt-4-discussions (4 messages):

  • Document summarization hallucinations
  • Involvement of human experts
  • Canvas document deletion in CGPT4o

OpenAI ▷ #prompt-engineering (3 messages):

  • AI JSON Modification Techniques
  • Assistant API Token Limits
  • File Upload Data Handling

OpenAI ▷ #api-discussions (3 messages):

  • AI Prompting Techniques
  • JSON Data Handling
  • Token Limit Issues

tinygrad (George Hotz) ▷ #general (1 message):

  • TokenFormer implementation
  • tinygrad

Link mentioned: GitHub - kroggen/tokenformer-minimal at tinygrad: Minimal implementation of TokenFormer for inference and learning


tinygrad (George Hotz) ▷ #learn-tinygrad (10 messages🔥):

  • Hailo Reverse Engineering
  • CUDA WMMA Layout Discrepancies

OpenInterpreter ▷ #general (8 messages🔥):

  • Comparative Tool Interfaces
  • OS Mode Updates
  • Claude Computer Control

OpenInterpreter ▷ #O1 (1 message):

zer0blanks.: https://www.tiktok.com/t/ZTFckAFHR/


Modular (Mojo 🔥) ▷ #mojo (8 messages🔥):

  • C_Buffer Structure Changes
  • Performance Improvements using Pointers
  • Understanding Bounds Checks

OpenAccess AI Collective (axolotl) ▷ #general (4 messages):

  • ScheduleFree SOAP
  • Hyperparameter Adjustments
  • MOEs and Model Merging
  • CAME Comparison

Link mentioned: HeavyBall/heavyball/schedule_free_palm_foreach_soap.py at main · ClashLuke/HeavyBall: Implementations of various optimizers; mostly focussing on fast _foreach and PaLM versions - ClashLuke/HeavyBall


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (1 message):

  • Zero2 performance
  • Zero1 troubleshooting

LAION ▷ #general (4 messages):

  • Open Source Speech Enhancer
  • Resemble Enhance

DSPy ▷ #general (2 messages):

  • RLHF paradigm
  • Serialized multi-component systems

Torchtune ▷ #dev (2 messages):

  • KD-div Misinterpretation
  • Cross-Entropy Optimization







{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share it with a friend! Thanks in advance!

{% endif %}