Frozen AI News archive

not much happened today

**Meta** released **SAM 2**, a unified model for real-time object segmentation with a new dataset 4.5x larger and 53x more annotated than previous ones. **FastHTML**, a new Python web framework by **Jeremy Howard**, enables easy creation and deployment of interactive web apps. **Scale AI** launched the SEAL Leaderboard on adversarial robustness, topped by **Gemini 1.5 Pro** from **Google DeepMind**. **Apple** published a technical report on their Intelligence Foundation Language Models for on-device and server use. **Yann LeCun** emphasized the importance of open source AI in an article co-authored with Martin Casado and Ion Stoica. **Maarten Grootendorst**'s "Visual Guide to Quantization" on efficient LLM inference went viral. **ChatGPT** started rolling out advanced voice and vision-enabled modes to select users. **Leonardo AI** was acquired by **Canva**. **Jim Fan** shared insights on Project Groot augmenting human demonstration data for robotics. **Midjourney v6.1** was released.

Canonical issue URL

AI News for 7/29/2024-7/30/2024. We checked 7 subreddits, 384 Twitters and 28 Discords (248 channels, and 2257 messages) for you. Estimated reading time saved (at 200wpm): 262 minutes. You can now tag @smol_ai for AINews discussions!

A few small items:

We had fun recording a demo of Advanced Voice Mode, coming on the next LS podcast.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Meta Releases SAM 2 for Object Segmentation

New Web Development Framework: FastHTML

AI Model Developments and Benchmarks

Open Source AI and Compute Resources


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Quantization Advancements for Efficient LLM Inference

Theme 2. Meta's Open-Source AI Contributions and Impact

Theme 3. Performance Comparisons of Recent LLM Releases

Theme 4. Hardware and Efficiency Considerations for Local LLM Inference

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

TO BE COMPLETED


AI Discord Recap

A summary of Summaries of Summaries

Claude 3.5 Sonnet

1. LLM Advancements and Benchmarking

2. Model Optimization and Performance Tuning

3. Open-Source AI Developments

4. AI Industry News and Partnerships


PART 1: High level Discord summaries

HuggingFace Discord


LM Studio Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


Unsloth AI (Daniel Han) Discord


CUDA MODE Discord


Nous Research AI Discord


OpenAI Discord


Cohere Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


OpenAccess AI Collective (axolotl) Discord


LangChain AI Discord


OpenRouter (Alex Atallah) Discord


Latent Space Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


Interconnects (Nathan Lambert) Discord


DSPy Discord


AI21 Labs (Jamba) Discord


LAION Discord


Mozilla AI Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace â–· #announcements (1 messages):

  • Llama 3.1 Release
  • Argilla 2.0 Sneak Peek
  • PEFT v0.12.0
  • Hugging Face Video Initiative
  • Inference as a Service with Nvidia

Links mentioned:


HuggingFace ▷ #general (301 messages🔥🔥):

  • Token Limit for Meta LLaMA
  • Issues with Hugging Face Datasets
  • Training Models on Different GPUs
  • Background Removal Model Updates
  • Using Accelerator for TPUs

Links mentioned:


HuggingFace â–· #today-im-learning (4 messages):

  • Extensive Testing Insights
  • Quantization in LLMs

Link mentioned: A Visual Guide to Quantization: Exploring memory-efficient techniques for LLMs


HuggingFace ▷ #cool-finds (9 messages🔥):

  • Quantization of LLMs
  • YouTube Content on AI
  • Diffusion Models
  • TikTok AI Trends

Links mentioned:


HuggingFace ▷ #i-made-this (11 messages🔥):

  • ClIP-Enhanced Text Search
  • Unity ML Agent Experiment
  • Frooty Integration Queries
  • Guanaco Evolved Dataset
  • SAM v2 Segmentation Mask Generator

Links mentioned:


HuggingFace â–· #reading-group (2 messages):

  • Music Generation Models
  • Autoregressive Model Techniques
  • Papers on Music Generation

HuggingFace â–· #core-announcements (1 messages):

  • Quantization Tools for Diffusion Transformers
  • Transformer-based Diffusion Models
  • Memory Savings in Large Models
  • High-Resolution Text-to-Image Generation

Link mentioned: Memory-efficient Diffusion Transformers with Quanto and Diffusers: no description found


HuggingFace â–· #NLP (6 messages):

  • CachedGISTEmbedLoss function
  • Evaluating translated sentences
  • Statistical methods for evaluation
  • Seq2seq tasks limitations
  • Pseudo labels and ontology use

HuggingFace â–· #diffusion-discussions (4 messages):

  • HuggingChat integration
  • Knowledge Distillation of 7B model
  • SOTA Image Generation
  • Compute Resources for Model Training

LM Studio ▷ #general (195 messages🔥🔥):

  • LM Studio Model Loading Issues
  • Llama 3.1 Model Behavior
  • AI Development Learning Resources
  • Thread Count Adjustment in Local Server
  • System Prompt Management

Links mentioned:


LM Studio ▷ #hardware-discussion (93 messages🔥🔥):

  • GPU Compatibility and Performance
  • LM Studio ROCm Release
  • Motherboard and GPU Upgrades
  • AI Streaming Setup
  • Driver and Cooling Issues

Link mentioned: configs/Extension-Pack-Instructions.md at main · lmstudio-ai/configs: LM Studio JSON configuration file format and a collection of example config files. - lmstudio-ai/configs


Perplexity AI â–· #announcements (1 messages):

  • Perplexity Publishers Program
  • Partnerships with Media Organizations
  • Revenue Sharing Initiatives

Link mentioned: Introducing the Perplexity Publishers’ Program: From day one, we’ve included citations in each answer, ensuring publishers receive proper credit and building user trust.


Perplexity AI ▷ #general (209 messages🔥🔥):

  • Perplexity AI Down
  • Perplexity Publishers Program
  • AI Language Models Comparison
  • Impact of Pricing on Services
  • User Support and Experience

Links mentioned:


Perplexity AI â–· #sharing (2 messages):

  • Tesla's Charging Warning
  • Genetically Engineered Flies
  • Space Force Satellite Expansion

Link mentioned: Perplexity: Perplexity is a free AI-powered answer engine that provides accurate, trusted, and real-time answers to any question.


Perplexity AI ▷ #pplx-api (8 messages🔥):

  • Llama-3 model issues
  • Deprecation of Llama models
  • Request for citations in API

Link mentioned: Request for citations in API: no description found


Stability.ai (Stable Diffusion) â–· #announcements (1 messages):

  • Stable Artisan new command
  • Image style generation

Stability.ai (Stable Diffusion) ▷ #general-chat (215 messages🔥🔥):

  • Graphics Card Performance
  • Training LoRA Models
  • AI Animation Tools
  • SAM 2 Segment Anything Model
  • Stable Diffusion Configurations

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (63 messages🔥🔥):

  • Unsloth usage on Windows
  • LLM fine-tuning discussions
  • Matrix representation with custom tokens
  • Support for rope scaling
  • Dataset building tools

Links mentioned:


Unsloth AI (Daniel Han) â–· #off-topic (6 messages):

  • Fine-Tuning
  • Applied LLMs Course
  • Prompt Engineering Video
  • LLM Educational Resources

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (142 messages🔥🔥):

  • Fine-tuning models
  • Memory management techniques
  • Model conversion
  • Model performance metrics
  • Multi-GPU usage issues

Links mentioned:


Unsloth AI (Daniel Han) â–· #community-collaboration (4 messages):

  • Translation datasets
  • Continued pretraining
  • Blockchain Engineer portfolio

Links mentioned:


CUDA MODE ▷ #general (8 messages🔥):

  • Randomized SVD
  • Jacobian method for SVD
  • CUBLAS vs CUTLASS
  • Accel Venture Capital
  • Learning materials for distributed training

Link mentioned: Accel: Accel is a global venture capital firm, and the first partner to exceptional teams from seed to IPO. Facebook, Flipkart, CrowdStrike, UiPath, and Spotify are among the companies Accel has backed over ...


CUDA MODE â–· #triton (3 messages):

  • PTX Output Verification
  • Libdevice Non-Approximated Exponential
  • Code Reflection in Java for Triton
  • IR and MLIR Dialects

Links mentioned:


CUDA MODE â–· #torch (2 messages):

  • CUDA Memory Alignment
  • PyTorch Tensor Alignment

CUDA MODE â–· #announcements (1 messages):

  • CUDA MODE IRL Meeting
  • Keynotes by ML systems leaders
  • Working Groups and Hack Leads
  • Fireside Chat with Wen-mei Hwu
  • Sponsorship and Registration

CUDA MODE â–· #cool-links (2 messages):

  • Machine Intelligence Advances
  • PIM Rediscovery

Link mentioned: Experimental demonstration of magnetic tunnel junction-based computational random-access memory - npj Unconventional Computing: no description found


CUDA MODE â–· #beginner (1 messages):

  • CUDA Memory Types
  • VRAM and Caches
  • Memory Management in CUDA

CUDA MODE ▷ #torchao (19 messages🔥):

  • Optimizer CPU Offload
  • FSDP and Optimizers
  • Low-Bit Optimizers
  • CUDA Streams and Transfers
  • Pytorch AO Tutorials

Links mentioned:


CUDA MODE ▷ #off-topic (14 messages🔥):

  • SAM 2 demo
  • Object Tracking Challenges

Link mentioned: SAM 2 Demo | By Meta FAIR: Track an object across any video and create fun effects interactively, with as little as a single click on one frame.


CUDA MODE â–· #hqq (1 messages):

  • Apple and HQQ+
  • LoRA adapters
  • Quantization loss recovery

Link mentioned: Tweet from Teortaxes▶️ (@teortaxesTex): 4chan did quantization loss recovering LoRAs 3 months before Apple made it cool btw Quoting Blaze (Balázs Galambosi) (@gblazex) One of the most interesting things in the Apple paper for me was how ...


CUDA MODE ▷ #llmdotc (125 messages🔥🔥):

  • Llama 3 Finetuning
  • RoPE Integration
  • SwiGLU Implementation
  • Training Code Developments
  • Model Evaluations and Discussions

Links mentioned:


CUDA MODE â–· #webgpu (7 messages):

  • WebGPU API
  • gpu.cpp usage
  • Realtime multimodal integration
  • Hybrid model computation
  • Local device computation

CUDA MODE ▷ #cudamode-irl (13 messages🔥):

  • Event Registration
  • Keynote Recording
  • Attendee Guidelines
  • GPU Access
  • Event Excitement

Nous Research AI â–· #research-papers (1 messages):

_paradroid: https://arxiv.org/abs/2407.04620


Nous Research AI â–· #off-topic (1 messages):

mautonomy: <:uhhh:1133962718349639721> <:thinking:1134948374760669225>


Nous Research AI â–· #interesting-links (5 messages):

  • AI Model Size Comparison
  • AI Interpretability Concerns

Links mentioned:


Nous Research AI ▷ #general (79 messages🔥🔥):

  • Apple AI Models
  • Hermes Model Merging Techniques
  • Midjourney V6.1 Launch
  • GPT-4o Capabilities Report
  • New MRLM Method

Links mentioned:


Nous Research AI ▷ #ask-about-llms (18 messages🔥):

  • NousResearch/Meta-Llama-3.1-8B-Instruct differences
  • Theta performance issues
  • Hermes 3 and custom datasets
  • BigCodeBench leaderboard

Link mentioned: Can Ai Code Results - a Hugging Face Space by mike-ravkine: no description found


Nous Research AI â–· #rag-dataset (1 messages):

teknium: https://x.com/omarsar0/status/1818139150882664696


Nous Research AI â–· #reasoning-tasks-master-list (1 messages):

n8programs: awesome


OpenAI â–· #annnouncements (1 messages):

  • Advanced Voice Mode Rollout
  • Safety Reinforcements for Voice Conversations
  • GPT-4o Voice Capabilities Testing
  • Planned Features for ChatGPT Plus
  • Privacy Measures in Voice Mode

OpenAI ▷ #ai-discussions (51 messages🔥):

  • Search GPT access
  • AI-DALLE 3 challenges
  • GPT-4o advanced vision release
  • AGI discussions
  • Midjourney V6.1 release

OpenAI ▷ #gpt-4-discussions (14 messages🔥):

  • GPT Vision Release
  • OpenAI Assistant Discussions
  • Quality of GPT Responses
  • API Response Size for GPT Actions
  • Memory Functionality in GPT

OpenAI ▷ #prompt-engineering (8 messages🔥):

  • GPTs training data awareness
  • DALL-E bot channel issues
  • Custody schedule script problems
  • Function call performance in GPT-4

OpenAI ▷ #api-discussions (8 messages🔥):

  • GPT Training Data Transparency
  • DALL-E Bot Functionality Issues
  • Custody Schedule Python Script
  • Function Call Performance in GPT-4o

Cohere ▷ #discussions (33 messages🔥):

  • API Issues
  • Cohere Team Acknowledgment
  • Project Development with Cohere API
  • Office Hours Announcement

Link mentioned: incident.io - Status pages: no description found


Cohere â–· #questions (5 messages):

  • Cohere Status Page
  • Enterprise Workflow Automation Webinar

Link mentioned: Cohere Status Page Status: Latest service status for Cohere Status Page


Cohere ▷ #api-discussions (23 messages🔥):

  • Cohere API downtime
  • Connector response format
  • Move towards tool usage

Links mentioned:


Cohere ▷ #cohere-toolkit (13 messages🔥):

  • Cohere API Performance
  • Web Search Tool Implementation
  • Industry Hype Cycle
  • Interview Preparation
  • Comparison Testing

Modular (Mojo 🔥) ▷ #general (20 messages🔥):

  • Mojo Community Meeting #5
  • Stack-PR Installation
  • GitHub Documentation Issues
  • Community Feedback
  • Conda Packaging for Stack-PR

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (42 messages🔥):

  • Difference between structs and classes in Mojo
  • CSV reader capabilities in Mojo
  • Performance implications of using structs
  • Image parsing libraries in Mojo
  • Dynamic behavior in Mojo classes

Links mentioned:


LlamaIndex â–· #announcements (1 messages):

  • LlamaIndex Office Hours
  • Building Agents
  • In-depth Questions

Link mentioned: LlamaIndex Community Office Hours: Have in-depth questions or feedback for the folks at LlamaIndex? Sign up for our community office hours! We'll get back to you to set up a 15-30 minute Zoom call to chat. We are particularly inter...


LlamaIndex â–· #blog (5 messages):

  • GraphRAG technique
  • Webinar Scheduling
  • Agentic Applications Office Hours
  • LlamaCloud QA Assistant Feature
  • MLflow in LlamaIndex

LlamaIndex ▷ #general (55 messages🔥🔥):

  • Spam Issues
  • LlamaIndex Instrumentation
  • RAPTOR Pack Update
  • Mermaid Diagrams
  • Pydantic Models

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (8 messages🔥):

  • Transformers Error
  • Q-Galore Status
  • Gemma-2-27B Configuration
  • Chat Template Training
  • ``

OpenAccess AI Collective (axolotl) ▷ #general-help (43 messages🔥):

  • Gemma 2 model fine-tuning issues
  • RTX 4090 for chatbot training
  • Best practices for fine-tuning models
  • Retrieval Augmented Generation (RAG)
  • Loss function anomalies in training

LangChain AI ▷ #general (45 messages🔥):

  • Agent Executor and Toolkits Usage
  • LangGraph Functionality
  • Llama 3.1 Tool Calling
  • LangChain Context Caching
  • Google Gemini Integration

Links mentioned:


LangChain AI â–· #langserve (1 messages):

ericyin_41626: https://github.com/langchain-ai/langserve/issues/720


LangChain AI â–· #share-your-work (2 messages):

  • Turing Test Implementation
  • SWE Agent Development

Links mentioned:


LangChain AI â–· #tutorials (2 messages):

  • Self Learning Llama3.1
  • SWE Agent Guide
  • LangChain

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (47 messages🔥):

  • Palm Chat 2 surge
  • GPT-4o capabilities
  • Cost tracking alternatives
  • Claude model instruction templates

Latent Space ▷ #ai-general-chat (33 messages🔥):

  • SAM 2 Release
  • Leonardo AI Joins Canva
  • Kagi LLM Benchmarking
  • OpenAI and Anthropic Collaboration with Brands
  • White House Report on Open-Source AI

Links mentioned:


Latent Space â–· #ai-announcements (1 messages):

  • Apple Intelligence
  • macOS updates
  • iPhone updates

OpenInterpreter ▷ #general (14 messages🔥):

  • Open Interpreter use cases
  • AI for coding
  • Speech command technology
  • Command line options
  • Workflow with Llama 3.1

OpenInterpreter ▷ #O1 (11 messages🔥):

  • Wayland
  • Open Interpreter Installation
  • Pre-order Status
  • Building Parts
  • Poetry Version

OpenInterpreter â–· #ai-content (2 messages):

  • Perplexica
  • Llama-3.1
  • Open source alternatives

Links mentioned:


tinygrad (George Hotz) ▷ #general (12 messages🔥):

  • View Merging Task
  • Shape Tracker Reduction
  • YouTube Talk on Parallel Computing
  • OpenCL Resource Errors

Link mentioned: I want a good parallel computer - UCSC Colloquium: This is the video of a talk I gave at the UC Santa Cruz CSE Colloquium on Apr 10, 2024. The slides are available here: https://docs.google.com/presentation/d...


tinygrad (George Hotz) ▷ #learn-tinygrad (13 messages🔥):

  • Gradients in Training Loop
  • Using TinyJit
  • Jitting the Step Function

Interconnects (Nathan Lambert) ▷ #news (15 messages🔥):

  • Apple's AI Model Training
  • Tim Dettmers' New Role
  • Sewon Kim's Recruitment
  • TPUs Usage
  • Job Market Insights

Links mentioned:


Interconnects (Nathan Lambert) â–· #random (6 messages):

  • Zuck's Stage Performance
  • Perplexity Publishers Program
  • OpenAI Cookbook Controversy
  • Email Open Rates Decline
  • iCloud Private Relay Issues

Links mentioned:


Interconnects (Nathan Lambert) â–· #reads (1 messages):

xeophon.: https://152334h.github.io/blog/scaling-exponents/


DSPy â–· #papers (2 messages):

  • OPTO and Trace
  • AI in Gaming History
  • Neural Networks Evolution
  • Microsoft's AI Innovations

Links mentioned:


DSPy ▷ #general (11 messages🔥):

  • MIRPO updates
  • Penalty metrics in answer scoring
  • DSPy with semantic kernel
  • Optimizers for celebrity exemplars
  • Mean Squared Error in penalties

DSPy â–· #examples (1 messages):

batmanosama: https://github.com/ax-llm/ax


AI21 Labs (Jamba) â–· #announcements (1 messages):

  • Long Context Use Cases
  • Developer Collaboration
  • Enterprise Customer Feedback

AI21 Labs (Jamba) â–· #general-chat (4 messages):

  • Joining Discord
  • Learning about Jamba

LAION â–· #general (2 messages):

  • SWE-Bench Ultra-Hackathon
  • Segment Anything Model 2

Links mentioned:


Mozilla AI â–· #announcements (1 messages):

  • Sentry
  • AutoFix feature





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}