Frozen AI News archive

not much happened today

**Liquid AI** held a launch event introducing new foundation models. **Anthropic** shared follow-up research on social bias and feature steering with their "Golden Gate Claude" feature. **Cohere** released multimodal Embed 3 embeddings models following Aya Expanse. There was misinformation about **GPT-5/Orion** debunked by **Sam Altman**. **Meta AI FAIR** announced **Open Materials 2024** with new models and datasets for inorganic materials discovery using the EquiformerV2 architecture. **Anthropic AI** demonstrated feature steering to balance social bias and model capabilities. **NVIDIA**'s **Llama-3.1-Nemotron-70B** ranked highly on the Arena leaderboard with style control. **Perplexity AI** expanded to 100M weekly queries with new finance and reasoning modes. **LangChain** emphasized real application integration with interactive frame interpolation. **Kestra** highlighted scalable event-driven workflows with open-source YAML-based orchestration. **OpenFLUX** optimized inference speed by doubling it through guidance LoRA training. Discussions on AI safety included trust dynamics between humans and AI, economic impacts of AI automation, and the White House AI National Security memo addressing cyber and biological risks. **LlamaIndex** showcased knowledge-backed agents for enhanced AI applications.

Canonical issue URL

AI News for 10/24/2024-10/25/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (232 channels, and 3136 messages) for you. Estimated reading time saved (at 200wpm): 319 minutes. You can now tag @smol_ai for AINews discussions!

Happy weekend.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Models and Research

AI Tools and Infrastructure

AI Safety and Ethics

AI Applications and Use Cases

AI Community and Events

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Meta's Quantized Llama Models: Pushing On-Device AI Forward

Theme 2. Cerebras Inference Achieves 2,100 Tokens/s on Llama 3.1-70B

Theme 3. New Open Source LLMs Push Boundaries of Context Length and Capabilities

Theme 4. Improving LLM Integration for Developers and Mobile Users

Theme 5. Advancements in LLM Benchmarking and Evaluation Tools

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Releases and Capabilities

AI Research and Techniques

AI Model Improvements

AI Ethics and Societal Impact

Hardware and Infrastructure


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1. AI Models and Hardware Break New Ground

Theme 2. Ethical Challenges and Privacy Concerns in AI

Theme 3. User Experiences with AI Tools and Platforms

Theme 4. AI-Assisted Creativity Takes Center Stage

Theme 5. Fine-Tuning AI: Challenges and Best Practices


PART 1: High level Discord summaries

HuggingFace Discord


Unsloth AI (Daniel Han) Discord


Latent Space Discord


Notebook LM Discord Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Eleuther Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Perplexity AI Discord


GPU MODE Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Cohere Discord


OpenInterpreter Discord


LAION Discord


OpenAccess AI Collective (axolotl) Discord


Interconnects (Nathan Lambert) Discord


LangChain AI Discord


LLM Agents (Berkeley MOOC) Discord


DSPy Discord


LLM Finetuning (Hamel + Dan) Discord


Torchtune Discord


Mozilla AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #general (638 messages🔥🔥🔥):

  • AI Infrastructure
  • Usage of Hugging Face and Other Models
  • Quantum Computing in AI
  • Data Privacy and Ethics
  • Video Generation Technology

Links mentioned:


HuggingFace ▷ #today-im-learning (1 messages):

  • Transformers Basics
  • 10M Parameter Model
  • Reddit Posts Generation
  • DeepLLMs Repository
  • Improvements for Learning Models

Link mentioned: DeepLLMs/model_architecture.ipynb at main · its-nmt05/DeepLLMs: Meant for learning the basics of LLMs and transformers and exploring other interesting stuff along the way - its-nmt05/DeepLLMs


HuggingFace ▷ #cool-finds (1 messages):

elliotalder50n: https://lmarena.ai/ whats your opinion on the leaderboard?


HuggingFace ▷ #i-made-this (7 messages):

  • Calculator project using Streamlit
  • Protein and Genomics with HuggingFace
  • Self-Supervised Learning in Autonomous Driving
  • AI RPG Adventure
  • Stable Diffusion 3.5 Large Galleries

Links mentioned:


HuggingFace ▷ #reading-group (2 messages):

  • Automated Penetration Testing Benchmark
  • Ethical Hacking and Cybersecurity Threats
  • Performance of LLM in Cybersecurity

Links mentioned:


HuggingFace ▷ #computer-vision (1 messages):

  • Efficient Transformers for High Feature Channels

HuggingFace ▷ #NLP (2 messages):

  • Uploading models from Google Colab
  • Using .push_to_hub

HuggingFace ▷ #diffusion-discussions (5 messages):

  • Contributing to Hugging Face Diffusers
  • Community Engagement
  • Understanding Noise Effects on Tensors

Link mentioned: <a href="https://github.com/huggingface/diffusers/issues?q=is%3Aopen+is%3Aissue+label%3A"good+first+issue")">Issues%22%3EIssues) · huggingface/diffusers: 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. - Issues · huggingface/diffusers


Unsloth AI (Daniel Han) ▷ #general (233 messages🔥🔥):

  • Unsloth AI Capabilities
  • Finetuning LLMs
  • Dataset Preparation
  • Pandas and cuDF for Data Handling
  • AI Tool Selection

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (96 messages🔥🔥):

  • AI and Lifestyle Changes
  • Data Center Construction Trends
  • Nvidia Market Dominance
  • ML and Robotics Insights
  • FPGA vs. GPU Use Cases

Link mentioned: Reddit - Dive into anything: no description found


Unsloth AI (Daniel Han) ▷ #help (23 messages🔥):

  • Direct Preference Optimization for conciseness
  • Llama 3.2 model availability
  • SQL query generation based on MySQL schema
  • Gemma model errors during inference
  • SFTTrainer compute_metrics examples

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (5 messages):

  • Unsloth's train_on_completions method
  • Model weight efficiency

Latent Space ▷ #ai-general-chat (46 messages🔥):

  • E2B Desktop Sandbox Launch
  • Claude 3.5 Sonnet Features
  • Cerebras Inference Chip Performance
  • OpenAI Orion Model Plans
  • Cohere Multimodal Embeddings

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

fanahova: New pod out with the NotebookLM gang!

https://x.com/FanaHOVA/status/1849861104111059082


Latent Space ▷ #ai-in-action-club (260 messages🔥🔥):

  • Cursor Pro Tips
  • LLM Integration
  • Markdown Generation
  • Audio Issues in Discord
  • OpenAI Documentation Scraping

Links mentioned:


Notebook LM Discord ▷ #use-cases (71 messages🔥🔥):

  • Podcast Customization and AI Interactions
  • Deepfake Technology Discussion
  • Quiz Game Integrations with AI
  • Character Development Using AI
  • AI Performance and Limitations

Links mentioned:


Notebook LM Discord ▷ #general (186 messages🔥🔥):

  • Voice and Audio Issues
  • Using AI for Podcasting
  • Data Storage and Limitations
  • Transcript Generation and Timestamps
  • NotebookLM Features and Improvements

Links mentioned:


LM Studio ▷ #general (199 messages🔥🔥):

  • LM Studio Feature Requests
  • Model Compatibility Issues
  • Performance Concerns with Large Models
  • GPU and CPU Interaction
  • File Management in LM Studio

Links mentioned:


LM Studio ▷ #hardware-discussion (33 messages🔥):

  • Intel Arc A750 performance
  • Gemma 2 token speeds
  • Mistral 7B usability concerns
  • GPU mixing for ML tasks
  • AMD SAM impact on performance

Link mentioned: snowbin: Delightfully crafted pastebin with <3.


aider (Paul Gauthier) ▷ #general (121 messages🔥🔥):

  • DeepSeek performance
  • Aider v0.60.1 features
  • Prompt caching
  • PearAI and Aider integration
  • Claude 1022 behavior issues

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (53 messages🔥):

  • Aider installation issues
  • Using Aider with Groq/Gemma2
  • Aider features and experiences
  • AI tools in the workplace
  • Aider updates

Links mentioned:


Nous Research AI ▷ #general (152 messages🔥🔥):

  • Nous Research and Corporate Partnerships
  • AI Hype Cycle
  • Model Performance and Benchmarks
  • Quantization Techniques
  • Decentralized AI Development

Links mentioned:


Nous Research AI ▷ #ask-about-llms (5 messages):

  • Hermes 3 SFT dataset
  • OpenHermes 2.5 dataset
  • Open source SFT datasets

Link mentioned: teknium/OpenHermes-2.5 · Datasets at Hugging Face: no description found


Nous Research AI ▷ #research-papers (2 messages):

  • Softmax function limitations
  • Adaptive temperature mechanism
  • Linear Attention
  • Attention coefficients

Link mentioned: Tweet from edward hicksford (@citizenhicks): this @GoogleDeepMind paper explores the limitations of the softmax function in artificial intelligence systems, particularly its inability to maintain sharpness in decision-making as the number of inp...


Nous Research AI ▷ #interesting-links (2 messages):

  • SynthID Text Watermarking
  • OmniParser for Screen Parsing

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

  • Softmax Function Limitations
  • Adaptive Temperature Mechanism
  • Linear Attention Performance

Link mentioned: Tweet from edward hicksford (@citizenhicks): this @GoogleDeepMind paper explores the limitations of the softmax function in artificial intelligence systems, particularly its inability to maintain sharpness in decision-making as the number of inp...


Eleuther ▷ #general (11 messages🔥):

  • NEO Model Testing
  • Topology Book Recommendations
  • Fine-Tuning Llama 3.2
  • Embedding Models for Classification
  • Finetuning Strategies

Link mentioned: MTEB Leaderboard - a Hugging Face Space by mteb: no description found


Eleuther ▷ #research (116 messages🔥🔥):

  • Diffusion Models
  • Classifier-Free Guidance
  • Image Captioning Quality
  • Unconditional Generation
  • Model Autophagy Disorder

Links mentioned:


Eleuther ▷ #lm-thunderdome (14 messages🔥):

  • Raw requests analysis
  • Model output issues
  • BOS token requirement
  • Pythia model limitations
  • lm_eval command troubleshooting

Link mentioned: lm-evaluation-harness/lm_eval/tasks/noticia/noticia.yaml at 7882043b4ee1ef9577b829809c2f4970b0bdba91 · EleutherAI/lm-evaluation-harness): A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


Eleuther ▷ #gpt-neox-dev (1 messages):

  • Contributing to gpt-neox
  • GitHub permission issues

OpenAI ▷ #ai-discussions (120 messages🔥🔥):

  • Anthropic's Opus 3.5 Release
  • AGI vs ANI Discussion
  • AI Training Architecture
  • OpenAI Infrastructure Report
  • Co-Pilot Functionality Issues

Links mentioned:


OpenAI ▷ #gpt-4-discussions (15 messages🔥):

  • Ethics team communication
  • GPT-4o memory functionality
  • Using API for AI agents

OpenRouter (Alex Atallah) ▷ #general (127 messages🔥🔥):

  • Cerebras API Acceptance
  • Censorship Issues with Models
  • Prompt Caching
  • Token Limits and Errors
  • Performance of New Models

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (7 messages):

  • OpenRouter Integrations
  • Anthropic/Claude API Access

Stability.ai (Stable Diffusion) ▷ #general-chat (134 messages🔥🔥):

  • Flux and Comic Creation
  • Video Generation with AI
  • Stable Diffusion Models
  • LoRA Training and Usage
  • Art Creation for Music Tracks

Links mentioned:


Perplexity AI ▷ #general (99 messages🔥🔥):

  • Perplexity Pro User Experiences
  • Upcoming AI Model Releases
  • Perplexity App Features
  • Using AI for Legal Research
  • Community Troubleshooting Insights

Links mentioned:


Perplexity AI ▷ #sharing (8 messages🔥):

  • Bitcoin Creator
  • Carbon Capture Technology
  • Space-Based Solar Power
  • Caffeine Influence
  • Haunted Houses

Link mentioned: YouTube: no description found


GPU MODE ▷ #general (3 messages):

  • AI in Veterinary Medicine
  • Community Feedback

GPU MODE ▷ #triton (61 messages🔥🔥):

  • Triton Optimizations
  • BitBLAS Tile Language
  • Mixed Precision Performance
  • Kernel Performance with Custom Ops
  • FA3 Performance Insights

Links mentioned:


GPU MODE ▷ #cool-links (6 messages):

  • Llama 3.2 Model Release
  • Mochi 1 Preview
  • Cerebras Inference Performance

Links mentioned:


GPU MODE ▷ #torchao (23 messages🔥):

  • Llama 3.2 Open Sourcing
  • Quantization-Aware Training (QAT)
  • QLoRA vs QAT
  • HQQ+ Concept
  • Mixing Precision Techniques

Links mentioned:


GPU MODE ▷ #llmdotc (3 messages):

  • CUDA installation issues
  • Layer normalization errors

GPU MODE ▷ #liger-kernel (5 messages):

  • NanoGPT model training
  • Optimized Triton operations
  • Torch compile usage
  • Model compatibility
  • Performance enhancement functions

GPU MODE ▷ #🍿 (4 messages):

  • Discord Cluster Manager Documentation
  • Collaboration on Development Timeline

Link mentioned: Discord Cluster Manager: Our code will be here https://github.com/gpu-mode/discord-cluster-manager User experience Start on Nov 4 and be feature complete by at most Nov 10. For this work we only need a single node. Claud a...


Modular (Mojo 🔥) ▷ #general (59 messages🔥🔥):

  • Channel for General Questions
  • Kitty Ket's LED Matrix Development
  • PostgreSQL Library Integration
  • Learning Resources for Mojo Language

Modular (Mojo 🔥) ▷ #mojo (1 messages):

  • Mojo Bug Reports
  • Memory Management Issues

Link mentioned: [BUG] Mojo frees memory while reference to it is still in use · Issue #3710 · modularml/mojo: Bug description You cannot take the address of a List data into a variable and use it as a reference later, because Mojo frees the memory allocated to data by destructuring the List as it reckons i...


Modular (Mojo 🔥) ▷ #max (1 messages):

  • Serialized model ingestion
  • Graph API use cases

tinygrad (George Hotz) ▷ #general (33 messages🔥):

  • Deterministic GPU Kernels
  • Floating Point Arithmetic Consistency
  • Metal Compiler Optimization
  • Tinygrad's Approach to Determinism
  • Clang Flags for Metal

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (13 messages🔥):

  • Beam Search in Kernel Space
  • Action Chunked Transformers
  • Environment Variables in Notebooks
  • Complex Numbers Support in Tinygrad

LlamaIndex ▷ #blog (2 messages):

  • Knowledge-backed agents
  • Internal deployment at NVIDIA

LlamaIndex ▷ #general (33 messages🔥):

  • RAG in Production
  • Managing Document Updates
  • LlamaIndex Workflows
  • LlamaDeploy & LlamaIndex Compatibility
  • NVIDIA Case Study & Chainlit Integration

Links mentioned:


Cohere ▷ #discussions (24 messages🔥):

  • Cohere Community Coherence
  • AI Model Advancements
  • Song Embedding Recommendations
  • Aya vs Command Models
  • Upcoming Product Releases

Link mentioned: jalammar.github.io/notebooks/nlp/02_Song_Embeddings.ipynb at master · jalammar/jalammar.github.io: Build a Jekyll blog in minutes, without touching the command line. - jalammar/jalammar.github.io


Cohere ▷ #questions (7 messages):

  • Cohere Sales Contact
  • Song Embedding Recommendations
  • Transcribing Calls in Hindi and Telugu

Links mentioned:


Cohere ▷ #api-discussions (3 messages):

  • JSON Argument Formatting
  • Function Tool Calls

OpenInterpreter ▷ #general (21 messages🔥):

  • Interpreter fixes
  • Rate limits with Claude
  • Custom OpenAI API agent setup
  • YouTube test on Open Interpreter performance
  • Error fixing in OS mode

Links mentioned:


OpenInterpreter ▷ #ai-content (1 messages):

  • Clevrr-Computer
  • Chrome Built-in AI

Links mentioned:


LAION ▷ #general (10 messages🔥):

  • Video Classification Model Training
  • DataLoader Performance
  • Disk IO Monitoring
  • Model Size vs. Batch Size
  • Speech Generation Benchmarks

LAION ▷ #resources (2 messages):

  • Best Practices for Building LLM Applications

Link mentioned: Best Practices for Building Successful LLM Applications | Datahour by Bhavul Gauri: Join this webinar featuring a Senior ML Engineer from Meta, who will share practical insights on building impactful LLM solutions. This session will explore ...


OpenAccess AI Collective (axolotl) ▷ #axolotl-phorm-bot (5 messages):

  • DPO Evaluations
  • Axolotl Codebase
  • Evaluation Metrics
  • Model Predictions
  • Training Callbacks

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search): Understand code, faster.


Interconnects (Nathan Lambert) ▷ #ml-questions (3 messages):

  • mid training inclusion
  • specialized training

Interconnects (Nathan Lambert) ▷ #random (1 messages):

xeophon.: Only if they inject diversity into historical mails


Interconnects (Nathan Lambert) ▷ #memes (1 messages):

xeophon.: https://x.com/andrewwhite01/status/1849710726631522574


LangChain AI ▷ #general (3 messages):

  • Dataset evaluations for PDF data
  • Seeking AI developer

LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

  • Email Timestamp Issue
  • Email Confirmation

DSPy ▷ #show-and-tell (1 messages):

  • MIPROv2
  • Prompt Generation
  • GSM8K Dataset

Link mentioned: Tweet from Karthik Kalyanaraman (@karthikkalyan90): 🧵A simplified implementation of "automatic prompt generation" using the techniques used in MIPROv2 optimizer. This program uses the gsm8k dataset consisting of math problems and is made up of...


DSPy ▷ #general (1 messages):

._j_s: did you get the chance?


LLM Finetuning (Hamel + Dan) ▷ #general (1 messages):

.edgarbc: thanks a lot c123ian! I will check those out.


Torchtune ▷ #general (1 messages):

  • Torchtune Issues
  • Community Contributions

Link mentioned: Issues · pytorch/torchtune: PyTorch native finetuning library. Contribute to pytorch/torchtune development by creating an account on GitHub.


Mozilla AI ▷ #announcements (1 messages):

  • AI content compensation
  • Creative works licensing
  • Mozilla's Data Futures Lab

Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

honolouloute: Good catch





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}