Frozen AI News archive

not much happened today

**OpenAI's o1-preview and o1-mini models** lead benchmarks in Math, Hard Prompts, and Coding. **Qwen 2.5 72B** model shows strong performance close to **GPT-4o**. **DeepSeek-V2.5** tops Chinese LLMs, rivaling **GPT-4-Turbo-2024-04-09**. **Microsoft's GRIN MoE** achieves good results with 6.6B active parameters. **Moshi voice model** from Kyutai Labs runs locally on Apple Silicon Macs. **Perplexity app** introduces voice mode with push-to-talk. **LlamaCoder** by Together.ai uses **Llama 3.1 405B** for app generation. **Google DeepMind's Veo** is a new generative video model for YouTube Shorts. The **2024 ARC-AGI competition** increases prize money and plans a university tour. A survey on model merging covers 50+ papers for LLM alignment. The **Kolmogorov–Arnold Transformer (KAT)** paper proposes replacing MLP layers with KAN layers for better expressiveness. **Hugging Face Hub** integrates with **Google Cloud Vertex AI Model Garden** for easier open-source model deployment. **Agent.ai** is introduced as a professional network for AI agents. *"Touching grass is all you need."*

Canonical issue URL

AI News for 9/18/2024-9/19/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (221 channels, and 2506 messages) for you. Estimated reading time saved (at 200wpm): 303 minutes. You can now tag @smol_ai for AINews discussions!

After a jam packed day yesterday, the AI community took a breather.

If so inclined, you could check out new talks from Strawberry team members Hyung Won Chung and Noam Brown (who is now hiring multi-agent researchers), as well as brief comments in The Information and @Teortaxes for hints on o1 under the hood. Nous Research announced Forge, their attempt at an open o1 repro, yesterday.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Releases and Benchmarks

AI Tools and Applications

AI Research and Development

AI Industry and Business

AI Ethics and Societal Impact

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Moshi: Open-Source End-to-End Speech-to-Speech Model

Theme 2. LLM Quantization: Balancing Model Size and Performance

Theme 3. Qwen2.5: Impressive New Model Family Outperforming Larger Competitors

Theme 4. OpenAI's Strawberry Model: Controversy Over Reasoning Transparency

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Advancements and Capabilities

AI-Generated Content Creation

Economic and Societal Impacts of AI


AI Discord Recap

A summary of Summaries of Summaries

O1-preview

Theme 1: AI Models on Steroids: New Kids on the Block

Theme 2: User Battles with AI Tools: When Tech Fights Back

Theme 3: AI Gets Creative: From Voice Cloning to Storytelling

Theme 4: The AI Community Unites: Conferences, Hackathons, and Funding

Theme 5: AI Research Hits the Fast Lane with New Tricks

Theme 6. Community Events and Engagement


PART 1: High level Discord summaries

Perplexity AI Discord


LM Studio Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


Stability.ai (Stable Diffusion) Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


Unsloth AI (Daniel Han) Discord


aider (Paul Gauthier) Discord


Eleuther Discord


OpenAI Discord


CUDA MODE Discord


LlamaIndex Discord


LAION Discord


Latent Space Discord


Cohere Discord


Interconnects (Nathan Lambert) Discord


OpenInterpreter Discord


OpenAccess AI Collective (axolotl) Discord


Torchtune Discord


DSPy Discord


tinygrad (George Hotz) Discord


LLM Finetuning (Hamel + Dan) Discord


MLOps @Chipro Discord


DiscoResearch Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Perplexity AI ▷ #general (344 messages🔥🔥):

  • Perplexity AI Subscription Limits
  • Issues with Perplexity Functionality
  • Using AI Models and Their Performance
  • Generating Images with DALL-E 3
  • User Experience with You.com

Links mentioned:


Perplexity AI ▷ #sharing (4 messages):

  • Snap's Large AR Spectacles
  • CATL's Million-Mile Battery
  • Third State Beyond Life and Death
  • Discussions on Washing Machine Modifications
  • Diverse Insights into Multiverse

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (7 messages):

  • Invalid PPLX model issue
  • Outdated model links
  • Sonar model transition
  • Media article links in prompts

Links mentioned:


LM Studio ▷ #general (327 messages🔥🔥):

  • Qwen Model Performance
  • Model Compatibility Issues
  • API Connections
  • Image Processing Challenges
  • Network Configuration for LM Studio

Links mentioned:


LM Studio ▷ #hardware-discussion (26 messages🔥):

  • M4 Mac Mini expectations
  • RAM usage on macOS
  • GPU undervolting
  • NPU options
  • Airflow and thermal management

Link mentioned: Reddit - Dive into anything: no description found


HuggingFace ▷ #announcements (1 messages):

  • Tokenized Title
  • Unity ML Agents Pretraining
  • GSM8K Reasoning Dataset
  • Padoru Dataset
  • Gradio and R Examples

Link mentioned: no title found: no description found


HuggingFace ▷ #general (123 messages🔥🔥):

  • Qwen Math Model Demo
  • PyTorch Conference Attendance
  • Generative Text AI and Syntax Extraction
  • Best LLM for Roleplaying
  • Setting Up Local Voice Chats

Links mentioned:


HuggingFace ▷ #today-im-learning (5 messages):

  • Hugging Face beginner resources
  • AI training processes
  • Multimodal AI training

HuggingFace ▷ #cool-finds (1 messages):

maxstewart.: Hello


HuggingFace ▷ #i-made-this (215 messages🔥🔥):

  • Composite Embeddings
  • Neural Decompiler
  • TensorFlow vs PyTorch
  • ML Agent Integration
  • Real-Time Visualization Tools

Links mentioned:


HuggingFace ▷ #computer-vision (4 messages):

  • Llava OneVision model suffixes
  • reCAPTCHA v2 success rate
  • Model questions on HF Hub

Links mentioned:


HuggingFace ▷ #diffusion-discussions (1 messages):

pseudoterminalx: like ipadapter with a base model(with lora?) finetuned on that style probably


Modular (Mojo 🔥) ▷ #general (9 messages🔥):

  • Mojo roadmap updates
  • Community meeting invitations
  • OpenCV-Python installation issues

Modular (Mojo 🔥) ▷ #announcements (1 messages):

  • Closure of GitHub Discussions
  • Transition to Discord for community interactions
  • Conversion of discussions to issues

Modular (Mojo 🔥) ▷ #mojo (178 messages🔥🔥):

  • Mojo SDK for Windows
  • Decorator Reflection in Mojo
  • Safety and Correctness in Mojo
  • Variable Bit Width Integers
  • Packed Structs in Mojo

Links mentioned:


Modular (Mojo 🔥) ▷ #max (1 messages):

  • AI Development on Laptops
  • MAX Cloud Computational Offering
  • Cost-Effective GPU Solutions

Stability.ai (Stable Diffusion) ▷ #general-chat (181 messages🔥🔥):

  • Lionsgate partnership with RWML
  • Stability AI model comparisons
  • Flux model capabilities
  • Training LoRA and checkpoints
  • Model performance and aesthetics

Links mentioned:


Nous Research AI ▷ #general (152 messages🔥🔥):

  • NousCon Highlights
  • AI Model Developments
  • AI Job Predictions
  • Community Discussions
  • Event Participation

Links mentioned:


Nous Research AI ▷ #ask-about-llms (13 messages🔥):

  • Ollama local API setup
  • Speech-to-Text APIs
  • Hermes-3 model function calling
  • Whisper integration
  • Model precision preferences

Links mentioned:


Nous Research AI ▷ #research-papers (6 messages):

  • Shampoo optimization
  • Diagram of Thought framework
  • ReST-MCTS* paper
  • Reverse engineering O1

Links mentioned:


Nous Research AI ▷ #interesting-links (2 messages):

  • Characterfile format
  • Multi-agent framework tools
  • Twitter archive to character data

Link mentioned: GitHub - lalalune/characterfile: A simple file format for character data: A simple file format for character data. Contribute to lalalune/characterfile development by creating an account on GitHub.


Nous Research AI ▷ #research-papers (6 messages):

  • Shampoo vs Adam
  • Diagram of Thought framework
  • ReST-MCTS paper discussion
  • Reverse engineering O1

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

  • Chatroom features
  • Qwen 2.5
  • Mistral Pixtral
  • Neversleep Lumimaid v0.2
  • Hermes 3

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

  • Google Gemini
  • No-code agent creation
  • Open Agent Cloud
  • Enterprise automation
  • Screen recording agents

OpenRouter (Alex Atallah) ▷ #general (136 messages🔥🔥):

  • OpenAI Rate Limits
  • Payment Issues on OpenRouter
  • Model Sharing and Chat History
  • Integrating New Models
  • Job Impact of AI

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (99 messages🔥🔥):

  • Qwen 2.5 training issues
  • Extreme Quantization
  • Model support inquiries
  • Lightweight tools for OpenAI endpoints
  • Style transfer in model training

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (10 messages🔥):

  • Probability struggles
  • Swift programming
  • Confidence in exams
  • Qwen models
  • AGI challenges

Unsloth AI (Daniel Han) ▷ #help (18 messages🔥):

  • vllm LoRA Adapter Issues
  • Pip Installation Problems
  • Model Fine-tuning Limitations
  • Unsloth Model Usage in Ollama
  • Batch Size and Training Speed

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (2 messages):

  • Fine-tuning BART
  • F1 Score Discrepancy
  • Multiple BOS Tokens Issue

aider (Paul Gauthier) ▷ #general (98 messages🔥🔥):

  • Aider Environment Variables
  • Benchmarks and Performance
  • Model Utilization in Aider
  • Issues and Solutions with Aider Setup
  • RAG Architecture in Aider

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (20 messages🔥):

  • Using Aider in Python apps
  • Security concerns with Aider
  • Creating files with Aider
  • Hugging Face model integration
  • Control over URL scraping

Links mentioned:


aider (Paul Gauthier) ▷ #links (8 messages🔥):

  • ECMAScript vs JavaScript
  • NodeJS alternatives
  • ell Library
  • Prompt Engineering

Link mentioned: Introduction | ell documentation: ell is a lightweight prompt engineering library treating prompts as functions. It provides tools for versioning, monitoring, and visualization of language model programs.


Eleuther ▷ #general (4 messages):

  • airLLM compression
  • Leaderboard tasks
  • HF dataset upload

Eleuther ▷ #research (74 messages🔥🔥):

  • Shampoo Optimizer
  • SOAP Algorithm
  • GFlowNets and JEPA
  • Inference Time Search
  • Self-Supervised Learning Challenges

Links mentioned:


Eleuther ▷ #interpretability-general (15 messages🔥):

  • Data Matching in OPT and Pythia
  • Attention Layer and In-Context Learning Score
  • Distributed Alignment Search and SAEs
  • KV Cache Representation Exploration

Eleuther ▷ #lm-thunderdome (18 messages🔥):

  • Padding and Unpadding Inputs
  • Error Handling in Model Generate
  • Token Generation with Padding
  • Batch Size Management

Link mentioned: lm-evaluation-harness/lm_eval/models/huggingface.py at 9a092f374bdc6d6032ae2b878b7a49b97801ab69 · EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


Eleuther ▷ #gpt-neox-dev (1 messages):

  • Polaris Node Connectivity
  • Job Timeout Issues

OpenAI ▷ #ai-discussions (36 messages🔥):

  • O1-Preview Capabilities
  • Discussion on AI Alignment
  • Qwen 2.5 vs Llama 3.1
  • Recording ChatGPT's Voice
  • Pixtral & OpenRouter

OpenAI ▷ #gpt-4-discussions (21 messages🔥):

  • Daily Limits for GPT Usage
  • GPT-4 and GPT-4o Limits
  • O1 and O1-Mini Message Caps
  • Memory Functionality in GPTs

OpenAI ▷ #prompt-engineering (3 messages):

  • Data extraction with GPT-4o
  • Structured Output feature

OpenAI ▷ #api-discussions (3 messages):

  • GPT-4o Data Extraction
  • Structured Output Feature

CUDA MODE ▷ #general (4 messages):

  • NVIDIA Triton
  • Triton Inference Server
  • Related Discussion Rooms

CUDA MODE ▷ #triton (3 messages):

  • GemLite-Triton
  • Triton Conference Slides
  • Triton-Puzzles Colab
  • Gradio Output

Links mentioned:


CUDA MODE ▷ #torch (7 messages):

  • Chrome Tracing with PyTorch Profiler
  • FlashAttention3 Integration
  • Model Optimization Talks

CUDA MODE ▷ #torchao (4 messages):

  • Torchao Autoquant
  • Torchao Compile

CUDA MODE ▷ #irl-meetup (3 messages):

  • GPU programming groups
  • San Francisco GPU community

CUDA MODE ▷ #llmdotc (10 messages🔥):

  • Hackathon Objectives
  • GQA cuDNN Development
  • L2 Side Aware Optimization
  • Stochastic Rounding Technique

CUDA MODE ▷ #bitnet (10 messages🔥):

  • Language model quantization
  • BitNet performance
  • Knowledge compression
  • Llama 3 8B results
  • Product quantization methods

Links mentioned:


CUDA MODE ▷ #cudamode-irl (17 messages🔥):

  • Hackathon Invitations
  • Access to Hack-Ideas Forum
  • Missing Users in Discord
  • GemLite-Triton Release

Link mentioned: GitHub - mobiusml/gemlite: Simple and fast low-bit matmul kernels in CUDA / Triton: Simple and fast low-bit matmul kernels in CUDA / Triton - mobiusml/gemlite


CUDA MODE ▷ #metal (5 messages):

  • Apple Silicon MLX framework
  • On-device speech models
  • Metal and PyTorch interface

LlamaIndex ▷ #blog (3 messages):

  • Story Generation Agent
  • LlamaParse Premium
  • RAG and agentic applications
  • Opik partnership

LlamaIndex ▷ #general (51 messages🔥):

  • RAG and semantic search
  • Pinecone vector database issues
  • SchemaLLMPathExtractor usage
  • KG schema class requirements
  • Property graph index embedding problems

Links mentioned:


LlamaIndex ▷ #ai-discussion (1 messages):

  • RAG
  • LlamaIndex

LAION ▷ #general (17 messages🔥):

  • Fish Speech
  • AdBot incidents

Link mentioned: Fish Speech 1 - a Hugging Face Space by fishaudio: no description found


LAION ▷ #research (11 messages🔥):

  • Muse text to image
  • Open-source GPT-4o model
  • Dataset building for GPT-4o
  • LLM fine-tuning challenges
  • Tokenization issues in LLMs

Latent Space ▷ #ai-general-chat (20 messages🔥):

  • Fal AI funding
  • OpenAI O1 improvements
  • Jina embeddings v3 launch
  • Runway and Lionsgate collaboration
  • New multi-agent research team at OpenAI

Links mentioned:


Latent Space ▷ #ai-announcements (4 messages):

  • NeurIPS 2024 preparations
  • Accommodation logistics
  • Vancouver event updates

Cohere ▷ #discussions (14 messages🔥):

  • Cohere RAG API
  • Client Design Success
  • 504 Gateway Timeout Errors
  • Learning AI with Cohere
  • Community Engagement

Cohere ▷ #questions (7 messages):

  • Command pricing
  • Dataset access for Aya-101
  • Using Command for tagging
  • Efficiency of new models

Cohere ▷ #api-discussions (2 messages):

  • Rerank Fine-Tuning
  • RAG Results Impact

Interconnects (Nathan Lambert) ▷ #news (22 messages🔥):

  • OpenAI o1 models
  • Knowledge Cutoff
  • Qwen 2.5 72B Performance
  • Livecodebench Comparison
  • AI Reasoning Ability

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

SnailBot News: <@&1216534966205284433>


OpenInterpreter ▷ #general (6 messages):

  • OpenInterpreter error handling
  • Agent performance testing
  • OpenInterpreter processing capabilities
  • Model performance comparison
  • Task success stories with OpenInterpreter

OpenInterpreter ▷ #O1 (6 messages):

  • Perplexity browser settings
  • User experiences with browser
  • Windows and Edge compatibility

OpenInterpreter ▷ #ai-content (5 messages):

  • RAG chat app development
  • Multimodal models for PDF interaction
  • Image and text integration in responses
  • AI use case examples
  • AI-generated content

Link mentioned: Tweet from 😎orlie (@sunglassesface): Google finally came out with something impressive. I just tested it... It's wild. I made this in 10 seconds. Quoting Wolfram Ravenwolf 🐺🐦‍⬛ (@WolframRvnwlf) Agreed. This is both very impress...


OpenAccess AI Collective (axolotl) ▷ #general (10 messages🔥):

  • OBS Screen Recording
  • Screenity Recorder
  • Moshi Speech Models
  • GRIN MoE

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (3 messages):

  • Gemma2 DPO Config Issue
  • Chat Template Modification

Torchtune ▷ #general (1 messages):

  • PyTorch conference
  • New work announcements
  • Community engagement

Torchtune ▷ #dev (8 messages🔥):

  • Conference Livestream Query
  • GitHub PR for kv-caching Fix
  • HH RLHF Dataset Documentation
  • Default Preference Dataset Builder

Link mentioned: Fix kv-cacheing and bsz > 1 in eval recipe by SalmanMohammadi · Pull Request #1622 · pytorch/torchtune: Context What is the purpose of this PR? Is it to add a new feature fix a bug update tests and/or documentation other (please add here) Please link to any issues this PR addresses. closes #160...


DSPy ▷ #general (5 messages):

  • DSPy Program Optimization
  • Bootstrapping in DSPy
  • Cost of Prompt Optimization
  • MIPRO and Optimization
  • Non-Determinism in LLMs

tinygrad (George Hotz) ▷ #general (4 messages):

  • tinybox motherboard
  • CLANG bounty
  • Pull Request discussions

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #general (1 messages):

  • OpenAI's o1 model
  • Large reasoning models

Link mentioned: o1 - What is Going On? Why o1 is a 3rd Paradigm of Model + 10 Things You Might Not Know: o1 is different, and even sceptics are calling it a 'large reasoning model'. But why is it so different, and why does that say about the future? When models ...


MLOps @Chipro ▷ #general-ml (1 messages):

  • LunoSmart AI Venture
  • Tech Stack for Application
  • Cross Platform Development
  • Machine Learning Expertise

Link mentioned: Kosi Nzube: ai developer. I program with my favorite tools: Java, Flutter and Keras with Python. Founder@ lunosmart.com


DiscoResearch ▷ #general (1 messages):

flozi00: https://mistral.ai/news/september-24-release/

Mistral follows with a price drop 💪





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}