Frozen AI News archive

Not much technical happened today

**OpenAI** announced raising **$6.6B** in new funding at a **$157B valuation**, with ChatGPT reaching *250M weekly active users*. **Poolside** raised **$500M** to advance AGI development. **LiquidAI** introduced three new MoE models (1B, 3B, 40B) with a **32k context window** and efficient token handling. **OpenAI** released Whisper V3 Turbo, an open-source multilingual model with significant speed improvements. **Meta AI FAIR** is hiring research interns focusing on **LLM reasoning, alignment, synthetic data, and novel architectures**. **Cohere** partnered with Fujitsu to launch Takane, a custom Japanese model. Technical discussions included challenges in **LoRA fine-tuning**, **float8 quantization** in Keras, and new tools like **create-llama** for agent templates. Industry commentary raised concerns about AI development priorities and highlighted freelancing opportunities in AI.

AI News for 10/1/2024-10/2/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (225 channels, and 1832 messages) for you. Estimated reading time saved (at 200wpm): 219 minutes. You can now tag @smol_ai for AINews discussions!

Today OpenAI announced raising $6.6B in new funding at a $157B valuation. On Twitter, ChatGPT head of product Nick Turley added: "250M weekly actives, up from 200M about a month ago".

Also in fundraising news, Poolside announced a $500 million fundraise to make progress towards AGI.

{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Developments and Industry Updates

AI Research and Technical Discussions

AI Industry Trends and Commentary

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. OpenAI's Whisper Turbo: Breakthrough in Browser-Based Speech Recognition

Theme 2. Convergence and Limitations of Current LLM Architectures

Theme 3. Nvidia's NVLM 72B: New Multimodal Model Release

Theme 4. Advancements in On-Device AI: Gemini Nano 2 for Android

Theme 5. Innovative Techniques for Improving LLM Performance

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Releases and Capabilities

AI Company Updates and Events

AI Features and Applications

AI Ethics and Societal Impact

AI Research and Development


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1. Advancements and Launches of AI Models


Theme 2. AI Infrastructure and Tooling Enhancements


Theme 3. AI Ethics, Safety, and Legal Implications


Theme 4. Model Training, Fine-Tuning, and Optimization


Theme 5. AI Integration and Deployment Strategies


PART 1: High level Discord summaries

aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


LM Studio Discord


Nous Research AI Discord


Interconnects (Nathan Lambert) Discord


Unsloth AI (Daniel Han) Discord


GPU MODE Discord


Eleuther Discord


OpenAI Discord


Latent Space Discord


Stability.ai (Stable Diffusion) Discord


Perplexity AI Discord


Cohere Discord


LlamaIndex Discord


Torchtune Discord


Modular (Mojo 🔥) Discord


OpenInterpreter Discord


LangChain AI Discord


OpenAccess AI Collective (axolotl) Discord


tinygrad (George Hotz) Discord


LAION Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Mozilla AI Discord


MLOps @Chipro Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

aider (Paul Gauthier) ▷ #general (198 messages🔥🔥):

  • Prompt Caching Support
  • AI Models Performance Comparison
  • YAML Parsing Issues
  • File Editing with Aider
  • Error Handling in Aider


aider (Paul Gauthier) ▷ #questions-and-tips (70 messages🔥🔥):

  • Architect Mode Usage
  • Cache Management in Aider
  • Setting up Aider with Local Models
  • Norton Antivirus Issues
  • Obsidian and LLM Integrations


aider (Paul Gauthier) ▷ #links (3 messages):

  • o1-engineer tool
  • screenpipe recording


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • Llama 3.1 and 3.2 Endpoints
  • Gemini Token Standardization
  • Cohere Model Discounts
  • Chatroom Upgrades


OpenRouter (Alex Atallah) ▷ #general (244 messages🔥🔥):

  • Realtime API Updates
  • OpenRouter Model Performance
  • File Upload Limitations
  • OpenAI Caching
  • Free Credit Programs


HuggingFace ▷ #announcements (1 messages):

  • Llama 3.2 Release
  • Transformers v4.45.0
  • Whisper Turbo Integration
  • GGUF Model Deployment
  • HuggingChat for macOS


HuggingFace ▷ #general (164 messages🔥🔥):

  • Model Performance Comparison
  • Fine-Tuning Challenges
  • Innovative LLM Projects
  • Hugging Face Contributions
  • Community Queries


HuggingFace ▷ #today-im-learning (5 messages):

  • Dart and Flutter for Mobile Games
  • History of Transcendental Functions
  • Explainable AI Methods for CV
  • Object Detection and Segmentation
  • Understanding τ and Its Mathematical Significance

HuggingFace ▷ #cool-finds (2 messages):

  • Open Source Contributions
  • Medical AI Updates

Link mentioned: Tweet from Arthur Zucker (@art_zucker): For anyone wanting to dive a bit into fsdp, accelerate, training and their integration with transformers, this issue tracker needs you! 🫡🤗 https://github.com/huggingface/transformers/issues...


HuggingFace ▷ #i-made-this (7 messages):

  • NotebookLM Features
  • XP System and Badges

HuggingFace ▷ #computer-vision (4 messages):

  • trocr-large-handwriting
  • Self-driving car models
  • Fine-tuning models

HuggingFace ▷ #NLP (10 messages🔥):

  • Finetuning Mistral 7B
  • Pretraining Misunderstanding
  • Request for Benchmarks
  • Moderator Request
  • General NLP Introduction

LM Studio ▷ #general (125 messages🔥🔥):

  • LM Studio Bugs
  • Llama 3.1 Issues
  • Langflow Integration
  • GPU Utilization Settings
  • Model Compatibility with LM Studio


LM Studio ▷ #hardware-discussion (17 messages🔥):

  • GPU Performance Comparison
  • Thread Count Impact on Inference
  • CPU Utilization Monitoring
  • Llama 3.1 Performance Metrics
  • High-End GPU Setup

Nous Research AI ▷ #general (118 messages🔥🔥):

  • Rapid Model Quantization
  • Audio Token Costs
  • Novel AI Research on Alcohol Effects
  • DisTrO's Reliability Against Bad Data
  • AI Summit 2024 Discounts


Nous Research AI ▷ #ask-about-llms (1 messages):

.faiqkhan: Have you tried lancedb?


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://arxiv.org/abs/2409.14664


Nous Research AI ▷ #interesting-links (3 messages):

  • Nova LLM Suite Launch
  • Personalized AI Research Newsletter


Nous Research AI ▷ #reasoning-tasks (1 messages):

  • o1 reasoning extraction
  • context window exploration

Interconnects (Nathan Lambert) ▷ #news (39 messages🔥):

  • OpenAI's Recent Funding Round
  • Liquid.AI Architecture Discussion
  • AI in Research-Level Mathematics


Interconnects (Nathan Lambert) ▷ #ml-drama (65 messages🔥🔥):

  • AI Safety Discussions
  • Ethics in AI Development
  • Google's AI Ambitions
  • Controversies in AI Usage
  • Funding for AI Research


Interconnects (Nathan Lambert) ▷ #random (9 messages🔥):

  • OpenAI secrets
  • GPU access challenges
  • LLM agent mishap
  • GPU marketplace
  • Shadeform services


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

  • Trash Panda Emoji
  • Social Media Reactions

Link mentioned: Tweet from Trash Panda 🦝 (@trashpandaemoji): @TheXeophon @natolambert


Interconnects (Nathan Lambert) ▷ #rl (2 messages):

  • RL Conference
  • Andrew Barto's Statement

Link mentioned: Tweet from Eugene Vinitsky 🍒 (@EugeneVinitsky): Funniest part of @RL_Conference was when Andrew Barto said "Lets not have RL become a cult" and then received a standing ovation at the end of his talk


Interconnects (Nathan Lambert) ▷ #reads (5 messages):

  • Jack's interview
  • Meta's Llama model training
  • Constrained Generative Policy Optimization
  • Reward models in LLMs
  • Google's insights on model training

Link mentioned: Tweet from Andrew Carr (e/🤸) (@andrew_n_carr): I often wonder how Meta did such a good job post training the Llama series of models. They just released a paper that gives us a good idea. The big challenge is that using a single reward model to...


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

SnailBot News: <@&1216534966205284433>


Unsloth AI (Daniel Han) ▷ #general (82 messages🔥🔥):

  • Feature Extraction with VAE
  • Zoom Event Announcements
  • FP8 Training Challenges
  • NVIDIA AI Summit 2024
  • BFloat16 Performance


Unsloth AI (Daniel Han) ▷ #off-topic (12 messages🔥):

  • AI Game Feedback
  • Login Concerns for AI Game
  • Bot Detection Measures
  • Humorous Programmer Joke

Link mentioned: LLM Jailbreak


Unsloth AI (Daniel Han) ▷ #help (21 messages🔥):

  • Unsloth Model Loading Issues
  • Dataset Organization for Fine-tuning
  • ChatML Template with phi3.5
  • Using Unsloth Models on CPU
  • Temperature Parameter in Training

Link mentioned: phi3 playbook gguf: llama_model_load: error loading model: vocab size mismatch · Issue #418 · unslothai/unsloth: The llama.cpp integration within the playbook does not works, anyway i have manually created the gguf file but when i try to serve the model using the llama.cpp server i am getting the following er...


Unsloth AI (Daniel Han) ▷ #research (1 messages):

edd0302: http://arxiv.org/abs/2409.17264

Crazy parallelization for inference


GPU MODE ▷ #triton (3 messages):

  • Kernel Invocation Parameters
  • Pipelining in Triton
  • num_stages Functionality

GPU MODE ▷ #torch (4 messages):

  • CUDA mode
  • no-libtorch-compile
  • Multithreaded data loading
  • SPDL framework


GPU MODE ▷ #announcements (1 messages):

  • IRL keynotes recordings
  • Talks by notable speakers
  • Accel's contribution

GPU MODE ▷ #cool-links (1 messages):

as_ai: cool: https://openai.com/index/introducing-the-realtime-api/


GPU MODE ▷ #beginner (2 messages):

  • Breaking into Machine Learning
  • Tensor Manipulation in Triton

GPU MODE ▷ #pmpp-book (1 messages):

deon1217: 5th edition is coming by EOY?


GPU MODE ▷ #youtube-recordings (6 messages):

  • Project Presentations Upload
  • Future Events Locations

GPU MODE ▷ #torchao (11 messages🔥):

  • TorchAO vs pytorch/torch/ao
  • Sensitivity scan and pruning
  • Prototyping features in TorchAO
  • Benchmarking and warmup in training


GPU MODE ▷ #sequence-parallel (3 messages):

  • Long Context Methods
  • Survey Papers on Context
  • Author Engagement

GPU MODE ▷ #off-topic (9 messages🔥):

  • Geopolitical Stability
  • Political Discussions in Server
  • User Reactions to Political Stress
  • Community Guidelines on Topics

GPU MODE ▷ #irl-meetup (1 messages):

saurabh_works: Any groups in India? 🇮🇳


GPU MODE ▷ #triton-puzzles (6 messages):

  • Triton Kernel Explanation
  • Add Vector Function
  • Row Major Format in Tensors

GPU MODE ▷ #hqq-mobius (8 messages🔥):

  • AWQ and HQQ comparison
  • Quantization methods
  • Perplexity benchmarks
  • MMLU and GSM8K tests
  • lm eval implementations


GPU MODE ▷ #llmdotc (38 messages🔥):

  • Pipeline Parallelism
  • Activation Checkpointing
  • Zero-3 Implementation
  • Chunked Softmax
  • Sequence Parallelism


GPU MODE ▷ #rocm (1 messages):

  • Advancing AI event
  • ROCM developers

GPU MODE ▷ #liger-kernel (2 messages):

  • Kernel Functional Reminder
  • Contribution Guide Updates

GPU MODE ▷ #metal (1 messages):

  • Prefix sum puzzle
  • Debugging notebook crashes

GPU MODE ▷ #diffusion (2 messages):

  • FLUX Inference Models
  • Custom Kernels Memory Optimization

Link mentioned: flux/src/flux/sampling.py at 87f6fff727a377ea1c378af692afb41ae84cbe04 · black-forest-labs/flux: Official inference repo for FLUX.1 models.


Eleuther ▷ #general (72 messages🔥🔥):

  • Bayesian vs Frequentist Models
  • NYT Lawsuit Implications
  • Scraping Legitimacy
  • Expert Witness Dynamics
  • OpenAI Settlements

Eleuther ▷ #research (13 messages🔥):

  • Sequential Prediction of Output Representations
  • Liquid Neural Networks Application
  • Self-Supervised Learning on Arbitrary Embeddings
  • Transfer Learning Techniques in NLP


OpenAI ▷ #ai-discussions (47 messages🔥):

  • OpenAI Subscription Tiers
  • Voice Model Preferences
  • Liquid AI Architecture
  • Playground Access Issues
  • API Access Updates


OpenAI ▷ #gpt-4-discussions (18 messages🔥):

  • Disappearing Responses Issue
  • Creating Custom GPT with Unique Features
  • o1-preview Model Access Query
  • Using OAuth for Google Drive Connections
  • GPT Policy Violation Appeal Process

OpenAI ▷ #prompt-engineering (3 messages):

  • LLMs and chain-of-thought
  • Midjourney seed number retrieval

OpenAI ▷ #api-discussions (3 messages):

  • LLMs Hallucination Issues
  • Midjourney Seed Number Retrieval

Latent Space ▷ #ai-general-chat (53 messages🔥):

  • OpenAI's new funding round
  • OpenAI's Advanced Voice
  • Multi-GPU training techniques
  • Releases of multimodal language models
  • Azure AI's HD neural TTS


Stability.ai (Stable Diffusion) ▷ #general-chat (49 messages🔥):

  • ComfyUI Installation Issues
  • Flux Model Utilization
  • Automatic1111 Troubles
  • Debian-based OS Preferences
  • Python Version Compatibility


Perplexity AI ▷ #general (38 messages🔥):

  • Rate Limit Increases
  • Llama 3.2 Release
  • LiquidAI Performance
  • Chat Download Feature
  • Text-to-Speech Utility

Perplexity AI ▷ #sharing (7 messages):

  • Perplexity AI and Philosophy
  • LiquidAI GPT Rival Launch
  • Stability AI vs ClipDrop
  • FLUX Model Efficiency
  • Current AI Model Landscape

Perplexity AI ▷ #pplx-api (2 messages):

  • API Credit Usage
  • Account Details Inquiry

Cohere ▷ #discussions (26 messages🔥):

  • Credit card cloud support
  • Event notifications issue
  • MSFT Copilot Studio inquiry

Cohere ▷ #questions (13 messages🔥):

  • Azure Model Refresh Issues
  • Cohere Chat App Roadmap
  • Cohere Webinar Opportunities
  • RAG++ Course Resources
  • Reasoning Models for AI Agents

Link mentioned: Cookbooks — Cohere: Explore a range of AI guides and get started with Cohere's generative platform, ready-made and best-practice optimized.


Cohere ▷ #api-discussions (6 messages):

  • API Error 403
  • Model Transfer Issues
  • Support Contact

LlamaIndex ▷ #blog (2 messages):

  • Contextual Retrieval RAG
  • Multi-agent systems
  • Human in the loop feedback
  • TypeScript workflows

LlamaIndex ▷ #general (37 messages🔥):

  • LlamaIndex Infrastructure
  • GPU Utilization
  • HuggingFace LLM Usage
  • NVLM Support
  • Document Management Strategies


LlamaIndex ▷ #ai-discussion (1 messages):

  • Oracle AI Vector Search
  • LlamaIndex Framework
  • Semantic Search
  • Retrieval Augmented Generation (RAG)

Link mentioned: Oracle AI Vector Search with LlamaIndex: A Powerful Combination: Ankush k Singal


Torchtune ▷ #announcements (1 messages):

  • 2024 PyTorch Contributor Awards
  • Salman Mohammadi
  • Community Contributions
  • PyTorch Growth Statistics

Link mentioned: Announcing the 2024 PyTorch Contributor Awards


Torchtune ▷ #general (18 messages🔥):

  • Knowledge Distillation
  • Training Token Probabilities
  • Dataset Creation for Distillation
  • Optimization Flags in Torchtune


Torchtune ▷ #dev (13 messages🔥):

  • H200s Deployment
  • Local In-House LLMs
  • Healthcare Data Regulations
  • B100s Hardware Plans

Modular (Mojo 🔥) ▷ #mojo (25 messages🔥):

  • Mojo Literals
  • EC2 Instance Requirements
  • Mojo Library Imports
  • Memory Management
  • Import Behavior Differences

OpenInterpreter ▷ #general (9 messages🔥):

  • Nova LLM Launch
  • Function Calls in Open Interpreter
  • Open Interpreter Computer Role
  • Trading View Experience
  • October House Party

Link mentioned: Tweet from Rubiks AI (@RubiksAI): 🚀 Introducing Nova: The Next Generation of LLMs by Nova! 🌟 We're thrilled to announce the launch of our latest suite of Large Language Models: Nova-Instant, Nova-Air, and Nova-Pro. Each designe...


OpenInterpreter ▷ #O1 (7 messages):

  • 01 app capabilities
  • OS mode confusion

OpenInterpreter ▷ #ai-content (3 messages):

  • Realtime API
  • Vision in Fine-Tuning
  • Prompt Caching
  • Model Distillation
  • Tool Use Podcast

Link mentioned: Tweet from Sam Altman (@sama): realtime api (speech-to-speech): https://openai.com/index/introducing-the-realtime-api/ vision in the fine-tuning api: https://openai.com/index/introducing-vision-to-the-fine-tuning-api/ prompt cach...


LangChain AI ▷ #general (15 messages🔥):

  • LangChain support for GPT Realtime API
  • Using HuggingFace models in LangChain
  • Concerns about curly braces in prompt templates
  • Local hardware for AI model deployment
  • Feedback on Microsoft Copilot Studio

LangChain AI ▷ #share-your-work (2 messages):

  • Nova LLMs
  • LumiNova
  • OppyDev AI

Link mentioned: Tweet from Rubiks AI (@RubiksAI): 🚀 Introducing Nova: The Next Generation of LLMs by Nova! 🌟 We're thrilled to announce the launch of our latest suite of Large Language Models: Nova-Instant, Nova-Air, and Nova-Pro. Each designe...


OpenAccess AI Collective (axolotl) ▷ #general (8 messages🔥):

  • NVIDIA's 72B model
  • Qwen 2.5 Deployment
  • Advancements in small models

Link mentioned: Tweet from Phil (@phill__1): Wow nvidia just published a 72B model with is ~on par with llama 3.1 405B in math and coding evals and also has vision 🤯


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (6 messages):

  • hf_mlflow_log_artifacts
  • Custom instruct format in sharegpt
  • YAML configuration for datasets
  • Using Axolotl for instruction tuning


tinygrad (George Hotz) ▷ #general (6 messages):

  • Unboxing Tiny Box
  • GitHub Bugfix Review
  • PR Refactoring Discussion

Link mentioned: Fix tensor saving and loading twice. by vladov3000 · Pull Request #6815 · tinygrad/tinygrad: Solves #6294. See this comment for the details. TLDR; disk devices keep around unlinked files and don't create new files. This is my first contribution, so there may be a few rough edges. Name...


tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

  • tinygrad code benefits
  • Python productivity and C interoperability
  • UOp vs UOP pool optimization issues
  • Compiler/Program/Allocator challenges
  • Distributed training in tinygrad

LAION ▷ #general (2 messages):

  • Spam concerns

LAION ▷ #resources (1 messages):

  • Sci Scope Newsletter
  • ArXiv Paper Summaries
  • Personalized Research Updates

Link mentioned: Sci Scope: An AI generated newsletter on AI research


DSPy ▷ #papers (1 messages):

  • Personalized Newsletter
  • AI Research Updates
  • Weekly Summaries
  • ArXiv Papers
  • Sci Scope Features

Link mentioned: Sci Scope: An AI generated newsletter on AI research


DSPy ▷ #colbert (1 messages):

  • Code Similarity Search
  • Colbert for Code Search
  • Code Search Alternatives

LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

  • Lab Release Schedule
  • Course Communication

Link mentioned: Large Language Model Agents


Mozilla AI ▷ #announcements (2 messages):

  • ML Paper Reading Group
  • Publishing Local LLM Apps
  • Job Board Proposal
  • Lumigator Introduction
  • Upcoming Events

MLOps @Chipro ▷ #general-ml (1 messages):

  • Nova models
  • LumiNova
  • MMLU performance
  • AI Evolution

Link mentioned: Tweet from Rubiks AI (@RubiksAI): 🚀 Introducing Nova: The Next Generation of LLMs by Nova! 🌟 We're thrilled to announce the launch of our latest suite of Large Language Models: Nova-Instant, Nova-Air, and Nova-Pro. Each designe...






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}