Frozen AI News archive

ChatGPT Advanced Voice Mode

**OpenAI** rolled out **ChatGPT Advanced Voice Mode** with 5 new voices and improved accent and language support, available widely in the US. Ahead of rumored updates for **Llama 3** and **Claude 3.5**, **Gemini Pro** saw a significant price cut aligning with the new intelligence frontier pricing. **OpenAI's o1-preview model** showed promising planning task performance with 52.8% accuracy on Randomized Mystery Blocksworld. **Anthropic** is rumored to release a new model, generating community excitement. **Qwen 2.5** was released with models up to 32B parameters and support for 128K tokens, matching GPT-4 0613 benchmarks. Research highlights include PlanBench evaluation of o1-preview, OpenAI's release of a multilingual MMMLU dataset covering 14 languages, and RAGLAB framework standardizing Retrieval-Augmented Generation research. New AI tools include PDF2Audio for converting PDFs to audio, an open-source AI starter kit for local model deployment, and **Moshi**, a speech-based AI assistant from Kyutai. Industry updates feature **Scale AI** nearing $1B ARR with 4x YoY growth and **Together Compute's** enterprise platform offering faster inference and cost reductions. Insights from **Sam Altman**'s blog post were also shared.

Canonical issue URL

AI News for 9/23/2024-9/24/2024. We checked 7 subreddits, 433 Twitters and 31 Discords (222 channels, and 2572 messages) for you. Estimated reading time saved (at 200wpm): 294 minutes. You can now tag @smol_ai for AINews discussions!

Ahead of rumored Llama 3 and Claude 3.5 updates due tomorrow:

Today we saw a big Gemini Pro price cut, updating Gemini pricing inline with the new $/intelligence frontier we have been charting on this newsletter.

image.png

But probably the headline story is ChatGPT Advanced Voice Mode, which company leaders (like Mira!) announced as "rolling out this week" but it seems most people in the US received access by the end of day. There are 5 new voices, and improved accent/language support. and yes, with some effort, it can still sing!

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Developments and Releases

AI Research and Benchmarks

AI Applications and Tools

AI Industry and Business

AI Ethics and Societal Impact

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Qwen 2.5: A New Benchmark in Local LLM Performance

Theme 2. Advancements in LLM Efficiency and Quantization

Theme 3. AI for Creative Applications: Gaming and Music

Theme 4. New AI Datasets and Research Papers

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Advancements and Releases

AI Research and Development

Industry and Market Developments

Perspectives on AI Progress


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1. AI Models Level Up: New Releases and Major Updates

Theme 2. Voice Features Roll Out Amidst Controversy

Theme 3. Developers Wrestle with AI Integration and Optimization

Theme 4. AI Reasoning and Reliability Under the Microscope

Theme 5. Collaborative Efforts and Tools Enhance AI Development


Note: All links and details are based on the discussions across various Discord channels and reflect the latest updates and community sentiments.


PART 1: High level Discord summaries

OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Eleuther Discord


aider (Paul Gauthier) Discord


OpenAI Discord


GPU MODE Discord


Interconnects (Nathan Lambert) Discord


Nous Research AI Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LM Studio Discord


Modular (Mojo 🔥) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Latent Space Discord


Cohere Discord


Stability.ai (Stable Diffusion) Discord


LlamaIndex Discord


LAION Discord


OpenAccess AI Collective (axolotl) Discord


LangChain AI Discord


Torchtune Discord


tinygrad (George Hotz) Discord


OpenInterpreter Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

OpenRouter (Alex Atallah) â–· #announcements (3 messages):

  • Cursor Integration
  • Gemini Update
  • Database Downtime
  • New Nous Model
  • Open-source Vision Language Models

Links mentioned:


OpenRouter (Alex Atallah) â–· #app-showcase (1 messages):

  • OpenRouter App Development
  • Demo Apps on GitHub

Link mentioned: tai-llm-chat/demos/tool_calling at main · pxl-research/tai-llm-chat: Repository with demo code for LLM's (using Azure OpenAI and OpenRouter) - pxl-research/tai-llm-chat


OpenRouter (Alex Atallah) ▷ #general (378 messages🔥🔥):

  • OpenRouter's middle-out transforms
  • New Gemini Models
  • Token Pricing Structures
  • Performance of various LLMs
  • User Experiences with Models

Links mentioned:


OpenRouter (Alex Atallah) â–· #beta-feedback (1 messages):

godling72: In my case it would be something I'm running myself.


HuggingFace â–· #announcements (1 messages):

  • Mistral Small Model Release
  • Gradio 5 Launch
  • FinePersonas Dataset Introduction
  • bitsandbytes 0.44.0
  • Wikimedia Structured Wikipedia Dataset

Links mentioned:


HuggingFace ▷ #general (117 messages🔥🔥):

  • Hugging Face Token Issues
  • OpenAI's Word Generation
  • Gradio and Model Queries
  • Feedback on Voice Channels
  • Agents IDE for Hugging Face Tools

Links mentioned:


HuggingFace â–· #today-im-learning (2 messages):

  • Neuralink's FP8 Performance
  • Mixed Precision Loss Comparison

HuggingFace ▷ #cool-finds (8 messages🔥):

  • Llama 3.1 Safety Assessment
  • Comic Sans FLUX Model
  • Qwen/Qwen2.5-72B-Instruct Model
  • AI Font Generators
  • Neural Computation Paper

Links mentioned:


HuggingFace ▷ #i-made-this (175 messages🔥🔥):

  • Hugging Face models
  • Audio extraction tools
  • Social engineering GPT
  • Google Gemini object detection
  • Tau LLM training

Links mentioned:


HuggingFace â–· #reading-group (3 messages):

  • HF Dataset
  • Cross-Posting Etiquette

HuggingFace â–· #computer-vision (2 messages):

  • GOT OCR 2 Model
  • Fine-tuning OCR models
  • Text-image datasets
  • Language-specific training

HuggingFace â–· #NLP (7 messages):

  • SetFit Models Training
  • Daily Topic Modeling
  • Sentiment Analysis Methods
  • BERTTopic
  • Zero-shot Topic Definition

HuggingFace ▷ #diffusion-discussions (14 messages🔥):

  • ControlNet_Union in SDXL
  • Training DiT models
  • Sigma_t term importance
  • Denoising processes
  • Latent variable equations

HuggingFace â–· #gradio-announcements (2 messages):

  • Gradio 5 Beta Release
  • Gradio Performance Improvements
  • Modern UI Design in Gradio
  • AI Playground Feature
  • Office Hours Demo

Links mentioned:


Eleuther ▷ #general (143 messages🔥🔥):

  • RWKV Architecture
  • YouTube to Audio Tool
  • Dynamic Evaluation in ML
  • muP Implementation

Links mentioned:


Eleuther ▷ #research (43 messages🔥):

  • Planning capabilities of LLMs
  • Interpretability in AI
  • FP6 floating-point format performance
  • Scaling laws in ML
  • Implicit instruction tuning

Links mentioned:


Eleuther ▷ #scaling-laws (10 messages🔥):

  • Chinchilla and matmul algorithm
  • Strassen algorithm discussions
  • Decomposing models using Strassen
  • Low precision matmul performance

Eleuther ▷ #lm-thunderdome (13 messages🔥):

  • MMLU scores for Pythia 6.9b-deduped
  • Formatting issues with Pile models
  • Performance comparison to ARC
  • Reference to forthcoming paper

Eleuther â–· #gpt-neox-dev (2 messages):

  • trunc_normal initialization
  • AllenAI ablation study
  • model stability

Link mentioned: OLMoE: Open Mixture-of-Experts Language Models: We introduce OLMoE, a fully open, state-of-the-art language model leveraging sparse Mixture-of-Experts (MoE). OLMoE-1B-7B has 7 billion (B) parameters but uses only 1B per input token. We pretrain it ...


aider (Paul Gauthier) ▷ #general (122 messages🔥🔥):

  • Gemini model updates
  • New RoRF open-source release
  • Claude model expectations
  • Aider installation issues
  • Prompt caching functions

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (62 messages🔥🔥):

  • Aider File Operations
  • Using Models in Aider
  • Upgrading Aider
  • HuggingChat Models
  • Aider Usage Tutorials

Links mentioned:


aider (Paul Gauthier) â–· #links (5 messages):

  • Agentic behavior
  • OpenRouter integration
  • Gemini model updates

Links mentioned:


OpenAI â–· #annnouncements (1 messages):

  • Advanced Voice rollout
  • Custom Instructions update
  • Improved Accents
  • New Voices Feature
  • Multilingual Capabilities

Link mentioned: Tweet from OpenAI (@OpenAI): Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week. While you’ve been patiently waiting, we’ve added Custom Instructions, Memory, five new voices,...


OpenAI ▷ #ai-discussions (108 messages🔥🔥):

  • Advanced Voice Mode
  • Voice Generation Performance
  • Voice Assistant Competition
  • Roleplaying AI Services
  • GPU Server Rentals

Link mentioned: Tweet from OpenAI (@OpenAI): Meet the five new voices.


OpenAI â–· #gpt-4-discussions (5 messages):

  • Voice Function in GPT
  • Calling GPTs

OpenAI ▷ #prompt-engineering (21 messages🔥):

  • Prompt engineering for API
  • Structured output for JSON
  • Generating Minecraft questions
  • Hallucination issues
  • API usage queries

OpenAI ▷ #api-discussions (21 messages🔥):

  • JSON formatting challenges
  • Prompt engineering for AI
  • Minecraft question generation
  • API usage insights
  • Avoiding hallucinations in responses

GPU MODE â–· #general (7 messages):

  • Old server icon returns
  • Community reflections
  • Scam links present
  • Tool discussions

GPU MODE â–· #triton (1 messages):

mobicham: Moving this conversation to the hqq channel


GPU MODE ▷ #torch (30 messages🔥):

  • CUDA Caching Allocator
  • Triton Kernel Support
  • Segment Anything Model 2 (SAM2)
  • Debugging Torch Distributed Training
  • Imitation Learning with SAM2-fast

Links mentioned:


GPU MODE â–· #announcements (1 messages):

  • GPU MODE
  • CUDA MODE Origins
  • IRL Meetup Success
  • Community Growth
  • Future of GPU Programming

GPU MODE â–· #cool-links (4 messages):

  • CUDA Programming Course
  • Nvidia GPUs
  • High-Performance Computing

Link mentioned: CUDA Programming Course – High-Performance Computing with GPUs: Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning. Code:💻 https://github.com/Infatoshi/cuda-course💻 h...


GPU MODE â–· #jobs (1 messages):

  • Luma job opening
  • Performance optimization roles
  • Dream Machine product
  • Luma's research team

GPU MODE ▷ #beginner (8 messages🔥):

  • Cutlass example discussion
  • Porting CUDA to Python
  • Custom ops for PyTorch

Links mentioned:


GPU MODE ▷ #torchao (10 messages🔥):

  • Slice operation for uintx
  • Padding discussion for uintxTensor
  • Test example for uintx slicing
  • Divisibility requirement for tensors

GPU MODE ▷ #off-topic (18 messages🔥):

  • CUDA Mode vs GPU Mode
  • Heterogenous Computing Discussions
  • Segment Anything Model 2
  • Nickname Proposals for GPU Mode
  • Mascots for GPU Mode

Link mentioned: GitHub - facebookresearch/segment-anything-2: The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.: The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th...


GPU MODE â–· #triton-puzzles (2 messages):

  • Triton Puzzles
  • PMP Book and Machine Learning

GPU MODE â–· #hqq-mobius (5 messages):

  • 3-bit attention kernel
  • Performance drop-off below 4 bits
  • Quantization in Attention Modules
  • GemLite for HQQ backend

GPU MODE ▷ #llmdotc (9 messages🔥):

  • repkv Kernel Integration
  • RoPE Implementation Details
  • Testing Framework Enhancements
  • RMSNorm Updates
  • SwigLU Modifications

GPU MODE â–· #rocm (7 messages):

  • Training with xformers
  • CK GroupedGemm usage
  • CUDA and Metal in VSCode
  • Llama3 tuning journey

Link mentioned: Tune Llama3 405B on AMD MI300x (our journey) - Felafax Blog - Obsidian Publish: Tune Llama3 405B on AMD MI300x (our journey) - Felafax Blog - Powered by Obsidian Publish.


GPU MODE â–· #intel (1 messages):

  • Cutlass Sycl Fork

Link mentioned: GitHub - codeplaysoftware/cutlass-fork: CUDA Templates for Linear Algebra Subroutines: CUDA Templates for Linear Algebra Subroutines. Contribute to codeplaysoftware/cutlass-fork development by creating an account on GitHub.


GPU MODE ▷ #bitnet (28 messages🔥):

  • Scaling on 4090 GPUs
  • Distributed Training Optimization
  • Peer-to-Peer GPU Communication

Links mentioned:


GPU MODE ▷ #webgpu (11 messages🔥):

  • WebNN Channel Discussion
  • WebNN Integration with WebGPU and WASM
  • Event Invitation
  • NPU API Interfaces

GPU MODE â–· #liger-kernel (1 messages):

0x000ff4: do we have regular meetings about liger-kernel


GPU MODE â–· #metal (2 messages):

  • GPU Memory Bandwidth
  • Puzzle Completion

Interconnects (Nathan Lambert) ▷ #news (87 messages🔥🔥):

  • OpenAI's o1 models
  • Anthropic's funding talks
  • Stability AI board announcement
  • Gemini model updates
  • Scale AI financials

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (16 messages🔥):

  • Release Chronicle
  • Interconnects Artifacts
  • Late Fusion Visual LMs
  • GPT-4 Performance

Links mentioned:


Interconnects (Nathan Lambert) ▷ #random (26 messages🔥):

  • New Subdomain Announcement
  • Introduction to UV for Python
  • Docker and UV Integration
  • Cronjob for UV Updates
  • PyCharm Compatibility Issues

Links mentioned:


Nous Research AI ▷ #general (96 messages🔥🔥):

  • O1 Planning Capabilities
  • World Simulator API Usage
  • Hermes & Monad Dynamics
  • Recent AI Model Upgrades
  • Nous Research and Merchandise

Links mentioned:


Nous Research AI ▷ #ask-about-llms (25 messages🔥):

  • llama.cpp performance
  • Scaling LLMs
  • GPT-2 pre-training
  • Sample packing
  • Tokenizers

Nous Research AI â–· #research-papers (2 messages):

  • DisTrO Resource Management
  • RENDER Network Testing

Nous Research AI â–· #interesting-links (1 messages):

ar02293: hey www.keygunz.com go test


Nous Research AI â–· #research-papers (2 messages):

  • DisTrO functionality
  • RENDER network

Unsloth AI (Daniel Han) ▷ #general (100 messages🔥🔥):

  • Qwen 2.5 Model Issues
  • Unsloth Trainer Memory Management
  • Fine-tuning Models
  • Using Lora with Ollama
  • Deployment Options for Models

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (22 messages🔥):

  • Llama Model Quantization
  • Token Addition in Llama
  • Model Performance Insights
  • Feedback Mechanisms for Model Improvement
  • Memory Management in PyTorch

Perplexity AI ▷ #general (90 messages🔥🔥):

  • New Anthropic Model Release
  • Perplexity Pro Features
  • Merlin Extension
  • User Experiences with Perplexity
  • Query Limits Discussion

Link mentioned: Tweet from Rowan Cheung (@rowancheung): I just finished up an exclusive interview going over a new, major AI model upgrade. Can confirm, tomorrow will be a big day for developers. Dropping the full conversation on X the second the embargo...


Perplexity AI ▷ #sharing (10 messages🔥):

  • Cosmic Ambassador
  • Superintelligence Age
  • Perplexity AI Differences
  • AI Impact on Education
  • OpenAI Reasoning Probes

Perplexity AI ▷ #pplx-api (9 messages🔥):

  • Citational Access Requests
  • API Rate Limits
  • Output Consistency
  • Alternatives to PPLX
  • Exa.ai Exploration

LM Studio ▷ #general (72 messages🔥🔥):

  • LM Studio Installation on Air-Gapped Machines
  • Model Support in LM Studio
  • Model Performance and Handling
  • LongWriter Model Insights
  • Upcoming Model Releases

Links mentioned:


LM Studio ▷ #hardware-discussion (25 messages🔥):

  • ROCm on AMD APU
  • Dual GPU Setup Compatibility
  • Performance of RTX 3090
  • Price Differences in GPU Markets
  • Consumer Protections in EU

Link mentioned: Income Taxes GIF - Income Tax Tax Taxes - Discover & Share GIFs: Click to view the GIF


Modular (Mojo 🔥) ▷ #general (64 messages🔥🔥):

  • Mojo Language Tier List
  • Compilation Issues with Rust
  • NixOS and Package Management
  • MLIR vs LLVM
  • Mojo's Growth and Community

Modular (Mojo 🔥) ▷ #mojo (31 messages🔥):

  • Mojo classes vs Python classes
  • Monkey patching in Mojo
  • Future of pattern matching in Mojo
  • Communication speed between Mojo and C vs Python
  • Metaclasses and advanced features in Mojo

Links mentioned:


DSPy â–· #announcements (2 messages):

  • DSPy 2.5.0 release
  • Migration to LiteLLM
  • Deprecation of pre-2.4 LM clients
  • Feedback solicitation
  • Upcoming changes

Links mentioned:


DSPy â–· #show-and-tell (2 messages):

  • DSPy powered AI code assistant
  • Live coding session

DSPy â–· #papers (1 messages):

  • Chain-of-thought (CoT)
  • Performance Benefits of CoT
  • Quantitative Meta-Analysis of CoT

Link mentioned: To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning: Chain-of-thought (CoT) via prompting is the de facto method for eliciting reasoning capabilities from large language models (LLMs). But for what kinds of tasks is this extra ``thinking'' reall...


DSPy ▷ #general (84 messages🔥🔥):

  • DSPy 2.5.0 Release
  • Feedback on Meetings
  • Custom Adapters for LLM
  • Multimodal Capabilities
  • Cache Control in DSPy

Links mentioned:


DSPy â–· #examples (2 messages):

  • GROQ API Integration
  • Chain of Thought Evaluation

LLM Agents (Berkeley MOOC) â–· #mooc-announcements (1 messages):

  • Lecture 3
  • Agentic AI Frameworks
  • AutoGen
  • Multimodal Knowledge Assistant

Link mentioned: CS 194/294-196 (LLM Agents) - Lecture 3, Chi Wang and Jerry Liu: no description found


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (33 messages🔥):

  • Course Attendance Issues
  • Guest Speaker Requests
  • Quiz Links
  • Open Embedding Models
  • Applications of AutoGen

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (23 messages🔥):

  • Q&A Discussion
  • Technical Setup Delays
  • Zinley Project Mentioned
  • AutoGen Code Details
  • Chess Against AlphaGo

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (33 messages🔥):

  • Search/Retrieval Techniques
  • AutoGen vs. CrewAI
  • O1 API Usage
  • Multi-Agent Collaboration
  • Advanced DSL for Agents

Latent Space ▷ #ai-general-chat (74 messages🔥🔥):

  • Letta AI
  • Gemini Model Updates
  • Voice Feature Rollout
  • Customer Service Agent Experimentation
  • HuggingChat App Launch

Links mentioned:


Cohere ▷ #discussions (29 messages🔥):

  • Cohere AI
  • Aya initiative
  • Job anxiety
  • Testing hypotheses
  • Chain of Thought (COT)

Link mentioned: Aya: Cohere’s non-profit research lab, C4AI, released the Aya model, a state-of-the-art, open source, massively multilingual, research LLM covering 101 languages – including more than 50 previously underse...


Cohere ▷ #questions (8 messages🔥):

  • Server Locations
  • Using Single Step Tools with Javascript

Cohere â–· #api-discussions (5 messages):

  • Multilingual Reranker Issues
  • Embedding Model Selection
  • Reranker Best Practices

Cohere â–· #projects (2 messages):

  • Self-promotion concerns
  • Cohere in embedded systems

Cohere â–· #cohere-toolkit (1 messages):

  • Cohere Toolkit updates
  • Chat features
  • File type support
  • User feedback
  • Team collaboration

Link mentioned: Cohere Toolkit demo 09.2024: no description found


Stability.ai (Stable Diffusion) â–· #announcements (1 messages):

  • James Cameron
  • Stability AI Board of Directors
  • Transforming visual media
  • Generative AI
  • Cinematic technology

Link mentioned: James Cameron, Academy Award-Winning Filmmaker, Joins Stability AI Board of Directors — Stability AI: Today we announced that legendary filmmaker, technology innovator, and visual effects pioneer James Cameron has joined our Board of Directors.


Stability.ai (Stable Diffusion) ▷ #general-chat (41 messages🔥):

  • FNAF Loras creation
  • SDXL performance with GPU
  • Prompt engineering strategies
  • ControlNet applications
  • OpenPose editor integration

Link mentioned: Insideout Joy GIF - InsideOut Joy Hi - Discover & Share GIFs: Click to view the GIF


LlamaIndex â–· #announcements (1 messages):

  • LlamaParse
  • Fraudulent Sites

LlamaIndex â–· #blog (5 messages):

  • LitServe framework
  • AI product manager
  • LlamaIndex workflows workshop
  • Llamaparse fraudulent site
  • AWS Gen AI Loft

Link mentioned: LlamaCloud: no description found


LlamaIndex ▷ #general (35 messages🔥):

  • Approximate Metadata Filtering
  • Human-in-the-Loop Workflows
  • Postgres and pgvector
  • Web Crawling for Embedding

Links mentioned:


LAION ▷ #general (14 messages🔥):

  • Blendtain feedback
  • Playlist generator by dykyi_vladk
  • Study Machine Learning together
  • Impressions of GANs, CNNs, and ViTs

Link mentioned: Adify: no description found


LAION ▷ #research (12 messages🔥):

  • muP Transfer
  • HyperCloning Method
  • SDXL Unet
  • Positional Encoding in UNet
  • Sliding Window Attention

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (15 messages🔥):

  • Nvidia's new synthetic data model
  • MMLU performance
  • Fine-tuning and inferencing challenges
  • Context management in LLMs
  • Run Pod CUDA error

OpenAccess AI Collective (axolotl) â–· #axolotl-dev (3 messages):

  • Qwen 2.5
  • Axolotl support

OpenAccess AI Collective (axolotl) â–· #general-help (4 messages):

  • Fine-tuning spikes
  • Rope scaling
  • Llama3.1 setup
  • Qwen2.5 configurations

OpenAccess AI Collective (axolotl) â–· #axolotl-help-bot (2 messages):

  • Qwen 2.5
  • Axolotl Support

LangChain AI ▷ #general (17 messages🔥):

  • LangChain Pydantic Compatibility
  • GraphRecursionError in LangGraph
  • LLM Friendly Documentation for LangChain
  • Comparison between Mistral and Mixtral

Links mentioned:


Torchtune ▷ #dev (10 messages🔥):

  • CPU Offloading for Optimizers
  • Performance Optimization Techniques
  • Paged Adam vs Torchao CPUOffloadOptimizer

Links mentioned:


tinygrad (George Hotz) â–· #general (2 messages):

  • Distributed Training
  • Planetary Brain Concept
  • DisTrO Project

Link mentioned: GitHub - NousResearch/DisTrO: Distributed Training Over-The-Internet: Distributed Training Over-The-Internet. Contribute to NousResearch/DisTrO development by creating an account on GitHub.


tinygrad (George Hotz) â–· #learn-tinygrad (7 messages):

  • AttributeError in Tensor
  • Tinygrad version issues
  • Model architecture insights

OpenInterpreter ▷ #general (9 messages🔥):

  • Open Interpreter Updates
  • LLM based Browser Automation
  • Community Engagement
  • GitHub Resources
  • Feedback on Project Activity

Links mentioned:








{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}