Frozen AI News archive

OpenAI beats Anthropic to releasing Speculative Decoding

**Prompt lookup** and **Speculative Decoding** techniques are gaining traction with implementations from **Cursor**, **Fireworks**, and teased features from **Anthropic**. **OpenAI** has introduced faster response times and file edits with these methods, offering about **50%** efficiency improvements. The community is actively exploring AI engineering use cases with these advancements. Recent updates highlight progress from companies like **NVIDIA**, **OpenAI**, **Anthropic**, **Microsoft**, **Boston Dynamics**, and **Meta**. Key technical insights include CPU inference capabilities, multimodal retrieval-augmented generation (RAG), and neural network fundamentals. New AI products include fully AI-generated games and advanced content generation tools. Challenges in AI research labs such as bureaucracy and resource allocation were also discussed, alongside AI safety and governance concerns.

AI News for 11/1/2024-11/4/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (216 channels, and 7073 messages) for you. Estimated reading time saved (at 200wpm): 766 minutes. You can now tag @smol_ai for AINews discussions!

Ever since the original Speculative Decoding paper (and variants like Hydra and Medusa), the community has been in a race to deploy it. In May, Cursor and Fireworks announced their >1000tok/s fast apply model (our coverage here, Fireworks technical post here). Note: this issue initially went out with a factual error stating that the speculative decoding API was not released; as the Fireworks post explains, Fireworks released a spec decoding API 5 months ago. In August, Zed teased Anthropic's new Fast Edit mode. But Anthropic's API was not released... leaving room for OpenAI to come in swinging:

image.png

Factory AI reports much faster response times and file edits:

image.png

What this extra processing of draft tokens will cost is a bit more vague (subject to a 32 token match), but you can take ~50% as a nice rule of thumb.

image.png

As we analyze in this week's Latent.Space post, this slots nicely into the wealth of options that have developed to match AI Engineer use cases:

image.png
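
For readers new to the idea, here is a minimal toy sketch of the two steps behind prompt lookup and speculative decoding: propose draft tokens cheaply, then let the target model accept the longest prefix it agrees with. This is not the implementation used by OpenAI, Fireworks, Cursor, or Anthropic. The function names `prompt_lookup_draft` and `verify_draft` are hypothetical, the `next_token` callable is an assumed stand-in for a greedy target model, and real systems verify all draft tokens in one batched forward pass rather than one call per token.

```python
from typing import Callable, List, Sequence

Token = str


def prompt_lookup_draft(
    context: Sequence[Token], ngram: int = 3, max_draft: int = 8
) -> List[Token]:
    """Propose draft tokens by copying what followed an earlier occurrence
    of the trailing n-gram in the context (hypothetical helper)."""
    if len(context) < ngram:
        return []
    tail = list(context[-ngram:])
    # Search backwards, skipping the trailing n-gram itself.
    for start in range(len(context) - ngram - 1, -1, -1):
        if list(context[start:start + ngram]) == tail:
            follow = start + ngram
            return list(context[follow:follow + max_draft])
    return []


def verify_draft(
    context: Sequence[Token],
    draft: Sequence[Token],
    next_token: Callable[[Sequence[Token]], Token],
) -> List[Token]:
    """Greedy verification: keep draft tokens while they match the target
    model's own choice; on the first mismatch, emit the target's token."""
    accepted: List[Token] = []
    for tok in draft:
        expected = next_token(list(context) + accepted)
        if tok != expected:
            accepted.append(expected)  # correction replaces the bad draft token
            return accepted
        accepted.append(tok)
    # Every draft token matched, so the target contributes one extra token.
    accepted.append(next_token(list(context) + accepted))
    return accepted
```

The output is always identical to what the target model would have produced on its own; the draft only changes how many tokens can be confirmed per step. That is why file edits, where most of the output repeats the input, are a natural fit.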


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

All recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Technology and Industry Updates

Industry Commentary & Culture

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Hertz-Dev: First Open-Source Real-Time Audio Model with 120ms Latency

Theme 2. Voice Cloning Advances: F5-TTS vs RVC vs XTTS2

Theme 3. Token Management and Model Optimization Techniques

Theme 4. MG²: New Melody-First Music Generation Architecture

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Performance and Benchmarks

AI Security and Infrastructure

AI Image Generation Progress

AI Ethics and Society

Memes and Humor


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1. Major LLM Releases and Model Updates

Theme 2. AI Advancements in Security and Medicine

Theme 3. LLM Fine-Tuning and Performance Struggles

Theme 4. AI Takes on Gaming and Creative Endeavors

Theme 5. AI Ethics, Censorship, and the Community's Voice


PART 1: High level Discord summaries

HuggingFace Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Nous Research AI Discord


Perplexity AI Discord


Eleuther Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Notebook LM Discord


GPU MODE Discord


Latent Space Discord


Stability.ai (Stable Diffusion) Discord


Modular (Mojo 🔥) Discord


Interconnects (Nathan Lambert) Discord


LlamaIndex Discord


OpenInterpreter Discord


DSPy Discord


tinygrad (George Hotz) Discord


OpenAccess AI Collective (axolotl) Discord


LAION Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Torchtune Discord


Alignment Lab AI Discord


MLOps @Chipro Discord


Gorilla LLM (Berkeley Function Calling) Discord


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #general (955 messages🔥🔥🔥):

  • Discussion on AI tools for PDF processing
  • Experiences with various models and datasets
  • Thoughts on politics and current events
  • Challenges in fine-tuning LLMs
  • Building custom applications for specific use cases

HuggingFace ▷ #today-im-learning (8 messages🔥):

  • Neuralink code performance
  • Cyber Security AI Models
  • Transformers Image Preprocessor Bug

HuggingFace ▷ #cool-finds (8 messages🔥):

  • Quantum Metrology
  • AI Memory Chips
  • Medical AI Research
  • Open Trusted Data Initiative
  • ShellCheck Tool

HuggingFace ▷ #i-made-this (31 messages🔥):

  • AI-Generated Courses
  • Huberman Answers Bot
  • PubMed Scraper
  • New VividNode Release
  • ROS2 Docker Environment

HuggingFace ▷ #reading-group (11 messages🔥):

  • Automated Penetration Testing Benchmark
  • Recording Paper Reading Sessions
  • Structure of Paper Reading Sessions
  • New Member Orientation

Link mentioned: Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements: Hacking poses a significant threat to cybersecurity, inflicting billions of dollars in damages annually. To mitigate these risks, ethical hacking, or penetration testing, is employed to identify vulne...


HuggingFace ▷ #computer-vision (1 message):

  • MAE finetuning
  • Deprecation issues

HuggingFace ▷ #NLP (3 messages):

  • Reading Resources
  • Paper Reading Methods

HuggingFace ▷ #diffusion-discussions (2 messages):

  • Text Embeddings
  • Sana Performance
  • LLMs Comparison

Unsloth AI (Daniel Han) ▷ #general (742 messages🔥🔥🔥):

  • Unsloth AI Model Fine-tuning
  • Python Performance Improvements
  • Model Quantization and Inference
  • Web Development Frameworks for LLM
  • Git Practices and Requirements Management

Unsloth AI (Daniel Han) ▷ #off-topic (5 messages):

  • Unsloth project
  • Product Hunt discussions

Link mentioned: Contri.buzz - Celebrate your open source contributors | Product Hunt: Showcase a visually appealing Contributors' wall in your GitHub README.md / Website / App


Unsloth AI (Daniel Han) ▷ #help (182 messages🔥🔥):

  • Fine-tuning strategies
  • Model performance issues
  • Prompt formats
  • Dataset formatting
  • Inference behavior

Unsloth AI (Daniel Han) ▷ #showcase (1 message):

theyruinedelise: Just saw this congrats


Unsloth AI (Daniel Han) ▷ #community-collaboration (2 messages):

  • Callback for mtbench
  • Decision on implementation
  • Hugging Face evaluate metrics

OpenRouter (Alex Atallah) ▷ #announcements (1 message):

  • Claude 3.5 Haiku
  • Free Llama 3.2
  • PDF functionality
  • Increased Credit Limit

OpenRouter (Alex Atallah) ▷ #app-showcase (7 messages):

  • vnc-lm Discord Bot
  • Freebie Alert Chrome Extension
  • OpenRouter API integration
  • Scraping and tool calling

OpenRouter (Alex Atallah) ▷ #general (663 messages🔥🔥🔥):

  • Hermes 405b issues
  • API rate limits
  • Pricing changes for Haiku
  • Lambda API integration
  • OpenRouter features

OpenRouter (Alex Atallah) ▷ #beta-feedback (16 messages🔥):

  • Beta access requests
  • Integrations feature
  • Custom provider keys

LM Studio ▷ #general (166 messages🔥🔥):

  • LM Studio Features
  • Embedding Models
  • Mixed GPU Configurations
  • Model Output Structuring
  • Python for LLM Usage

LM Studio ▷ #hardware-discussion (396 messages🔥🔥):

  • LM Studio Performance
  • Hardware Recommendations for LLMs
  • Context Management in LLM Inference
  • High-Performance Computing with AMD CPUs
  • System Monitoring Tools for Hardware Performance

Nous Research AI ▷ #general (438 messages🔥🔥🔥):

  • Hermes 405b Model Performance
  • Data Sources for Hermes Models
  • Lambda API Issues
  • Future of Nous Research Models
  • Hyperparameter Optimization for Smaller Models

Nous Research AI ▷ #ask-about-llms (22 messages🔥):

  • Jailbreaking LLMs
  • AI Search Engine Performance
  • AI Energy Usage and Hardware Inefficiency

Nous Research AI ▷ #research-papers (2 messages):

  • MDAgents
  • Matchmaker
  • UltraMedical
  • AI in Healthcare Ethics
  • Clinical Evidence Synthesis

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): Last Week in Medical AI: Top Research Papers/Models 🏅 (October 26 - November 2, 2024) 🏅 Medical AI Paper of the Week: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making by Aut...


Nous Research AI ▷ #research-papers (2 messages):

  • MDAgents
  • Matchmaker Model
  • FEDKIM Framework
  • DiaMond Application
  • AI in Healthcare Ethics

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): Last Week in Medical AI: Top Research Papers/Models 🏅 (October 26 - November 2, 2024) 🏅 Medical AI Paper of the Week: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making by Aut...


Perplexity AI ▷ #general (329 messages🔥🔥):

  • Perplexity AI Models
  • Claude 3.5 Haiku
  • Model Switching
  • New Subscription Pricing
  • User Experience with Perplexity

Perplexity AI ▷ #sharing (26 messages🔥):

  • AI that can smell
  • SearchGPT
  • Chinese Military's Llama AI Model
  • Affordable cars in India
  • Packing tips

Perplexity AI ▷ #pplx-api (7 messages):

  • Home appliance review analysis
  • Citations feature in the API
  • Discord bot development
  • Support contact for API issues

Eleuther ▷ #general (15 messages🔥):

  • Attention Mechanisms
  • Generative AI Learning
  • Understanding ODE/SDE for Diffusion Models

Link mentioned: transformers/src/transformers/models/llama/modeling_llama.py at 65753d6065e4d6e79199c923494edbf0d6248fb1 · huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - huggingface/transformers


Eleuther ▷ #research (323 messages🔥🔥):

  • Gradient Dualization
  • Mathematical Foundations of ML
  • Reading Group Formats
  • Optimizer Configurations
  • Scaling in Neural Networks

Eleuther ▷ #scaling-laws (2 messages):

  • Adafactor Schedule Changes
  • Hyperparameter Tuning Insights

Eleuther ▷ #interpretability-general (1 message):

  • w2s Model Files
  • GitHub Repository

Link mentioned: GitHub - EleutherAI/w2s: Contribute to EleutherAI/w2s development by creating an account on GitHub.


Eleuther ▷ #lm-thunderdome (8 messages🔥):

  • GPT-NeoX Experimentation
  • Evaluating Tasks with Harness
  • Modifying Evaluation Metrics

Link mentioned: lm-evaluation-harness/lm_eval/filters/selection.py at c0745fec3062328e0ab618f36334848cdf29900e · EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


Eleuther ▷ #multimodal-general (11 messages🔥):

  • DINO's ImageNet Pretraining
  • Jordan's Anime Art Thoughts

Eleuther ▷ #gpt-neox-dev (1 message):

  • GPT-NeoX Demo
  • Colab Notebooks

Link mentioned: GPT-NeoX-Colab/notebooks/shakespeare_training.ipynb at main · markNZed/GPT-NeoX-Colab: Example Colab notebooks for GPT-NeoX. Contribute to markNZed/GPT-NeoX-Colab development by creating an account on GitHub.


aider (Paul Gauthier) ▷ #announcements (2 messages):

  • Claude 3.5 Haiku performance
  • Aider v0.62.0 features
  • Code editing leaderboard
  • File edits application
  • Bugfixes in Aider

Link mentioned: Aider LLM Leaderboards: Quantitative benchmarks of LLM code editing skill.


aider (Paul Gauthier) ▷ #general (240 messages🔥🔥):

  • Aider Features and Updates
  • AI Model Comparisons
  • Benchmark Performance
  • Token Usage and Management
  • Command Line Instructions

aider (Paul Gauthier) ▷ #questions-and-tips (80 messages🔥🔥):

  • Prompt Caching in Aider
  • Issues with Aider Installation
  • Configuration Management in Aider
  • LLM Token Limit Discrepancies
  • Using Multiple Configuration Files

aider (Paul Gauthier) ▷ #links (4 messages):

  • OpenAI o1 anticipation
  • IBM Granite coding models
  • Granite performance evaluation
  • YouTube discussion

OpenAI ▷ #ai-discussions (184 messages🔥🔥):

  • Potential AGI Capabilities
  • Dependency on Technology
  • AI Relationships
  • OpenAI's Future Developments
  • AI Versus Human Interaction

OpenAI ▷ #gpt-4-discussions (14 messages🔥):

  • Middleware Products
  • Channel Naming Preferences
  • O1 Observations
  • GPT-5 Announcement
  • Discord Search Functionality

OpenAI ▷ #prompt-engineering (63 messages🔥🔥):

  • Best practices for code documentation
  • Tools for writing good prompts
  • Measuring user interaction with LLMs
  • Planning with LLMs
  • Using LLMs in production

OpenAI ▷ #api-discussions (63 messages🔥🔥):

  • Best Practices for Code Block Input
  • Prompt Measurement Tools
  • Sentiment Analysis for User Frustration
  • Using LLMs for Automated Planning
  • Benefits of Custom Prompts

Notebook LM Discord ▷ #use-cases (49 messages🔥):

  • NotebookLM Language Configuration
  • Improving Podcast Engagement
  • Use Cases for Special Needs
  • Fractal Grid Concept Discussion
  • Copyright Issues with Podcast Distribution

Notebook LM Discord ▷ #general (204 messages🔥🔥):

  • Podcast Quality Issues
  • Using PDFs as Sources
  • Language Support in NotebookLM
  • Audio Overviews and Summaries
  • API Development Plans

GPU MODE ▷ #general (18 messages🔥):

  • Low-bit Triton Kernels
  • CUDA Matrix Multiplication Optimization
  • Custom CUDA Kernels vs PyTorch
  • PyTorch H100 Optimizations
  • Flash Attention Package

GPU MODE ▷ #triton (100 messages🔥🔥):

  • Triton Kernel Optimization
  • Fused Swiglu Performance
  • FP8 Quantization Techniques
  • Dynamic Activation Scaling
  • Caching for Triton Kernels

GPU MODE ▷ #torch (7 messages):

  • Hardware/System-Agnostic Reproducibility
  • Torch Compile Overhead
  • Determinism Across Different GPUs

Link mentioned: BC-Breaking Change: torch.load is being flipped to use weights_only=True by default in the nightlies after #137602: TL;DR After warning of this change since version 2.4, #137602 has been merged which will change the default for the weights_only argument of torch.load from False to True in the nightlies (and version...


GPU MODE ▷ #cool-links (2 messages):

  • Anyscale
  • Databricks

GPU MODE ▷ #jobs (1 message):

  • Hiring LLM Infra Engineers
  • AI-based Gaming
  • LLM Training Techniques

GPU MODE ▷ #beginner (18 messages🔥):

  • CUDA Programming Basics
  • Coalesced Memory Access
  • Setting Up NCU for Profiling
  • Introduction to Triton
  • Using torch.compile

GPU MODE ▷ #youtube-recordings (1 message):

  • Low Bit Triton Kernel Performance
  • Gemlite

GPU MODE ▷ #torchao (1 message):

  • TorchAO planning
  • Feature requests
  • User pain points

GPU MODE ▷ #off-topic (2 messages):

  • Noob Questions
  • Unanswered Questions

GPU MODE ▷ #rocm (11 messages🔥):

  • ROCm version recommendations
  • Register spill and bank conflicts
  • Kernel profiling with rocprofv3
  • XOR based permutation complexities
  • Matrix core layout issues

Link mentioned: Using rocprofv3 — ROCprofiler-SDK 0.5.0 Documentation: no description found


GPU MODE ▷ #intel (1 message):

lavawave03: Hehehe


GPU MODE ▷ #arm (25 messages🔥):

  • LLM Inference on ARM CPUs
  • Tensor Parallelism Benefits
  • Distributed CPU Inference Documentation
  • Bitnet Model Constraints
  • Research on Grace CPU and Graviton

Link mentioned: GitHub - pytorch/torchchat: Run PyTorch LLMs locally on servers, desktop and mobile: Run PyTorch LLMs locally on servers, desktop and mobile - pytorch/torchchat


GPU MODE ▷ #liger-kernel (15 messages🔥):

  • Liger Kernel CI Issues
  • Liger Rope Function Failures
  • Custom Triton Kernel Integration

GPU MODE ▷ #self-promotion (4 messages):

  • __CUDA_ARCH__ Macro Usage
  • Undefined Behavior in CUDA
  • TinyGEMM Kernel Issues

GPU MODE ▷ #🍿 (1 message):

  • Discord Cluster Manager Updates

Link mentioned: GitHub - gpu-mode/discord-cluster-manager: Contribute to gpu-mode/discord-cluster-manager development by creating an account on GitHub.


GPU MODE ▷ #thunderkittens (10 messages🔥):

  • vLLM Demos
  • Performance Optimization for Llama 70B
  • Integration with Flash Attention
  • LoLCATS Project
  • Kernel Features Implementation

Link mentioned: using out argument will change the output · Issue #1299 · Dao-AILab/flash-attention: Hi, thanks for the great library! I'm trying to use the out argument of flash_attn_varlen_func and flash_attn_with_kvcache, however, I find that it will lead to incorrent output. Is it expected? W...


Latent Space ▷ #ai-general-chat (70 messages🔥🔥):

  • Claude PDF support
  • AI Search Wars
  • O1 model leak
  • Skills and AI agents

Latent Space ▷ #ai-in-action-club (126 messages🔥🔥):

  • Open Interpreter
  • Voice Input for AI Tools
  • Huly and Height Collaboration Tools
  • Screenpipe Integration
  • Entropix Benchmark Results

Stability.ai (Stable Diffusion) ▷ #general-chat (192 messages🔥🔥):

  • Stable Diffusion 3.5 and ComfyUI
  • Reporting Scam Bots
  • Image Prompting Techniques
  • Model Training and Resources
  • Dynamic Prompts and Errors

Modular (Mojo 🔥) ▷ #general (1 message):

  • Community Meeting Q&A
  • Project Submissions

Link mentioned: Modular Community Q&A: no description found


Modular (Mojo 🔥) ▷ #mojo (112 messages🔥🔥):

  • Mojo Hardware Lowering Support
  • Managing References in Mojo
  • Slab List Implementation
  • Mojo Stability and Nightly Releases
  • Custom Tensor Structures in Neural Networks

Interconnects (Nathan Lambert) ▷ #news (30 messages🔥):

  • OLMo language model
  • OpenAI and Anthropic hiring trends
  • Grok model API release
  • Claude 3.5 Haiku pricing
  • Goodhart's Law in evaluation metrics

Interconnects (Nathan Lambert) ▷ #random (17 messages🔥):

  • AI's impact on families
  • Voice mode context retention
  • AnthropicAI Token Counting API
  • Claude's unique chat template
  • UCSC seminar series

Interconnects (Nathan Lambert) ▷ #memes (7 messages):

  • Claude's Humor Generation
  • YOLOv3 Paper Importance
  • Claude's System Feedback

Interconnects (Nathan Lambert) ▷ #rlhf (17 messages🔥):

  • Model behavior on user insistence
  • Incentives for labs
  • Alignment definitions
  • User preferences for responses
  • Flow of assistant responses

Interconnects (Nathan Lambert) ▷ #reads (15 messages🔥):

  • Search Model Psychology
  • Self-Debugging Chains
  • Repo Replication in OSS
  • Initial Conditions for Models
  • Chinese Military Use of Llama Model

Interconnects (Nathan Lambert) ▷ #posts (2 messages):

  • AI podcasts
  • Conversational format in interviews

LlamaIndex ▷ #blog (7 messages):

  • OilyRAGs
  • Multi-Agent Workflow
  • Local RAG-augmented voice chatbot
  • Report Generation Workflow
  • Create Llama

Link mentioned: GitHub - drunkwcodes/bb7: A TDD coding bot using ollama: A TDD coding bot using ollama. Contribute to drunkwcodes/bb7 development by creating an account on GitHub.


LlamaIndex ▷ #general (45 messages🔥):

  • Custom Agent Creation
  • Discord User Inquiry
  • Cost Estimation for RAG Pipelines
  • LlamaParse Object Parameters
  • RAG Pipeline Visualization

LlamaIndex ▷ #ai-discussion (2 messages):

  • API Testing for Data Analysis
  • LlamaIndex Reader Experience

OpenInterpreter ▷ #general (34 messages🔥):

  • Open Interpreter issues
  • Even Realities G1 glasses
  • Brilliant Frames
  • Anthropic errors
  • Microsoft Omniparser integration

Link mentioned: GitHub - even-realities/EvenDemoApp: Contribute to even-realities/EvenDemoApp development by creating an account on GitHub.


OpenInterpreter ▷ #ai-content (9 messages🔥):

  • Oasis AI Model
  • Claude 3.5 Haiku Release
  • Direct Access to O1 Model
  • Price Increase for Claude
  • OpenInterpreter Updates

DSPy ▷ #show-and-tell (21 messages🔥):

  • Full Vision Support in DSPy
  • VLM Use Case Discussions
  • Docling Document Processing Tool
  • CLI Manufacturing Plant Progress
  • Meeting Replay Request

DSPy ▷ #general (19 messages🔥):

  • Contribution opportunities
  • STORM module modifications
  • Output field manipulation in signatures
  • Few-shot optimization techniques
  • VLM support updates

tinygrad (George Hotz) ▷ #general (17 messages🔥):

  • PR #7477 Tests Fix
  • Company Meeting Updates
  • WebGPU Discussion
  • Code Refactoring Efforts
  • ALU Usage in Tests

tinygrad (George Hotz) ▷ #learn-tinygrad (20 messages🔥):

  • Apache TVM Features
  • Resource Errors in Docker
  • MobileNetV2 Implementation Issues
  • Fake PyTorch Backend Development
  • Oasis AI Model Release

OpenAccess AI Collective (axolotl) ▷ #general (8 messages🔥):

  • Instruct Datasets for Meta Llama 3.1
  • Catastrophic Forgetting in Fine-Tuning
  • Granite 3.0 Model Evaluation
  • Distributed Training of LLMs
  • RAG vs Fine-Tuning Approach

OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (11 messages🔥):

  • Inference script chat format
  • Inference mismatch issue
  • Fine-tuning Llama 3 on GPUs
  • Zero1 vs Zero2 optimization

Link mentioned: Mismatch between training & inference for chat modesl in do_inference · Issue #2014 · axolotl-ai-cloud/axolotl: Please check that this issue hasn't been reported before. I searched previous Bug Reports didn't find any similar reports. Expected Behavior There is mismatch bewteen how chat models are train...


OpenAccess AI Collective (axolotl) ▷ #community-showcase (1 message):

  • Aloe Beta
  • Open healthcare LLMs
  • Axolotl SFT phase

LAION ▷ #general (6 messages):

  • Adding CLS tokens
  • Sharky blob development
  • Downsampling latents
  • Inference API standardization
  • ADs server issues

LAION ▷ #research (6 messages):

  • RAR Image Generator
  • Position Embedding Shuffle
  • NLP Implications

Link mentioned: Randomized Autoregressive Visual Generation: no description found


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 message):

  • OpenAI API Setup
  • Lambda Labs Access

LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 message):

  • Project GR00T
  • Jim Fan
  • NVIDIA robotics
  • Generalist Agents

LLM Agents (Berkeley MOOC) ▷ #mooc-questions (8 messages🔥):

  • LLM Agents Hackathon
  • Study Groups
  • Compute Resources Access
  • Team Formation
  • Course Announcements

LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 message):

  • Polygonia interrogationis
  • Moth identification

Cohere ▷ #discussions (7 messages):

  • Manual Simulation of ChatGPT Searching
  • Future of Browsing and Chatbots
  • Augmented Reality in Information Access

Cohere ▷ #api-discussions (4 messages):

  • Cohere API
  • Channel Purpose
  • Community Engagement

Torchtune ▷ #general (6 messages):

  • Torchtune Weight Gathering
  • Checkpointing Issues
  • Distributed Checkpointing
  • Llama 90B Pull Request

Link mentioned: Llama 3.2 Vision - 90B by felipemello1 · Pull Request #1880 · pytorch/torchtune: Context What is the purpose of this PR? Is it to add a new feature fix a bug update tests and/or documentation other (please add here) This PR adds llama 90B to torchtune. A few issues were f...


Torchtune ▷ #dev (5 messages):

  • Gradient Norm Clipping
  • Duplicate Compile Key in Config
  • ForwardKLLoss Misconception

Alignment Lab AI ▷ #announcements (1 message):

  • Aphrodite Windows Support
  • PagedAttention Implementation
  • High-throughput local AI

Alignment Lab AI ▷ #general (2 messages):

  • LLMs for auto-patching vulnerabilities
  • Self Healing Code blog post
  • Generative AI innovations
  • Podcast discussions

Link mentioned: Self Healing Code – D-Squared: no description found


MLOps @Chipro ▷ #events (1 message):

  • Learning on Graphs Conference
  • Graph Machine Learning
  • Local Meetups

Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 message):

  • Benchmarking function calling
  • Retrieval based approach
  • Function definitions



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}