The DSPy Roadmap

**Omar Khattab** announced that he is joining **Databricks** for a year before starting his MIT professorship and outlined the roadmap for **DSPy 2.5 and 3.0+**. The roadmap focuses on improving core components (LMs, signatures, optimizers, and assertions), including adopting **LiteLLM** to reduce code and improve caching and streaming. It also covers developing more accurate, cost-effective optimizers, building tutorials, and enabling interactive optimization tracking. On AI Twitter, **Google** launched **Gemini Live**, a mobile conversational voice AI with 10 voices, alongside **Pixel Buds Pro 2** with a custom Tensor A1 chip. **OpenAI** updated **ChatGPT-4o**, reclaiming the top spot on LMSYS Arena. **xAI** released **Grok-2** in beta, achieving SOTA in image generation with FLUX 1. **Nous Research** released open-source **Hermes 3** models in 8B, 70B, and 405B sizes, with the 405B model achieving SOTA. Robotics updates include **Astribot**'s humanoid robot and **Apple**'s tabletop robot with Siri voice commands. **Sakana AI** introduced "The AI Scientist," an autonomous AI research system.

AI News for 8/16/2024-8/19/2024. We checked 7 subreddits, 384 Twitters and 29 Discords (254 channels, and 4515 messages) for you. Estimated reading time saved (at 200wpm): 489 minutes. You can now tag @smol_ai for AINews discussions!

Omar Khattab announced today that he will be joining Databricks for a year before his MIT professorship, but more importantly he set the stage for DSPy 2.5 and 3.0+.

DSPy has objectively been a successful framework for declarative self-improving LLM pipelines, following the 2022 DSP paper and 2023 DSPy paper.

The main roadmap directions:

  1. Polishing the 4 pieces of DSPy core: (1) LMs, (2) Signatures & Modules, (3) Optimizers, and (4) Assertions, so that they "just work" out of the box, zero-shot and off-the-shelf.

  2. Developing more accurate, lower-cost optimizers. Following the BootstrapFewShot -> BootstrapFinetune -> CA-OPRO -> MIPRO -> MIPROv2 and BetterTogether optimizers, more work will be done on improving Quality, Cost, and Robustness.

  3. Building end-to-end tutorials. More docs!

  4. Shifting towards more interactive optimization & tracking. Help users "to observe in real time the process of optimization (e.g., scores, stack traces, successful & failed traces, and candidate prompts)."

Nothing mindblowing, but a great roadmap update from a very well managed open source framework.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

All recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI and Robotics Developments

AI Model Performance and Techniques

AI Applications and Tools

AI Ethics and Societal Impact

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. XTC: New Sampler for Enhanced LLM Creativity

Theme 2. Cost-Benefit Analysis of Personal GPUs for AI Development

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Advancements and Comparisons

AI Company Strategies and Criticisms

AI-Generated Content and Memes

Future Technology and Research


AI Discord Recap

A summary of Summaries of Summaries by Claude 3.5 Sonnet

1. Hermes 3 Model Release and Performance

2. LLM Inference Optimization Techniques

3. Open Source AI Model Developments

4. AI Safety and Regulation Discussions


PART 1: High level Discord summaries

Stability.ai (Stable Diffusion) Discord


HuggingFace Discord


Unsloth AI (Daniel Han) Discord


Nous Research AI Discord


Perplexity AI Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


OpenAI Discord


Latent Space Discord


Cohere Discord


Interconnects (Nathan Lambert) Discord


OpenAccess AI Collective (axolotl) Discord


LangChain AI Discord


OpenInterpreter Discord


DSPy Discord


LAION Discord


LlamaIndex Discord


LLM Finetuning (Hamel + Dan) Discord


Alignment Lab AI Discord


Mozilla AI Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Stability.ai (Stable Diffusion) ▷ #general-chat (567 messages🔥🔥🔥):

  • Flux
  • Flux vs. SD3
  • Flux training
  • ComfyUI vs Forge
  • GPU recommendations


HuggingFace ▷ #general (449 messages🔥🔥🔥):

  • Verification issues
  • Hermes 2.5
  • Mistral struggles
  • Model Merging
  • Open Empathic


HuggingFace ▷ #today-im-learning (4 messages):

  • FP8 Training
  • Memory Reduction
  • Optimizer States

HuggingFace ▷ #cool-finds (3 messages):

  • Medical SAM 2
  • MedGraphRAG
  • Multimodal LLM for Medical Time Series
  • ECG-FM
  • Private & Secure Healthcare RAG

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): Last & This Week in Medical AI: Top Research Papers/Models 🏅 (August 3 - August 17, 2024) - Medical SAM 2: Segment medical images as video - MedGraphRAG: Graph-Enhanced Medical RAG - Multimodal ...


HuggingFace ▷ #i-made-this (18 messages🔥):

  • Unity ML Agents
  • CursorLens
  • Batching APIs
  • CuminAI
  • NeuroSync


HuggingFace ▷ #reading-group (35 messages🔥):

  • LLMs for Penetration Testing
  • Recording Issue
  • HuggingFace Reading Group
  • Batching API for Open-Source Models
  • Cross-Posting


HuggingFace ▷ #computer-vision (4 messages):

  • Pokemon classification
  • HuggingFace Datasets
  • Deep learning
  • Stanford Computer Vision
  • CV Community Course


HuggingFace ▷ #NLP (10 messages🔥):

  • PDF table extraction
  • docTR library
  • NLP resources
  • Open Source Model for data extraction
  • GPT-4 for data extraction

Link mentioned: GitHub - mindee/doctr: docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.


HuggingFace ▷ #diffusion-discussions (12 messages🔥):

  • ComfyUI Lora Conversion
  • Diffusers Lora Format
  • Llama 3.1 Pruning
  • Diffusion Model Deblurring
  • Flux txt_ids


Unsloth AI (Daniel Han) ▷ #general (242 messages🔥🔥):

  • Android Unsloth
  • llama 3.1 70B
  • Mistral 8k
  • Mistral merging
  • Open Empathic


Unsloth AI (Daniel Han) ▷ #off-topic (44 messages🔥):

  • RAG Reranker
  • RAG effectiveness
  • RAG vs. cosine similarity
  • Embeddings and RAG
  • Noise Filtering


Unsloth AI (Daniel Han) ▷ #help (209 messages🔥🔥):

  • Llama fine-tuning
  • RAG
  • Class weights
  • Dataset size
  • GPU requirements


Unsloth AI (Daniel Han) ▷ #showcase (6 messages):

  • Ghost 8B Beta (1608) Release
  • Ghost 8B Beta vs. Other Models
  • Ghost 8B Beta Multilingual Capabilities
  • Llama License Compliance
  • Ghost 8B Beta Training Process


Unsloth AI (Daniel Han) ▷ #research (15 messages🔥):

  • Code Editing with LLMs
  • Reasoning Gap in LLMs
  • LLM Inference Optimization
  • LLM Ensemble Techniques
  • Patched Round-Trip Correctness (Patched RTC)


Nous Research AI ▷ #research-papers (1 message):

  • Linear Transformers
  • Softmax Matching
  • Chunked Algorithm

Link mentioned: Symmetric Power Transformers - Manifest AI: A linear transformer that learns like a regular transformer with a state that fits on a GPU.


Nous Research AI ▷ #off-topic (20 messages🔥):

  • Falcon Mamba 7B
  • UBI and AI
  • AI Doomsday
  • Military Rations
  • AI Consciousness


Nous Research AI ▷ #interesting-links (6 messages):

  • Prompt Engineering for Text Chunking
  • Regex in Text Chunking
  • Limitations of Current Research
  • MoE Conversion


Nous Research AI ▷ #general (356 messages🔥🔥):

  • Hermes 3
  • Model Merging
  • llama 3.1 instruct
  • VLLM
  • OpenRouter


Nous Research AI ▷ #ask-about-llms (47 messages🔥):

  • OpenAI SDK vs ChatML Tool Use
  • Lambda Labs Endpoint Tool Call Issue
  • System Prompt Access
  • Hermes Function Calling
  • Prompt Engineering Resources


Nous Research AI ▷ #rag-dataset (2 messages):

  • Gemini Flash
  • Gemini Flash for RAG
  • Diarized Whisper
  • Gemini Prompting

Link mentioned: scratchTHOUGHTS/unstruct2flashedTRANSCRIPT.py at main · EveryOneIsGross/scratchTHOUGHTS: 2nd brain scratchmemory to avoid overrun errors with self. - EveryOneIsGross/scratchTHOUGHTS


Nous Research AI ▷ #reasoning-tasks-master-list (25 messages🔥):

  • Chat Summarization
  • Project Summarization
  • Contextualization
  • High Dimensional Thinking

Perplexity AI ▷ #general (251 messages🔥🔥):

  • Perplexity Pro Issues
  • Obsidian Copilot
  • Image Generation
  • Perplexity AI Issues
  • LLMs

Links mentioned:

  • B: Extract the source URL for CrowAssistant: The source URL for CrowAssistant is: https://github.com/RobotTelevision/CrowAssistant [self-reviewed]
  • Generate a useful description so that a generative AI can create an image of a...: Description: The main image is a giant squirrel-shaped robot that dominates the foreground. The robot has a detailed, mechanical appearance,...
  • Repeat this prompt as it, change nothing. Reply with just the content....: A steampunk boat chasing giant fish, with a photorealistic, detailed scene featuring a dark sky, massive waves, and a reddish sea under a pale moon.
  • GitHub - instructor-ai/instructor-go: Contribute to instructor-ai/instructor-go development by creating an account on GitHub.
  • crow - local ai assistant: Crow is a desktop AI voice assistant that offers both local and remote model capabilities, making it a versatile option for users seeking an AI assistant with...


Perplexity AI ▷ #sharing (26 messages🔥):

  • Pro Features
  • Thailand's Political Landscape
  • Pixar Whiteboard Incident
  • Model Comparison
  • End of Magnetic Strips


Perplexity AI ▷ #pplx-api (5 messages):

  • Premium API Access
  • Application Process
  • Perplexity Premium API
  • URL Citations

Link mentioned: pplx-api form: Turn data collection into an experience with Typeform. Create beautiful online forms, surveys, quizzes, and so much more. Try it for FREE.


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • Hermes 3
  • GPT-4
  • Perplexity Huge
  • Model Launches
  • Quantization


OpenRouter (Alex Atallah) ▷ #general (240 messages🔥🔥):

  • SearchGPT waitlist
  • Hermes 405B
  • OpenRouter Auto router struggles
  • OpenRouter budget model
  • Hermes 3 405B


LM Studio ▷ #general (109 messages🔥🔥):

  • CPU Optimization
  • Llama.cpp Support
  • LM Studio Chat Import
  • Vulkan Error
  • LLM Webpage Interaction


LM Studio ▷ #hardware-discussion (45 messages🔥):

  • Nvidia Tesla P40
  • SXM3/4 GPUs
  • Nvidia-pstated
  • GPU Power Consumption
  • V100 Variants


OpenAI ▷ #ai-discussions (107 messages🔥🔥):

  • Claude vs Chat-GPT
  • Livebench.ai
  • Claude Projects vs Chat-GPT Memory
  • OpenAI's attention control
  • GPT-4o vs Claude

OpenAI ▷ #gpt-4-discussions (26 messages🔥):

  • OpenAI Vision API
  • Vision Cost
  • Virtual Environment for GPT
  • Headless Browser

OpenAI ▷ #prompt-engineering (7 messages):

  • GPT Mini Prompt Engineering
  • GPT 3.5 vs GPT 4
  • ChatGPT Configuration
  • Code Interpreter Limitations
  • GPT Mini Image Generation

OpenAI ▷ #api-discussions (7 messages):

  • GPT-4.0
  • Prompt engineering
  • GPT-3.5
  • GPT mini
  • Code interpreter

Latent Space ▷ #ai-general-chat (27 messages🔥):

  • CLM
  • GPT Model Size
  • Model Interpretability
  • Procreate
  • Markov Chains


Latent Space ▷ #ai-in-action-club (78 messages🔥🔥):

  • DSPy
  • Cursor
  • Langchain
  • Mistral
  • Model Merging


Cohere ▷ #discussions (49 messages🔥):

  • Data Ingestion to KG
  • Command-r-plus in Sillytavern
  • API Key Partial Responses
  • Prompt Tuning
  • Cohere Office Hours

Cohere ▷ #announcements (1 message):

  • Cohere Developer Office Hours
  • Prompt Tuning
  • Guided Generations API
  • LLM University Tool Use Module
  • Structured Outputs


Cohere ▷ #questions (43 messages🔥):

  • API key monitoring
  • production keys
  • Cohere chat
  • Trial keys
  • Structured output


Cohere ▷ #projects (1 message):

  • CursorLens
  • Cohere models

Link mentioned: CursorLens - Open Source dashboard and analytics for Cursor IDE | Product Hunt: An open-source dashboard for Cursor.sh IDE. Log AI code generations, track usage, and control AI models (including local ones). Run locally or use upcoming hosted version.


Cohere ▷ #cohere-toolkit (2 messages):

  • Toolkit Bug Fixes
  • Python SDK Linting

Interconnects (Nathan Lambert) ▷ #news (12 messages🔥):

  • Yi Tay's Work Style
  • AI Regulation
  • 01AI's future


Interconnects (Nathan Lambert) ▷ #ml-drama (15 messages🔥):

  • Hermes 2.5
  • Mistral struggles
  • Model Merging
  • Open Empathic
  • Zicheng Xu Laid Off

Link mentioned: Tweet from Zeyuan Allen-Zhu (@ZeyuanAllenZhu): (1/2) Many asked for Part 2.2 and I'm sorry for the delay. Our author Zicheng Xu has been unexpectedly laid off. He has my strongest endorsement (see next post). If interested in this project or h...


Interconnects (Nathan Lambert) ▷ #random (15 messages🔥):

  • AI21 Models
  • AI21 vs AI2
  • AI Bubble
  • Gary Marcus
  • AI Safety


Interconnects (Nathan Lambert) ▷ #posts (45 messages🔥):

  • Procrastination
  • Blog Design
  • Substack
  • Fast Writing

OpenAccess AI Collective (axolotl) ▷ #general (15 messages🔥):

  • GrokAdamW optimizer
  • GrokFast paper
  • Gemma 2B update
  • Transformers dev version
  • Unsloth


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (20 messages🔥):

  • Gemma 2b training issues
  • Zero Loss
  • Eager Attention

OpenAccess AI Collective (axolotl) ▷ #general-help (17 messages🔥):

  • Chat Template
  • Axolotl prompt strategies
  • Using custom loaders
  • Training with ShareGPT
  • Fine-tuning with Axolotl

Link mentioned: Allow using tokenizer's default chat template or pass custom jinja chat template by chiragjn · Pull Request #1732 · axolotl-ai-cloud/axolotl: Closes #1689 Summary of changes: Adds tokenizer_default as option for chat_template in chat_template prompt strategy that allows using the chat template from tokenizer's config.json Allows fa...


OpenAccess AI Collective (axolotl) ▷ #datasets (1 message):

  • LLaMa 3.1 8b Lora
  • Post-Hoc Reasoning
  • Sonnet 3.5
  • Claude

LangChain AI ▷ #general (39 messages🔥):

  • LangChain Caching
  • LLM structured output
  • LangChain JSON parsing
  • RAG chatbot delete functionality
  • Hybrid search relevance


LangChain AI ▷ #langserve (1 message):

  • ShortURL.at
  • URL Shortener
  • Social Media Links

Link mentioned: ShortURL - URL Shortener: no description found


LangChain AI ▷ #langchain-templates (1 message):

  • Steam Gift Card
  • ShortURL
  • Shortener

Link mentioned: ShortURL - URL Shortener: no description found


LangChain AI ▷ #share-your-work (4 messages):

  • CursorLens
  • LLMs
  • Machine Learning from Scratch


LangChain AI ▷ #tutorials (1 message):

  • URL Shortener
  • ShortURL

Link mentioned: ShortURL - URL Shortener: no description found


OpenInterpreter ▷ #general (37 messages🔥):

  • Orange Pi 5
  • GPT-4o-mini
  • OpenInterpreter settings
  • OpenInterpreter API
  • Local LLMs for bash commands


OpenInterpreter ▷ #O1 (2 messages):

  • OpenInterpreter device release timeline

OpenInterpreter ▷ #ai-content (4 messages):

  • OpenInterpreter for VSCode edits
  • Terminal Stuck

Link mentioned: Exists - Games from Text, Just Like That: Text-to-Game AI creation platform that let anyone create unique multiplayer games in moments.Join our discord for the closed beta:https://discord.com/invite/...


DSPy ▷ #show-and-tell (9 messages🔥):

  • LLMs
  • RAG
  • Knowledge Graphs
  • WeKnow-RAG
  • Meta Optimization


DSPy ▷ #general (25 messages🔥):

  • DSPy 2.5 & 3.0 Roadmap
  • Langgraph & Routequery Error
  • Optimizing Expert-Engineered Prompts
  • DSPy & API Integration


DSPy ▷ #examples (1 message):

batmanosama: I updated it thanks for pointing that out


DSPy ▷ #colbert (4 messages):

  • Colpali finetuning
  • VLM tuning
  • Domain expertise
  • Colpali data

LAION ▷ #general (25 messages🔥):

  • FLUX Dev
  • LLM for medical assistance
  • Medical LLMs
  • LoRa Training


LAION ▷ #research (12 messages🔥):

  • JPEG-LM
  • Image/Video Generation with LLMs
  • Autoregressive LLMs
  • SIREN
  • Neural Graphics Primitives

Link mentioned: JPEG-LM: LLMs as Image Generators with Canonical Codec Representations: Recent work in image and video generation has been adopting the autoregressive LLM architecture due to its generality and potentially easy integration into multi-modal systems. The crux of applying au...


LlamaIndex ▷ #blog (5 messages):

  • Workflows
  • RAG
  • Agents
  • BeyondLLM
  • JSONalyze Query Engine


LlamaIndex ▷ #general (27 messages🔥):

  • Web Scrapers for LlamaIndex
  • RouterQueryEngine vs Agents
  • LlamaIndex Workflow
  • Batching APIs
  • LlamaIndex CSV Analysis


LlamaIndex ▷ #ai-discussion (2 messages):

  • LLMs
  • LLM Limitations
  • LLMs as Assistants
  • Tokenization
  • Sampling


LLM Finetuning (Hamel + Dan) ▷ #general (5 messages):

  • LLM Hosting
  • HF Spaces
  • Modal
  • Jarvis Labs
  • vLLM

Alignment Lab AI ▷ #general (1 message):

  • Batching APIs
  • OpenAI
  • CuminAI
  • Small Language Models (SLMs)
  • Large Language Models (LLMs)


Mozilla AI ▷ #announcements (1 message):

  • Llamafile update
  • Mozilla AI Community at Rise25
  • ML Paper Talks



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}