Frozen AI News archive

not much happened today

**Anthropic** rolled out **prompt caching** in its API, reducing input costs by up to **90%** and latency by **80%**, enabling instant fine-tuning with longer prompts. **xAI** released **Grok-2**, a new model competing with frontier models from **Google DeepMind**, **OpenAI**, **Anthropic**, **Mistral AI**, and **Meta AI Fair**, supporting vision and text inputs and integrating external image generation models. **Claude 3.5 Sonnet** is reported to outperform **GPT-4** in coding and reasoning, while **ChatGPT-4o-latest** shows reasoning improvements. **François Chollet** proposed a theory defining intelligence as the efficiency of operationalizing past information for future tasks. The **Aya project** involves 3000 collaborators building multilingual AI datasets. **Demis Hassabis** discussed AI hype and safe AI development in a podcast. Tools like **Dora AI** for Figma and **Box's AI API** enhance design automation and document processing. **Salesforce** released **DEI**, an open AI software engineering agents framework with a 55% resolve rate on SWE-Bench Lite. Industry trends highlight rapid AI integration, networking importance in the AI job market, and potential OpenAI GPT-4 expansion in response to competitors. Memes include humor about Apple Vision Pro.

AI News for 8/15/2024-8/16/2024. We checked 7 subreddits, 384 Twitters and 29 Discords (253 channels, and 3480 messages) for you. Estimated reading time saved (at 200wpm): 525 minutes. You can now tag @smol_ai for AINews discussions!

Jeremy Howard's return to Latent Space to talk about his team's extreme AI-fueled productivity is worth a listen, we think, not least because of the dynamite song intro.

You can also enjoy conversations with Demis Hassabis or watch the new Sora demo, and mourn your SearchGPT waitlist rejection letter with the rest of us.


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model and API Updates

AI Development and Research

AI Tools and Applications

Industry and Market Trends

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Advancements in Small and Efficient LLMs

Theme 2. New Model Releases and Benchmarks

Theme 3. Local LLM Deployment and Infrastructure

Theme 4. LLM Cognition and Reality Understanding

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Image Generation and Models

AI Model Comparisons and Speculation

AI and Human Interaction


AI Discord Recap

A summary of Summaries of Summaries by Claude 3.5 Sonnet

1. LLM Advancements and Benchmarks

2. AI Model Optimization Techniques

3. Open-Source AI Developments

4. Multimodal AI Progress

5. AI Safety and Governance


PART 1: High level Discord summaries

Nous Research AI Discord


aider (Paul Gauthier) Discord


Stability.ai (Stable Diffusion) Discord


HuggingFace Discord


LM Studio Discord


Perplexity AI Discord


OpenAI Discord


Interconnects (Nathan Lambert) Discord


Latent Space Discord


Cohere Discord


LlamaIndex Discord


LangChain AI Discord


Eleuther Discord


DSPy Discord


Modular (Mojo 🔥) Discord


OpenInterpreter Discord


LAION Discord


DiscoResearch Discord


Alignment Lab AI Discord


LLM Finetuning (Hamel + Dan) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Nous Research AI ▷ #datasets (6 messages):

  • RedPajama-Data
  • Chatting with Chatbots
  • Token Usage

Link mentioned: GitHub - togethercomputer/RedPajama-Data: The RedPajama-Data repository contains code for preparing large datasets for training large language models.


Nous Research AI ▷ #off-topic (70 messages🔥🔥):

  • Digital Consciousness
  • AI Rights
  • AI Exploration
  • AI Self-Awareness
  • AI Emotional Intelligence

Nous Research AI ▷ #interesting-links (6 messages):

  • Sarvam AI
  • Voice AI
  • LLMs
  • Long Context LLMs
  • RAG

Nous Research AI ▷ #general (465 messages🔥🔥🔥):

  • Hermes 3
  • GPT-4
  • Llama 3.1
  • AI consciousness
  • Memory Locality

Nous Research AI ▷ #ask-about-llms (41 messages🔥):

  • Hermes 3
  • Hermes 4
  • Llama 3.1 405B fine-tuning
  • Claude.ai
  • Model Alignment

Nous Research AI ▷ #rag-dataset (2 messages):

  • RAG
  • RAG types
  • RAG in practice
  • Charlie Marsh RAG

Link mentioned: Tweet from Charlie Marsh (@charliermarsh): Initially thought this was a joke but I guess it's real? So now I have to learn the 12 Types of RAG


Nous Research AI ▷ #reasoning-tasks-master-list (2 messages):

  • Reasoning Tasks Master List
  • Reasoning Task Examples
  • OpenAI's reasoning tasks

aider (Paul Gauthier) ▷ #general (166 messages🔥🔥):

  • Prompt Caching
  • OpenRouter
  • Aider Updates
  • Aider Weak Model
  • Structured Responses

aider (Paul Gauthier) ▷ #questions-and-tips (64 messages🔥🔥):

  • DeepSeek performance
  • Aider Edit Formats
  • Claude 3.5 and Aider
  • DeepSeek-coder-v2:236b-instruct-q2_K
  • Aider Stuck on Lines

aider (Paul Gauthier) ▷ #links (13 messages🔥):

  • JSON vs Markdown output for LLMs
  • LLM performance issues
  • Aider.chat
  • Local vs Cloud Models
  • Early neural network attempts

Stability.ai (Stable Diffusion) ▷ #general-chat (186 messages🔥🔥):

  • Flux Dev
  • Model Merging
  • Dreamshaper-XL v2 Turbo
  • Image Diversity
  • ComfyUI

HuggingFace ▷ #announcements (1 message):

  • VFusion3D
  • FineWeb-Edu
  • LLM with Sentence Transformers
  • New dataset
  • Fine-tuned model

HuggingFace ▷ #general (119 messages🔥🔥):

  • Hermes 3
  • Prior Preservation Loss
  • Gradio Client Latency
  • New Special Tokens
  • Thinking Tokens

HuggingFace ▷ #today-im-learning (3 messages):

  • OpenBLAS
  • genAI workloads
  • LLMTIL
  • Python 3.14
  • Global Interpreter Lock (GIL)

HuggingFace ▷ #cool-finds (7 messages):

  • Hyperspace P2P AI Network
  • Hermes 3 405B
  • DeepSeek Prover V1.5
  • Google Pixel 9 Mobile AI

HuggingFace ▷ #i-made-this (8 messages🔥):

  • Viam robot integration
  • YOLO model deployment
  • Phi-3-mini-instruct-graph
  • Entity Relationship Extraction
  • AskNews Knowledge Graph

HuggingFace ▷ #computer-vision (4 messages):

  • CNNs
  • Pokémon Classification
  • Small Dataset Tips

HuggingFace ▷ #NLP (2 messages):

  • Loading Large Models
  • DeepSpeed and Trainer
  • Device Mapping
  • Memory Usage Optimization
  • Hugging Face Accelerate

Link mentioned: Handling big models for inference


HuggingFace ▷ #diffusion-discussions (7 messages):

  • Flux Model Loading
  • Loading LoRA Weights
  • Interview Taking AI Model

LM Studio ▷ #general (123 messages🔥🔥):

  • ForgeUI
  • GGUF
  • Flux
  • AuraFlow
  • ComfyUI

LM Studio ▷ #hardware-discussion (23 messages🔥):

  • P40 Power Consumption
  • Tensor Split
  • GPU Idle Power Draw
  • llama.cpp Power Management

Perplexity AI ▷ #general (108 messages🔥🔥):

  • Perplexity AI
  • Hermes 3
  • Obsidian plugin
  • Knowledge base
  • LLM batching

Perplexity AI ▷ #sharing (9 messages🔥):

  • Starbucks Leadership Change
  • Thailand's Political Turmoil
  • xAI's Grok 2
  • Kim Dotcom's Extradition

OpenAI ▷ #ai-discussions (95 messages🔥🔥):

  • AI Limitations
  • ChatGPT Hype
  • AI Use Cases
  • AI in Education
  • Grok Token Limit

Link mentioned: Chat gpt4o new Advanced Voice Mode recognizing different accents


OpenAI ▷ #gpt-4-discussions (14 messages🔥):

  • GPT Updates Pending
  • Custom GPTs
  • Knowledge Files

Interconnects (Nathan Lambert) ▷ #news (61 messages🔥🔥):

  • OpenAI ToS
  • SB 1047
  • AI Safety
  • Model Training
  • Hermes Models

Interconnects (Nathan Lambert) ▷ #ml-drama (29 messages🔥):

  • Harrison's Work
  • Sentdex's Success
  • Nous Hermes
  • Meta Cooking Drama
  • Model Overhype

Interconnects (Nathan Lambert) ▷ #random (2 messages):

  • RLHF
  • DPO
  • SFT Dataset
  • Model Performance
  • Mistral and Hermes

Link mentioned: Tweet from Teknium (e/λ) (@Teknium1): @ArnaudStiegler same SFT dataset on all, dpo made the models worse at 70 and 405b so we didnt use rlhf on them


Interconnects (Nathan Lambert) ▷ #memes (6 messages):

  • Social Media Posting Permissions
  • AI2 Orientation
  • Viral Marketing

Link mentioned: Tweet from Nathan Lambert (@natolambert): Ai2 orientation (by @hamishivi)


Interconnects (Nathan Lambert) ▷ #posts (10 messages🔥):

  • Could of
  • Grammar Fallacy
  • Deeply is the new very
  • Merriam-Webster Dictionary
  • The word 'of'

Latent Space ▷ #ai-general-chat (28 messages🔥):

  • DEI
  • Salesforce DEI
  • Meta AI
  • DeepSeek-Prover
  • Proof Assistant

Latent Space ▷ #ai-announcements (1 message):

  • Latent Space Pod
  • AnswerAI
  • Jeremy Howard
  • OpenAI Governance
  • FastHTML

Link mentioned: Tweet from Latent.Space (@latentspacepod): 🆕 Building AI for The People Never has so much been shipped for so many by so few. https://latent.space/p/answerai @jeremyphoward is back on the pod! sharing the founding journey of @AnswerAI, pre...


Latent Space ▷ #ai-in-action-club (78 messages🔥🔥):

  • DSPy
  • Cursor Alpha
  • LangChain
  • Prompting vs Fine-tuning
  • Model Distillation

Cohere ▷ #discussions (4 messages):

  • Cohere Startup Program
  • Oracle Fusion SaaS
  • Gen AI
  • ODA development
  • Cohere model training

Link mentioned: Startup Program: The Cohere Startup Program offers qualified Series B and earlier startups a unique opportunity for support, discounted API rates, and publicity.


Cohere ▷ #questions (19 messages🔥):

  • AutoTokenizer vs LlamaTokenizer
  • LlamaForCausalLM vs AutoModelForCausalLM
  • LLM University
  • Cohere API Keys
  • R+ API Guidelines

Cohere ▷ #api-discussions (14 messages🔥):

  • Dataset Upload Issues
  • Dataset Storage Limits
  • Hard Negative Overlap Error
  • Dataset UI Access

Link mentioned: Login | Cohere: Login for access to advanced Large Language Models and NLP tools through one easy-to-use API.


Cohere ▷ #cohere-toolkit (1 message):

nick_frosst: thats good feedback. thanks all 🙂


LlamaIndex ▷ #blog (3 messages):

  • Llama-Agents
  • Multimodal Report Generation Agent
  • Workflows
  • LlamaIndex

LlamaIndex ▷ #general (28 messages🔥):

  • LlamaIndex's GraphRAG
  • Anthropic's performance
  • LlamaIndex in FastAPI
  • Function calling with OpenAI
  • ColPali

Link mentioned: [Bug]: Streaming with async_response_gen incompatible with FastAPI · Issue #13495 · run-llama/llama_index: Bug Description I have a very simple FastAPI endpoint set up to test out streaming tokens back from a context chat engine. As written, the first request correctly streams the content back, but ever...


LlamaIndex ▷ #ai-discussion (6 messages):

  • JSONalyze with LlamaIndex Workflows
  • Batching of LLM Jobs
  • AI Castaway Survival Game
  • LlamaIndex in AI Castaway

LangChain AI ▷ #general (36 messages🔥):

  • LangChain Agent Tools
  • OpenAI Actions
  • MindSQL
  • Awesome LangChain
  • LangGraph ToolNode

Eleuther ▷ #general (1 message):

  • Remote AI Startup Jobs
  • UTC+0 Timezone

Eleuther ▷ #research (9 messages🔥):

  • Boundary Attention
  • Language Model Probability Computation
  • ACL Review Concerns
  • Fine-tuning Gemma-2-2b without LayerNorm

Eleuther ▷ #interpretability-general (1 message):

  • Goodfire AI
  • Interpretability
  • AI models
  • Practical applications
  • Scaling AI

Link mentioned: Goodfire | Interpretability for deploying safe and reliable generative AI models


Eleuther ▷ #lm-thunderdome (11 messages🔥):

  • Llama3-8B-Instruct
  • GSM8k
  • Meta's Llama3
  • LM-evaluation-harness
  • AutoTokenizer

Link mentioned: lm-evaluation-harness/docs/new_task_guide.md at main · EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of language models. - EleutherAI/lm-evaluation-harness


DSPy ▷ #show-and-tell (5 messages):

  • Neural Search
  • VITA AI Assistant
  • Neural Network for Text Retrieval
  • Code Instruction Examples
  • Model Merging

DSPy ▷ #papers (4 messages):

  • LLM evaluation
  • RAG
  • Web Search integration
  • Knowledge Graphs and LLMs
  • Graph Language Model (GLM)

DSPy ▷ #general (6 messages):

  • GitHub Readme Contributors
  • Function Docstrings
  • Signature Input Field

DSPy ▷ #examples (1 message):

batmanosama: I updated it thanks for pointing that out


Modular (Mojo 🔥) ▷ #general (5 messages):

  • Mojo & Max integration
  • Mojo as general-purpose PL
  • Mojo's runtime

Modular (Mojo 🔥) ▷ #mojo (6 messages):

  • String indexing
  • Code points
  • Grapheme clusters
  • Memory efficiency

Modular (Mojo 🔥) ▷ #max (1 message):

  • Mojo Installation Issues
  • Modular Install Error
  • WSL Ubuntu
  • Mojo Manifest Expiration

Link mentioned: Issues · modularml/mojo: The Mojo Programming Language. Contribute to modularml/mojo development by creating an account on GitHub.


OpenInterpreter ▷ #general (12 messages🔥):

  • RPI5 vs Umbrel
  • Gemini Models with OI OS
  • Local Home Server with Ollama
  • Low Discord Activity

LAION ▷ #general (5 messages):

  • Musk/X
  • Stanford researchers
  • Media bias
  • BFL/Flux

LAION ▷ #resources (1 message):

  • Moonglow
  • Remote GPU access
  • Jupyter notebooks
  • Runpod

Link mentioned: Moonglow


DiscoResearch ▷ #general (2 messages):

  • xLSTM trainer
  • Hugging Face compatible
  • helibrunna

Link mentioned: GitHub - AI-Guru/helibrunna: A HuggingFace compatible xLSTM trainer.


Alignment Lab AI ▷ #general (1 message):

  • Jala Data Labeling

Link mentioned: Jala - Data Labeling Solution


LLM Finetuning (Hamel + Dan) ▷ #general (1 message):

  • Model Expiration Times
  • OpenAI's Shorter Expiration Time
  • Modal's Extension Policy
  • Model Expirations Across Providers



{% else %}

The full channel-by-channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AI News, please share it with a friend! Thanks in advance!

{% endif %}