Frozen AI News archive

not much happened today

**GPT-5** delayed again amid a quiet news day. **Nous Research** released the Hermes 3 finetune of the **Llama 3** base models, which rivals FAIR's instruct tunes but sparked debate over emergent existential-crisis behavior with 6% roleplay data. **Nvidia** introduced the Minitron finetune of **Llama 3.1**. **Salesforce** launched a DEI agent scoring 55% on SWE-Bench Lite. **Goodfire AI** secured $7M in seed funding for mechanistic interpretability work. **Anthropic** rolled out prompt caching in their API, cutting input costs by up to 90% and latency by 80%, which helps coding assistants and large-document processing. **xAI** released **Grok-2**, matching **Claude 3.5 Sonnet** and **GPT-4 Turbo** on the LMSYS leaderboard, with vision+text inputs and integrated image generation. **Claude 3.5 Sonnet** reportedly outperforms **GPT-4** in coding and reasoning. **François Chollet** defined intelligence as the efficient operationalization of past information for future tasks. **Salesforce's** DEI framework surpasses individual agent performance. **Google DeepMind's** Demis Hassabis discussed AGI's role in scientific discovery and safe AI development. The **Dora AI** plugin generates landing pages in under 60 seconds, boosting web team efficiency. The **Box AI API** beta enables document chat, data extraction, and content summarization. **LangChain** updated its Python & JavaScript integration docs.


AI News for 8/14/2024-8/15/2024. We checked 7 subreddits, 384 Twitters and 29 Discords (254 channels, and 5043 messages) for you. Estimated reading time saved (at 200wpm): 945 minutes. You can now tag @smol_ai for AINews discussions!

A smattering of notables but no major story:

Since it's a quiet day, you could check out our sponsor Box's AI API!


[Sponsored by Box] You have files. The files are full of nonsense. Box AI has an API that extracts useful metadata from the nonsense. See for yourself.

Swyx's comment: compared to last week's sponsored post, this tutorial goes into metadata extraction from your Box items, aka structured data, and shows practical use cases for querying that metadata. All RAG eventually evolves into hybrid embedding + metadata queries, and Box's template approach is perhaps a more practical take on the JSONSchema API by the big labs.
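To make the "hybrid embedding + metadata query" idea concrete, here is a minimal sketch in plain Python. It is not Box's API: the toy corpus, the field names, and the `hybrid_search` helper are all hypothetical, and the embeddings are made-up 3-dimensional vectors. The sketch filters on exact metadata matches first, then ranks the survivors by cosine similarity.

```python
import math

# Hypothetical toy corpus: each item carries an embedding plus extracted metadata.
DOCS = [
    {"id": "contract-a", "embedding": [0.9, 0.1, 0.0],
     "metadata": {"type": "contract", "year": 2024}},
    {"id": "contract-b", "embedding": [0.8, 0.2, 0.1],
     "metadata": {"type": "contract", "year": 2021}},
    {"id": "memo-c", "embedding": [0.95, 0.05, 0.0],
     "metadata": {"type": "memo", "year": 2024}},
]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def hybrid_search(query_embedding, filters, docs=DOCS, top_k=2):
    """Return ids of the top_k docs that match every metadata filter,
    ranked by embedding similarity to the query."""
    # Step 1: structured filter on metadata (exact match on every key).
    candidates = [
        d for d in docs
        if all(d["metadata"].get(k) == v for k, v in filters.items())
    ]
    # Step 2: semantic ranking of the remaining candidates.
    ranked = sorted(
        candidates,
        key=lambda d: cosine(query_embedding, d["embedding"]),
        reverse=True,
    )
    return [d["id"] for d in ranked[:top_k]]


if __name__ == "__main__":
    print(hybrid_search([1.0, 0.0, 0.0], {"type": "contract"}))
```

Filtering before ranking keeps the similarity computation restricted to items that already satisfy the structured constraints; a real system would typically push both steps into the vector store or search index.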


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Updates and Releases

AI Research and Development

AI Tools and Applications

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. New Open Models

Theme 2. Grok-2 and Grok-2 Mini: x.AI's Latest Benchmark Results

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Image Generation Advancements

AI in Commercial Applications

AI Model Behavior and Capabilities

Humor and Memes


AI Discord Recap

A summary of Summaries of Summaries by GPT4O (gpt-4o-2024-05-13)

1. LLM Advancements and Benchmarking

2. Model Optimization and Caching

3. AI Tools and Plugins

4. Open-Source AI Frameworks and Community Efforts


PART 1: High level Discord summaries

aider (Paul Gauthier) Discord


Stability.ai (Stable Diffusion) Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


OpenAccess AI Collective (axolotl) Discord


tinygrad (George Hotz) Discord


OpenInterpreter Discord


Alignment Lab AI Discord


MLOps @Chipro Discord


AI21 Labs (Jamba) Discord


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

aider (Paul Gauthier) ▷ #general (148 messages🔥🔥):

  • OpenRouter Aider Usage
  • Model Caching
  • Aider Scripting
  • LLM Caching
  • OpenAI Update



aider (Paul Gauthier) ▷ #questions-and-tips (17 messages🔥):

  • Llama 3.1
  • Grok-2
  • Aider Image Support
  • OpenRouter
  • Prompt Caching



aider (Paul Gauthier) ▷ #links (6 messages):

  • Aider UI in IDE
  • Aider Conflict Resolution
  • Aider Edit Confirmation

Link mentioned: Add option to force the AI to ask the user to confirm each change before doing it · Issue #649 · paul-gauthier/aider: Issue: Sometimes it is necessary to supervise each atomic change to the code or to the project configurations. Aider already shows us every atomic change like a diff. It would be great if, when spec...


Stability.ai (Stable Diffusion) ▷ #general-chat (166 messages🔥🔥):

  • Free SD model deployment
  • NFT scams
  • Stable Diffusion on phone
  • Free image to video
  • GPU throttling

Link mentioned: Humanity is Doomed: Asmongold Clips / Asmongold Reacts To: AI catfishing is getting out of hand. AI Videos By: https://x.com/ai_for_success/status/1821975861698154993 https://x.com...


OpenRouter (Alex Atallah) ▷ #announcements (1 message):

  • Flavor of the Week Model Removal

Link mentioned: Flavor of The Week - API, Providers, Stats: This is a router model that rotates its underlying model weekly. It aims to be a simple way to explore the capabilities of new models while using the same model ID. Run Flavor of The Week with API


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

  • 4oSo agent
  • OpenRouter
  • Claude 3.5 Sonnet
  • GPT-4o

OpenRouter (Alex Atallah) ▷ #general (120 messages🔥🔥):

  • OpenRouter Arena
  • OpenRouter LLM API
  • OpenRouter LLM Model Availability
  • OpenRouter Privacy
  • OpenRouter PDF upload



Eleuther ▷ #general (7 messages):

  • Meteorological ML Models
  • LLM Training Stopping Conditions

Eleuther ▷ #research (58 messages🔥🔥):

  • Hyperbolic Embeddings
  • Activation Quantization
  • Boundary Attention
  • NAS



Eleuther ▷ #lm-thunderdome (57 messages🔥🔥):

  • Meta Llama benchmarks
  • Benchmark differences
  • Eval Harness settings
  • Prompt engineering
  • Eval harness differences



Interconnects (Nathan Lambert) ▷ #news (87 messages🔥🔥):

  • Anthropic API
  • Deepseek
  • SB1047



Interconnects (Nathan Lambert) ▷ #ml-drama (1 message):

  • ACL
  • Emily Bender's Talk
  • Response to Bender's Talk

Link mentioned: acl-presedential-response.md: GitHub Gist: instantly share code, notes, and snippets.


Interconnects (Nathan Lambert) ▷ #memes (5 messages):

  • Trump account

Link mentioned: Tweet from Donald J. Trump (@realDonaldTrump): no description found


Modular (Mojo 🔥) ▷ #general (73 messages🔥🔥):

  • Mojo Networking
  • Mojo vs Go
  • Mojo vs Rust
  • Mojo's Future
  • Mojo's Branding



Modular (Mojo 🔥) ▷ #mojo (18 messages🔥):

  • Mojo Community Meetings
  • Mojo String optimizations
  • Mojo String Unicode support
  • Mojo String implementation
  • Small String Optimization



Modular (Mojo 🔥) ▷ #max (1 message):

  • MAX real-time safety
  • C API for MAX
  • Real-time audio applications
  • Onnxruntime and libtorch limitations

LlamaIndex ▷ #blog (2 messages):

  • LlamaIndex Workflows
  • RAG system
  • Azure AI Search
  • Azure OpenAI
  • Citation Query Engine

LlamaIndex ▷ #general (62 messages🔥🔥):

  • GraphRAG Apps
  • LlamaIndex Agent tool expectations
  • LlamaIndex agent and chat history
  • LlamaIndex embedding update
  • LlamaIndex GraphRAG and Microsoft's implementation



OpenAccess AI Collective (axolotl) ▷ #general (21 messages🔥):

  • Mistral Large 2 training
  • KTO Trainer
  • TRL
  • SmolLM models
  • Lambda Clusters



OpenAccess AI Collective (axolotl) ▷ #general-help (2 messages):

  • llama.cpp
  • GGUF conversion

tinygrad (George Hotz) ▷ #general (9 messages🔥):

  • Tinygrad Typechecking
  • Compiler Book Recommendations
  • Cuda.py Documentation
  • ONNX Support in Tinygrad



tinygrad (George Hotz) ▷ #learn-tinygrad (10 messages🔥):

  • Tinygrad vs Jax/Flux
  • Tinygrad's CUDA/NV Issues
  • Google's TPU dominance
  • PyTorch vs Tinygrad Benchmarks

OpenInterpreter ▷ #general (12 messages🔥):

  • Local LLMs
  • Home Server Setup
  • Umbrel
  • TuringPi
  • 01 Device



OpenInterpreter ▷ #O1 (4 messages):

  • Personalized Education
  • AI Tutors
  • Science Education

OpenInterpreter ▷ #ai-content (1 message):

8i8__papillon__8i8d1tyr: https://www.youtube.com/watch?v=gujAar8NZKo


Alignment Lab AI ▷ #general (1 message):

  • Jala

Link mentioned: Jala - Data Labeling Solution: no description found


MLOps @Chipro ▷ #events (1 message):

  • AI Capabilities and Risks Demo-Jam Hackathon
  • Pre-Hackathon Workshop
  • AI safety
  • AI risks and potential

AI21 Labs (Jamba) ▷ #general-chat (1 message):

  • AI21 FusionLabs plugin for bubble
  • Conversation RAG
  • AI21Labs models on bubble



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}