Frozen AI News archive

Stripe lets Agents spend money with StripeAgentToolkit

**Stripe** has pioneered an AI SDK specifically designed for agents that handle payments, integrating with models like **gpt-4o** to enable financial transactions and token-based charging. The AI developer tooling trend emphasizes better "AI-Computer Interfaces" for improved agent reliability, with tools like **E2B** and the `llms.txt` documentation trend gaining traction, notably adopted by **Anthropic**. In AI model news, **Gemini-Exp-1114** topped the Vision Leaderboard and improved in Math Arena, while discussions continue around model overfitting and the limits of scaling laws for **AGI**. **OpenAI** released a **ChatGPT desktop app for macOS** with integrations for **VS Code**, **Xcode**, and **Terminal**, enhancing developer workflows and pair programming. **Anthropic** introduced a prompt improver using chain-of-thought reasoning, and **Meta AI** shared top research from **EMNLP2024** on image captioning, dialogue systems, and memory-efficient fine-tuning. Highlights from **ICLR 2025** include diffusion-based illumination harmonization, open mixture-of-experts language models, and hyperbolic vision-language models. A new adaptive decoding method optimizes creativity and factuality per token. Tools like **LlamaParse** and **RAGformation** were also introduced for document parsing and retrieval-augmented generation.

AI News for 11/14/2024-11/15/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (217 channels, and 1812 messages) for you. Estimated reading time saved (at 200wpm): 191 minutes. You can now tag @smol_ai for AINews discussions!

One of the rising theses in AI developer tooling this year is that tools with better "AI-Computer Interfaces" will do better as a medium-term solve for agent reliability/accuracy. You can see this in tools like E2B and in the rising llms.txt docs trend, started by Jeremy Howard and now adopted by Anthropic. Vercel has a generalist AI SDK, but Stripe is the first dev tooling company to create an SDK specifically for agents that take money:

import {StripeAgentToolkit} from '@stripe/agent-toolkit/ai-sdk';
import {openai} from '@ai-sdk/openai';
import {generateText} from 'ai';

const toolkit = new StripeAgentToolkit({
  secretKey: "sk_test_123",
  configuration: {
    actions: {
      // ... enable specific Stripe functionality
    },
  },
});

await generateText({
  model: openai('gpt-4o'),
  tools: {
    ...toolkit.getTools(),
  },
  maxSteps: 5,
  prompt: 'Send <<email address>> an invoice for $100',
});

They can also spend money and charge based on token usage. A very forward-thinking move here, solving common pain points, and in retrospect it is unsurprising that Stripe was the first to build financial services for AI agents.
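To make the token-based charging idea concrete, here is a minimal sketch of turning token counts into a billable amount. It is not Stripe's API: the `TokenUsage` shape, the `TokenPricing` rates, and both helper functions are hypothetical names chosen for illustration, and the prices are made-up numbers. Rates are kept as integer nano-dollars per token so the arithmetic stays exact before rounding up to whole cents.

```typescript
// Hypothetical shapes for illustration; not taken from Stripe's toolkit or any SDK.
interface TokenUsage {
  promptTokens: number;
  completionTokens: number;
}

interface TokenPricing {
  // Integer nano-dollars (1e-9 USD) per token, e.g. 2500 = $2.50 per million tokens.
  promptNanoUsdPerToken: number;
  completionNanoUsdPerToken: number;
}

const NANO_USD_PER_CENT = 10_000_000;

// Sum usage across several model calls (an agent run may take more than one step).
function sumUsage(steps: TokenUsage[]): TokenUsage {
  return steps.reduce(
    (total, step) => ({
      promptTokens: total.promptTokens + step.promptTokens,
      completionTokens: total.completionTokens + step.completionTokens,
    }),
    { promptTokens: 0, completionTokens: 0 },
  );
}

// Convert usage into whole cents, rounding up so partial cents are still billed.
function usageToCents(usage: TokenUsage, pricing: TokenPricing): number {
  const nanoUsd =
    usage.promptTokens * pricing.promptNanoUsdPerToken +
    usage.completionTokens * pricing.completionNanoUsdPerToken;
  return Math.ceil(nanoUsd / NANO_USD_PER_CENT);
}

// Made-up example rates and a two-step agent run.
const examplePricing: TokenPricing = {
  promptNanoUsdPerToken: 2500,
  completionNanoUsdPerToken: 10000,
};
const exampleRun = sumUsage([
  { promptTokens: 700, completionTokens: 300 },
  { promptTokens: 500, completionTokens: 500 },
]);
const exampleCents = usageToCents(exampleRun, examplePricing);
```

The resulting integer amount in cents could then be reported to whatever billing or metering system is in use; that step is left out here because it depends on the provider's actual API.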


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

All recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Models and Benchmarks

AI Company News

AI Research and Papers

AI Tools and Software Updates

AI Agents and Applications

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Gemini Exp 1114 Achieves Top Rank in Chatbot Arena

Theme 2. Omnivision-968M Optimizes Edge Device Vision Processing

Theme 3. Qwen 2.5 7B Dominates Livebench Rankings

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. Claude Surges Past GPT-4o: Major Shift in Code Generation Quality

Theme 2. FrontierMath Benchmark: Models Score Only 2% on Advanced Math

Theme 3. Chat.com Domain Sells for $15M to OpenAI - Major Corporate Move

Theme 4. Claude Rollback: 3.6 Issues Lead to Version Reversal


AI Discord Recap

A summary of Summaries of Summaries by O1-mini

Theme 1: Hardware and Performance Optimization for AI Models

Theme 2: Model Releases and Integration Enhancements

Theme 3: AI Tool Integration and Feature Development

Theme 4: Training Techniques and Dataset Management


PART 1: High level Discord summaries

LM Studio Discord


Perplexity AI Discord


aider (Paul Gauthier) Discord


HuggingFace Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


Interconnects (Nathan Lambert) Discord


GPU MODE Discord


Notebook LM Discord


OpenAccess AI Collective (axolotl) Discord


OpenAI Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Cohere Discord


LAION Discord


DSPy Discord


Modular (Mojo 🔥) Discord


LLM Agents (Berkeley MOOC) Discord


Torchtune Discord


Mozilla AI Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LM Studio ▷ #general (83 messages🔥🔥):

  • Qwen2.5-Math-72B-Instruct issues
  • LM Studio local server setup
  • Function calling beta in LM Studio
  • SSD speed comparisons
  • Using multiple apps for LLM tasks

LM Studio ▷ #hardware-discussion (223 messages🔥🔥):

  • GPU Comparisons
  • AMD Radeon PRO W7800
  • Apple hardware for AI
  • Qualcomm X Elite v2
  • Training costs for LLMs


Perplexity AI ▷ #general (238 messages🔥🔥):

  • Perplexity API and Features
  • Mobile App Issues
  • Chinese Language and Culture Discussions
  • Plagiarism Check in AI Writing
  • Curator Applications for Perplexity


Perplexity AI ▷ #sharing (10 messages🔥):

  • DeepMind AlphaFold Code Release
  • Chegg's Decline Due to ChatGPT
  • Google Gemini App Launch
  • Microsoft Quantum Logic
  • Best Work Mouse


Perplexity AI ▷ #pplx-api (2 messages):

  • PPLX API Performance
  • URL Injection Issue

aider (Paul Gauthier) ▷ #general (203 messages🔥🔥):

  • Gemini Experimental Model
  • Aider Configuration Issues
  • Costs of Using Models
  • Integrating Aider with Hyperbolic's Qwen 2.5 Coder
  • Using Aider for Commit Messages


aider (Paul Gauthier) ▷ #questions-and-tips (38 messages🔥):

  • Aider's API Key Configuration
  • Aider Permission Errors
  • Using XAI with Aider
  • Cheap Models for Aider
  • Aider Running in Docker


aider (Paul Gauthier) ▷ #links (1 message):

renanfranca9480: https://supermaven.com/blog/cursor-announcement


HuggingFace ▷ #general (118 messages🔥🔥):

  • Hugging Face Sign-Up Issues
  • OCR Accuracy and Improvement
  • Reinforcing OCR Models
  • Loading LoRA State Dicts
  • Shortening URLs for Hugging Face


HuggingFace ▷ #today-im-learning (3 messages):

  • Missing user updates
  • Online tutorials progress

HuggingFace ▷ #cool-finds (3 messages):

  • Post Transparency
  • Community Feedback

HuggingFace ▷ #i-made-this (9 messages🔥):

  • SnowballTarget environment
  • OKReddit dataset
  • RWKV models
  • Runtime errors in spaces
  • Search frustrations on Hugging Face


HuggingFace ▷ #reading-group (6 messages):

  • GPU compatibility
  • Motherboard decisions
  • PCIe bandwidth considerations

HuggingFace ▷ #NLP (2 messages):

  • Legal Embedding Models
  • AI Assistant for Legal Systems
  • GTE Finetuning Data

Unsloth AI (Daniel Han) ▷ #general (102 messages🔥🔥):

  • Importance of Learning Triton and CUDA
  • Current State of Language Models
  • Input Processing for Fine Tuning
  • Model Comparison and Utilization
  • Personal Experiences of Community Members


Unsloth AI (Daniel Han) ▷ #off-topic (8 messages🔥):

  • Dietary Choices
  • Rice Preferences

Unsloth AI (Daniel Han) ▷ #help (20 messages🔥):

  • Unsloth installation issues
  • Support for Lora+
  • Svelte dataset for Qwen2.5-Coder
  • Song lyrics dataset preparation
  • Fine-tuning Llama3.1b for math equations


Eleuther ▷ #general (12 messages🔥):

  • Histogram plotting alternatives
  • Open Source AI definitions
  • Discussion on coding in AI
  • Datashader for large datasets

Link mentioned: The Open Source AI Definition – 1.0: version 1.0 Preamble Why we need Open Source Artificial Intelligence (AI) Open Source has demonstrated that massive benefits accrue to everyone after removing the barriers to learning, using, sharing ...


Eleuther ▷ #research (97 messages🔥🔥):

  • Scaling Laws in AI
  • LLM Capabilities
  • Encrypted LLM Communication
  • Diminishing Returns in Model Training
  • Synthetic Tasks and Relevance


Eleuther ▷ #interpretability-general (6 messages):

  • Pythia model suite
  • Mixture-of-Experts (MoE)
  • Hidden States Unconference
  • Transformer heads and antonyms


Nous Research AI ▷ #general (66 messages🔥🔥):

  • TEE Wallet Collation Concerns
  • NVIDIA NV-Embed-v2 Release
  • RLHF vs SFT for Fine-tuning
  • Tee's Twitter Activity


Nous Research AI ▷ #ask-about-llms (2 messages):

  • Mixing datasets for LLM training
  • Matching strategies for datasets

Nous Research AI ▷ #research-papers (1 message):

adjectiveallison: https://arxiv.org/abs/2411.04732

Image recognition sota with a model 29x smaller


Nous Research AI ▷ #interesting-links (6 messages):

  • AI-driven translation tool
  • Resume website transformation
  • Translation services
  • Slang translation
  • Cultural nuances in translation


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • MistralNemo StarCannon
  • Celeste
  • Perplexity Citations
  • Beta Features
  • Model Updates

OpenRouter (Alex Atallah) ▷ #general (65 messages🔥🔥):

  • Gemini API Availability
  • Rate Limits on OpenRouter
  • Hermes 405B Model
  • Perplexity Citations Live
  • Magnum 72B Model Evaluation


OpenRouter (Alex Atallah) ▷ #beta-feedback (9 messages🔥):

  • Custom Provider Keys Access

Stability.ai (Stable Diffusion) ▷ #general-chat (60 messages🔥🔥):

  • Stable Diffusion GPU Usage
  • WebUI Installation Guides
  • Image Blending Research Paper
  • Video Upscaling Techniques
  • Using Low Denoise Inpaint


Interconnects (Nathan Lambert) ▷ #news (16 messages🔥):

  • Scaling Laws Theory
  • GPT-3 Scaling Hypothesis
  • Funding Round Update
  • Changes in Twitter's Terms of Service
  • Meta AI Developments


Interconnects (Nathan Lambert) ▷ #ml-drama (9 messages🔥):

  • Ilya Sutskever emails
  • AI perceptions
  • Misalignment discussions


Interconnects (Nathan Lambert) ▷ #random (12 messages🔥):

  • Expectations for the New Model
  • Sam Altman's ICO Proposal
  • Apple Silicon vs NVIDIA Performance
  • PyTorch Anaconda Package Deprecation


Interconnects (Nathan Lambert) ▷ #memes (8 messages🔥):

  • Squidward & SpongeBob Discord Decor
  • Discord Shopping Experience

Interconnects (Nathan Lambert) ▷ #posts (2 messages):

  • Personas
  • Prompts

Interconnects (Nathan Lambert) ▷ #retort-podcast (7 messages):

  • Audio Experience
  • The Gradient
  • Podcast Cadence

GPU MODE ▷ #general (1 message):

memorypaladin: just go ahead and ask your question.


GPU MODE ▷ #triton (3 messages):

  • Kernel Variable Passing
  • Grid Configuration Issues

GPU MODE ▷ #torch (7 messages):

  • FSDP and torch.compile
  • Using nsys for profiling
  • Proton profiler with Triton
  • Memory issues with nsys
  • Using emit_nvtx() for profiling


GPU MODE ▷ #cool-links (2 messages):

  • ZLUDA
  • NVIDIA GPUs
  • AMD and Intel GPUs

Link mentioned: #246 Developer Of ZLUDA: CUDA For Non Nvidia GPUs | Andrzej Janik: CUDA is one of the primary reasons people buy NVIDIA GPUs but what if there was a way to have this compute power on AMD and Intel GPUs as well. Well there is...


GPU MODE ▷ #beginner (13 messages🔥):

  • NVCC vs Clang performance
  • GPU memory calculation for LLMs
  • Debugging register usage in Kokkos
  • Loop unrolling effects on performance

Link mentioned: Execution Policies - Kokkos documentation


GPU MODE ▷ #triton-puzzles (1 message):

  • Online Softmax

GPU MODE ▷ #webgpu (4 messages):

  • TFLOPS performance
  • Apple M2 memory model
  • Efficient caching strategies

GPU MODE ▷ #liger-kernel (2 messages):

  • Liger-Kernel Bug Fix
  • Convergence Test Issues

Link mentioned: Fix flce not being patched after reverting in convergence test by Tcc0403 · Pull Request #385 · linkedin/Liger-Kernel: Summary Resolve #355: 1. revert patching causes flce not taking effect (comment out revert patching for now, and only test float32). The bug occurs because we define a model config dictionary befor...


GPU MODE ▷ #self-promotion (2 messages):

  • cudaDeviceSynchronize
  • C++ development experience

Link mentioned: Tweet from Daniel Galvez (@memorypaladin): cudaDeviceSynchronize() is an anti-pattern that people should avoid. It's not clearly stated, but basically it iterates over all cuda streams in the cuda context and calls cudaStreamSynchronize() ...


GPU MODE ▷ #🍿 (2 messages):

  • Discord Cluster Manager Issue
  • Finetuning Loop
  • Triton Docs and Examples

Link mentioned: Job Queue · Issue #23 · gpu-mode/discord-cluster-manager: I recently read @TimDettmers thread on quantization scaling. Very inspiring. I think that this is the sort of research, research on how to do research, and popularizing the methods and results, is ...


GPU MODE ▷ #thunderkittens (11 messages🔥):

  • Kernel Compilation Delays
  • Complexity Threshold in Code
  • Template Parameter Impact
  • Effective Debugging Strategies

GPU MODE ▷ #edge (5 messages):

  • React Native LLM Library
  • LLM Inference on Android
  • Memory Bound vs Compute Bound in LLMs
  • Bitnet for Fast Inference
  • GGUF Q8 Performance

Link mentioned: GitHub - software-mansion/react-native-executorch: Contribute to software-mansion/react-native-executorch development by creating an account on GitHub.


Notebook LM Discord ▷ #use-cases (17 messages🔥):

  • Top Shelf Podcast
  • AI and violence
  • Podcast dialogue techniques
  • Microlearning through audio
  • Physics textbook assistance


Notebook LM Discord ▷ #general (34 messages🔥):

  • Art of Prompting with AI
  • Issues with NotebookLM
  • Audio Summaries and Customization
  • Uploading Documents
  • Exploration of Relevant Links


OpenAccess AI Collective (axolotl) ▷ #general (36 messages🔥):

  • Liger Kernel Improvements
  • Cut Cross-Entropy Method
  • Orca3 Insights
  • Event Invitation from Meta
  • Tokenization and Fine-Tuning

Link mentioned: Tweet from Kearm (@Nottlespike): So this is how my day has been going


OpenAccess AI Collective (axolotl) ▷ #datasets (1 message):

  • Orca
  • Orca 2
  • Orca-AgentInstruct
  • Synthetic Data Generation

Link mentioned: Orca-AgentInstruct: Agentic flows can be effective synthetic-data generators: Orca-AgentInstruct, from Microsoft Research, can generate diverse, high-quality synthetic data at scale to post-train and fine-tune base LLMs for expanded capabilities, continual learning, and increas...


OpenAccess AI Collective (axolotl) ▷ #announcements (1 message):

  • Office Hours
  • Axolotl feedback session

Link mentioned: Discord - Group Chat That’s All Fun & Games: Discord is great for playing games and chilling with friends, or even building a worldwide community. Customize your own space to talk, play, and hang out.


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (9 messages🔥):

  • Qwen/Qwen2 Model Pretraining
  • Eval Steps in Training
  • Phorm Bot Malfunction

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search: Understand code, faster.


OpenAI ▷ #ai-discussions (25 messages🔥):

  • P100 vs P40 Scam
  • Photo Selection Techniques
  • PDF Upload Issues
  • GPT-4o Pricing Mechanics
  • AI Club Development Plans


OpenAI ▷ #gpt-4-discussions (6 messages):

  • Content Flags
  • GPT Issues
  • User Input Policies

OpenAI ▷ #prompt-engineering (5 messages):

  • Game of 24 AI performance
  • Exploring past experiences
  • RAG prompt and few-shot examples

OpenAI ▷ #api-discussions (5 messages):

  • Game of 24
  • Exploring Past AI Iterations
  • Prompting Techniques for AI
  • RAG Prompt with Few-Shot Examples

OpenInterpreter ▷ #general (17 messages🔥):

  • OpenAI's 'OPERATOR' AI Agent
  • Beta App Performance
  • Azure AI Search Methodologies
  • Open Interpreter Shell Integration
  • Devin AI Preview Access


OpenInterpreter ▷ #ai-content (4 messages):

  • Probabilistic computing
  • ChatGPT desktop compatibility
  • New computing performance

Link mentioned: New Computing Breakthrough achieves 100 MILLION Times GPU Performance!: In this video I discuss probabilistic computing that reportedly allows for 100 million times better energy efficiency compared to the best NVIDIA GPUs.Check ...


tinygrad (George Hotz) ▷ #general (12 messages🔥):

  • tinybox pro preorder
  • int64 indexing bounty
  • buffer transfer function
  • community support in development


tinygrad (George Hotz) ▷ #learn-tinygrad (2 messages):

  • tinygrad Contributions
  • Buffer Transfer on CLOUD Devices
  • GPU Tensor Transfer

Link mentioned: Buffer transfer on CLOUD devices by mdaiter · Pull Request #7709 · tinygrad/tinygrad: From some feedback, re-opening this PR. My last PR focused on reading data synchronously out of a web buffer, onto a host device, and then dumping it in another web buffer. Seems silly in hindsight...


LlamaIndex ▷ #blog (2 messages):

  • LlamaIndex Community Call
  • Python Documentation Upgrade
  • Knowledge Graphs
  • RAG system

LlamaIndex ▷ #general (10 messages🔥):

  • condenseQuestionChatEngine
  • CondensePlusContext
  • retrieving context
  • customizing prompts

Link mentioned: Postgres Vector Store - LlamaIndex


Cohere ▷ #discussions (6 messages):

  • New Users on Cohere
  • Issues with Model Settings

Cohere ▷ #questions (3 messages):

  • Playwright Python file upload
  • Cohere discussion

Cohere ▷ #projects (1 message):

  • Agentic Chunking
  • LlamaChunk Method
  • RAG Pipelines
  • Regular Expressions Challenges


LAION ▷ #general (6 messages):

  • Copyright infringement debate
  • Public link index discussion

DSPy ▷ #show-and-tell (1 message):

  • ChatGPT for macOS
  • Integration with desktop apps
  • dspy GPTs functionality

Link mentioned: Tweet from OpenAI Developers (@OpenAIDevs): ChatGPT 🤝 VS Code, Xcode, Terminal, iTerm2 ChatGPT for macOS can now work with apps on your desktop. In this early beta for Plus and Team users, you can let ChatGPT look at coding apps to provide be...


DSPy ▷ #general (4 messages):

  • LLM Document Generation for Infractions
  • DSPy Language Compatibility

Link mentioned: Feature request: localization · Issue #1803 · stanfordnlp/dspy: Hi! It is me again :-) . As am trying out some basics of the DSPy some more I realised one thing: the currently is no option to set diff. lang than English. The thing is, I want to make sure the LM...


Modular (Mojo 🔥) ▷ #general (5 messages):

  • New ABIs Research
  • ALLVM Project
  • Cross-Language LTO
  • Mojo's ABI Potential

Link mentioned: ALLVM Research Project | LLVM All the Things! - University of Illinois at Urbana-Champaign


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 message):

  • Intel Tiber AI Cloud
  • Intel Liftoff Program
  • AMA with Intel
  • AI development tools
  • Hackathon opportunities

Link mentioned: Building with Intel: Tiber AI Cloud and Intel Liftoff · Luma: Building with Intel: Tiber AI Cloud and Intel Liftoff About the AMA Join us for an exclusive AMA session featuring specialists from Intel, our esteemed sponsor…


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

  • Quizzes Feedback
  • Course Deadlines

Torchtune ▷ #announcements (1 message):

  • Torchtune v0.4.0
  • Activation Offloading
  • Qwen2.5 Support
  • Multimodal Training Enhancement

Torchtune ▷ #papers (2 messages):

  • Orca and Orca 2
  • Agentic Solutions for Synthetic Data
  • Causal Language Models

Link mentioned: Orca-AgentInstruct: Agentic flows can be effective synthetic-data generators: Orca-AgentInstruct, from Microsoft Research, can generate diverse, high-quality synthetic data at scale to post-train and fine-tune base LLMs for expanded capabilities, continual learning, and increas...


Mozilla AI ▷ #announcements (2 messages):

  • Local LLM Workshop
  • SQLite-Vec Metadata Filtering
  • Autonomous AI Agents
  • Landing Page Development Assistance





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}