
Replit Agent - How did everybody beat Devin to market?

**Replit Agent** launched as a fully integrated Web IDE enabling text-to-app generation with planning and self-healing, available immediately to paid users without a waitlist. Other notable developments include **Melodio**, a new text-to-music model, and **Together AI**'s kernel and speculative decoding work. **Anthropic AI** announced a new enterprise plan featuring a **500K context window** and enhanced security. Discussions highlighted the **JPEG-LM** and **AVC-LM** models for improved image and video generation, as well as GPU market trends around **H100 GPU** pricing. Influential voices like **Andrej Karpathy** shared insights on AI agents and automation.


AI News for 9/4/2024-9/5/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (214 channels, and 2723 messages) for you. Estimated reading time saved (at 200wpm): 303 minutes. You can now tag @smol_ai for AINews discussions!

A packed day. The annual Time 100 AI outrage piece. Maitai, AnythingLLM, Laminar launched. Melodio - new text-to-music model. Together AI announced some kernel work and speculative decoding work. Andrej Karpathy on a podcast. $2000/mo ChatGPT. We very nearly featured Matt Shumer + Sahil Chaudhary's Reflection-Tuned finetune of Llama 3.1 70B as today's title story, but the 405B + paper is coming next week, so we will just give you a heads up that it is coming.

The big launch of the day is Replit Agent.


If you've been paying attention to the coding agent company launches - like Claude Artifacts, Cursor Composer, Val.town Townie, Cosine Genie, Honeycomb, and even the You.com pivot yesterday - this is pretty much what you'd expect Replit to do, just very, very well executed: full text-to-running-app generation with planning and self-healing. What's laudable is the lack of a waitlist - it is live today for paid users - and it can deploy to a live URL with a Postgres backend, even for people who cannot code, including from your phone. Of course, Replit Agent can even make a Replit clone.

There are unfortunately no benchmarks or even blog posts to write about, which makes our job simple. Watch the video, try it, or scroll on.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Development and Models

AI Tools and Technologies

Philosophy and Ethics in AI

Community and Collaboration


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. GitHub's Automated Flagging: Impact on AI Model Repositories

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Research and Development

AI Funding and Competition

AI Image Generation


AI Discord Recap

A summary of Summaries of Summaries by Claude 3.5 Sonnet

1. LLM Advancements and Benchmarking

2. AI Industry News and Funding

3. Multimodal AI Innovations


PART 1: High level Discord summaries

Stability.ai (Stable Diffusion) Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


HuggingFace Discord


LM Studio Discord


Nous Research AI Discord


Interconnects (Nathan Lambert) Discord


Modular (Mojo 🔥) Discord


OpenRouter (Alex Atallah) Discord


Perplexity AI Discord


CUDA MODE Discord


Eleuther Discord


Latent Space Discord


OpenInterpreter Discord


Torchtune Discord


Cohere Discord


LlamaIndex Discord


LangChain AI Discord


LAION Discord


tinygrad (George Hotz) Discord


OpenAccess AI Collective (axolotl) Discord


DSPy Discord


DiscoResearch Discord


LLM Finetuning (Hamel + Dan) Discord


MLOps @Chipro Discord


Gorilla LLM (Berkeley Function Calling) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Stability.ai (Stable Diffusion) ▷ #general-chat (321 messages🔥🔥):

  • Model Recommendations for Hash Rosin
  • Using ControlNet with ComfyUI
  • Installations and Model Pairings
  • Technical Challenges and Performance
  • Cloud Computing Options

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (254 messages🔥🔥):

  • Y Combinator backing
  • Unsloth Model Updates
  • Model Recommendations
  • Fine-tuning with Dora
  • Reflection Llama-3.1 70B

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (35 messages🔥):

  • Back to school humor
  • Age perception in conversation
  • Discussion about age
  • Meme sharing

Link mentioned: Fivie Kuu0001 GIF - Fivie Kuu0001 Lynxdenis - Discover & Share GIFs: Click to view the GIF


Unsloth AI (Daniel Han) ▷ #help (18 messages🔥):

  • Trailing Newline Issue in Vim
  • Using Unsloth for Text Summarization
  • Running Unsloth Locally for Private Data
  • Gemma Model Comparisons
  • Finetuning Chatbots with User Data

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

rodrigo_meireles: Do you have some report comparing them somehow? Would be interesting to read.


Unsloth AI (Daniel Han) ▷ #community-collaboration (2 messages):

  • Channel Etiquette
  • Suggested Channels

Unsloth AI (Daniel Han) ▷ #research (4 messages):

  • Ilya's billion-dollar funding
  • LLMs scaling and AGI
  • Reasoning and planning in LLMs

OpenAI ▷ #ai-discussions (259 messages🔥🔥):

  • AI Consciousness Debate
  • Perplexity Use Case
  • Gemini Performance
  • OpenAI Subscription Growth
  • UI Changes in ChatGPT

Links mentioned:


OpenAI ▷ #gpt-4-discussions (3 messages):

  • GPT response issues
  • Icons disappearing
  • Browser compatibility
  • App frustrations

OpenAI ▷ #prompt-engineering (25 messages🔥):

  • Font Issues
  • AI Author Imitation
  • Exporting Outputs with Errors
  • Tool Calls in Prompts

OpenAI ▷ #api-discussions (25 messages🔥):

  • Font Missing Issues
  • Incorporating Tool Calls
  • Character Encoding Errors
  • API Response Consistency
  • Creating Effective Tool Chains

HuggingFace ▷ #announcements (1 messages):

  • Vision Language Models
  • Optimization of Tau LLM
  • InkubaLM-0.4B Release
  • Shadowbox Tool
  • Selective Fine-tuning

HuggingFace ▷ #general (193 messages🔥🔥):

  • Comparison of Coding Models
  • Transformer Attention Explainer
  • Evaluation of Code Generation
  • Coding Benchmark Quality
  • Future Research Ideas in Code Generation

Links mentioned:


HuggingFace ▷ #today-im-learning (8 messages🔥):

  • Residual Connections in AI
  • Jeeds Agent Models
  • Transformers and Attention Mechanism
  • Python Microservice with Ollama

HuggingFace ▷ #i-made-this (6 messages):

  • GPT4FREE
  • Kyber Odyssey Encryption Implementation
  • Yi-Coder Release
  • Advancements in Vision Language Models
  • Minimalist UI for Comfy

Links mentioned:


HuggingFace ▷ #reading-group (1 messages):

quantaussie99: Have to read this… I don’t get it


HuggingFace ▷ #computer-vision (3 messages):

  • Tracking Algorithms for Multi-Object Tracking
  • Retrieving Screen Items from Internal Data
  • Running BLIP-2 on AWS SageMaker

HuggingFace ▷ #NLP (2 messages):

  • Qwen2-VL-7B-Instruct
  • requirements.txt update
  • fp16 performance

Link mentioned: hperkins/Qwen2-VL-7B-Instruct at main: no description found


HuggingFace ▷ #diffusion-discussions (2 messages):

  • PixArt-Alpha performance
  • FluxImageToImagePipeline availability

LM Studio ▷ #general (111 messages🔥🔥):

  • LM Studio 0.3.2 Issues
  • Image API Providers
  • Reflection 70B Model
  • Change in UI Elements
  • Advanced Model Techniques

Links mentioned:


LM Studio ▷ #hardware-discussion (60 messages🔥🔥):

  • Mac RAM and storage needs
  • Local server versus cloud options
  • Raspberry Pi and LMStudio compatibility
  • Performance of RTX 3060 for inference
  • NAS advantages for Apple users

Links mentioned:


Nous Research AI ▷ #general (140 messages🔥🔥):

  • Reflection-Tuning Techniques
  • Hermes Model Speculations
  • Fine-tuning LLMs
  • Dataset Creation for AI Models
  • Nvidia Driver Issues

Links mentioned:


Nous Research AI ▷ #ask-about-llms (20 messages🔥):

  • Mamba API inquiries
  • Mergekit issues
  • Scaling and LLM reasoning
  • Llama 3.1 utilization
  • Open reasoning tasks

Link mentioned: bartowski/Meta-Llama-3.1-8B-Instruct-GGUF at main: no description found


Nous Research AI ▷ #research-papers (2 messages):

  • Falcon Mamba release
  • Loopy video diffusion model

Links mentioned:


Nous Research AI ▷ #interesting-links (1 messages):

adjectiveallison: https://github.com/Cognitive-AI-Systems/MAPF-GPT


Nous Research AI ▷ #research-papers (2 messages):

  • Falcon Mamba Model
  • Loopy Video Diffusion Model

Links mentioned:


Nous Research AI ▷ #reasoning-tasks (1 messages):

  • Ilya's fundraising for AGI
  • Scaling and LLM reasoning

Interconnects (Nathan Lambert) ▷ #news (11 messages🔥):

  • xAI's GPU Cluster
  • Unsloth backed by Y Combinator
  • Reflection Llama-3.1
  • Intrinsic-self correction technique

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (6 messages):

  • Reasoning Datasets
  • HuggingFace Numina
  • MATH Benchmark
  • GSM8k Benchmark
  • CHAMP Dataset

Link mentioned: CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities: Recent large language models (LLMs) have shown indications of mathematical reasoning ability on challenging competition-level problems, especially with self-generated verbalizations of intermediate re...


Interconnects (Nathan Lambert) ▷ #random (42 messages🔥):

  • Cursor chat documentation
  • QwenLM GitHub disappearance
  • Model naming confusion with OpenAI
  • Vendors for Llama fine-tuning
  • Artificial analysis and new image models

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (74 messages🔥🔥):

  • Autoformalization in AI
  • Superhuman AI Mathematicians by 2026
  • OpenAI's Pricing Strategy
  • Google's Challenges with AI Deployment
  • SnailBot's Performance

Links mentioned:


Modular (Mojo 🔥) ▷ #announcements (1 messages):

  • Magic package manager
  • MAX and Mojo integration
  • Conda ecosystem
  • Virtual environment management

Link mentioned: Get started with Magic | Modular Docs: Magic is a package manager and virtual environment manager for MAX and Mojo


Modular (Mojo 🔥) ▷ #mojo (117 messages🔥🔥):

  • Mojo performance
  • Async function support
  • Memory management in Mojo
  • Mojo standard library enhancements
  • Compiler and debugging tools

Links mentioned:


Modular (Mojo 🔥) ▷ #max (8 messages🔥):

  • Model Serialization Format
  • Containerization Techniques
  • MAX Engine Support

Link mentioned: max/examples/graph-api at main · modularml/max: A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform - modularml/max


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

  • Bank Account Expansion
  • Infinite Dilution Concept

OpenRouter (Alex Atallah) ▷ #general (91 messages🔥🔥):

  • Opus vs Sonnet Performance
  • DeepSeek V2.5 Release
  • Reflection 70B Announcement
  • Claude Caching Feature
  • Model Throughput Comparisons

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (5 messages):

  • AI Studio key issues
  • Bug reports
  • Activity logging

Perplexity AI ▷ #general (77 messages🔥🔥):

  • Perplexity subscription offers
  • Referral program details
  • Changes in membership
  • Merchandise promotions for students
  • Technical support and inquiries

Links mentioned:


Perplexity AI ▷ #sharing (10 messages🔥):

  • World's Most Powerful Supercomputer
  • Benefits of Cold Showers
  • Memory Storage in the Brain
  • Oldest Known Board Game
  • Dark Souls Innovations

Perplexity AI ▷ #pplx-api (2 messages):

  • File Upload Implementation with Perplexity API
  • Configuring Perplexity API Requests

CUDA MODE ▷ #general (36 messages🔥):

  • Cursor game change
  • AI coding tools
  • vLLM Open Office Hours
  • Reflection 70B announcement
  • SaaS startups leveraging AI

Link mentioned: Tweet from Matt Shumer (@mattshumer_): I'm excited to announce Reflection 70B, the world’s top open-source model. Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes. 405B coming next week ...


CUDA MODE ▷ #triton (6 messages):

  • MLIR_DEBUGGING
  • Triton Environment Variables

Link mentioned: GitHub - triton-lang/triton at 7480ef5028b724cb434b7841b016c6d6debf3b84: Development repository for the Triton language and compiler - GitHub - triton-lang/triton at 7480ef5028b724cb434b7841b016c6d6debf3b84


CUDA MODE ▷ #beginner (2 messages):

  • Optimization techniques for convolution
  • Memory access patterns in CUDA

CUDA MODE ▷ #jax (1 messages):

  • Pallas kernels
  • Splash Attention kernel
  • Video primer on Pallas

Links mentioned:


CUDA MODE ▷ #off-topic (5 messages):

  • Llm.c Alternatives
  • AI Summit in Mumbai
  • Burnout Prevention Strategies

Link mentioned: Join NVIDIA AI Summit 2024: October 23–25, Mumbai, India


CUDA MODE ▷ #irl-meetup (1 messages):

  • Joining Tenstorrent
  • CUDA kernel development
  • CUDA Mode IRL event

CUDA MODE ▷ #hqq-mobius (17 messages🔥):

  • Performance Insights on Batch Sizes
  • Autotune Configurations from PyTorch
  • Triton Code Limitations with GROUP_M
  • GemV Implementation Challenges
  • Memory-Bound Performance Analysis

Link mentioned: pytorch/torch/_inductor/kernel/mm_common.py at main · pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration - pytorch/pytorch


CUDA MODE ▷ #llmdotc (4 messages):

  • Open Sora Implementation
  • Graphics Progress

CUDA MODE ▷ #cudamode-irl (1 messages):

  • Third Wave Delays
  • Inbox Notifications

CUDA MODE ▷ #liger-kernel (8 messages🔥):

  • Jupyter Notebook Versioning
  • Python Script for Benchmark Visualizations
  • Implementation of MoE Models

Eleuther ▷ #general (19 messages🔥):

  • MCTS Application in Image Tasks
  • Creative AI Workshops
  • Keyword-Driven Generative Models
  • Undergraduate Internships in Labs
  • Minimalist UI Development

Link mentioned: Tweet from Julien Blanchon (@JulienBlanchon): Trying to figure out how to fix Comfy 👀


Eleuther ▷ #research (52 messages🔥):

  • Inefficiency of Scaling Parameters
  • Transfusion Model Insights
  • Gradient Behavior During Training
  • Effects of Generative AI on Work
  • Numerical Stability in Optimizers

Links mentioned:


Eleuther ▷ #lm-thunderdome (2 messages):

  • Leaderboard IFeval
  • IFeval differences

Eleuther ▷ #multimodal-general (1 messages):

bennib2407: what’s the SOTA video captioning model?


Eleuther ▷ #gpt-neox-dev (1 messages):

  • RoPE Compatibility
  • Attention Output Discrepancies

Latent Space ▷ #ai-general-chat (68 messages🔥🔥):

  • SSI Inc funding
  • You.com funding
  • Karpathy insights
  • OpenAI pricing
  • Replit Agent launch

Links mentioned:


OpenInterpreter ▷ #general (55 messages🔥🔥):

  • Open Interpreter Birthday
  • Teach Mode
  • Open Interpreter Repositories
  • AGI Discussion
  • Fulcra App Availability

Links mentioned:


OpenInterpreter ▷ #O1 (7 messages):

  • O1 recent demos
  • O1 shipping date
  • House Party event
  • Discord links

Torchtune ▷ #general (27 messages🔥):

  • Compile errors with PyTorch 2.4
  • Input padding performance
  • Memory footprint during training
  • Allocation of tokens in datasets
  • CI testing for torch.compile

Links mentioned:


Torchtune ▷ #dev (13 messages🔥):

  • DeepFusionModel Caches
  • Testing DeepFusionModel
  • Unsloth Backed by YC
  • Daniel Han's Contribution
  • Meta Employment Clarification

Link mentioned: [RFC] Adding overrides for max cache seq length by SalmanMohammadi · Pull Request #1449 · pytorch/torchtune: Context What is the purpose of this PR? Is it to add a new feature fix a bug update tests and/or documentation other (please add here) #1364 Changelog This PR: Adds support for overriding th...


Cohere ▷ #discussions (13 messages🔥):

  • System prompt optimization
  • Cohere updates
  • Community engagement

Link mentioned: The Cohere Blog: Explore our collection of insightful blog posts covering a diverse range of generative AI topics. Our articles offer in-depth analyses, expert opinions, and practical advice to inform and inspire.


Cohere ▷ #questions (8 messages🔥):

  • Text Suggestions Feature
  • LLM Agents for Report Generation
  • Cohere Usage Best Practices

Link mentioned: The Cohere Platform — Cohere: Cohere offers world-class Large Language Models (LLMs) like Command, Rerank, and Embed. These help developers and enterprises build LLM-powered applications such as conversational agents, summarizatio...


Cohere ▷ #projects (3 messages):

  • LLM Agents for Report Generation
  • OpenSesame 2.0 Launch

LlamaIndex ▷ #blog (3 messages):

  • Netchex AI using LlamaIndex
  • create-llama templates
  • llama-deploy microservices

LlamaIndex ▷ #general (20 messages🔥):

  • llama-index-experimental-param-tuner installation
  • Getting embedding vectors from ChromaVectorStore
  • Integrating Claude with LlamaIndex
  • Text-to-SQL functionality and embeddings
  • Optimizing prompts in RAG applications

Links mentioned:


LangChain AI ▷ #general (14 messages🔥):

  • Building AI Agents
  • Chatbot Development
  • ReAct Agent Deployment
  • Database Solutions for AI Agents

Links mentioned:


LangChain AI ▷ #share-your-work (2 messages):

  • Vision Language Models
  • CodeMaster App
  • Gamification in Learning
  • DSA Learning Techniques

Links mentioned:


LAION ▷ #general (10 messages🔥):

  • Comfy Rewrite Project
  • Complementary GUI for Comfy
  • SwarmUI
  • ComfyBox Project

Links mentioned:


LAION ▷ #research (6 messages):

  • Transfusion Model
  • Reflection 70B
  • Causal UNET Performance
  • Unified Multi-modal Models

Links mentioned:


tinygrad (George Hotz) ▷ #general (8 messages🔥):

  • Bounty Payments
  • Tinyboxes Rental Model
  • Pricing Models for Performance

tinygrad (George Hotz) ▷ #learn-tinygrad (6 messages):

  • phi operation in IR
  • UOps.UPDATE
  • cstyle renderer insights

Link mentioned: Kernel Fusion part 3: the linear layer UOps: Tutorials on tinygrad


OpenAccess AI Collective (axolotl) ▷ #general-help (7 messages):

  • Unsloth Phi to Llama Conversion
  • Challenges with Phi3
  • Small Model for Rapid Iteration
  • Dora Support in Axolotl

Link mentioned: DoRA Support · Issue #1328 · axolotl-ai-cloud/axolotl: ⚠️ Please check that this feature request hasn't been suggested before. I searched previous Ideas in Discussions didn't find any similar feature requests. I searched previous Issues didn't...


OpenAccess AI Collective (axolotl) ▷ #community-showcase (5 messages):

  • Llama-3.1-8B fine-tuning
  • Chemical Language Model
  • Molecule generation
  • DPO optimization
  • SmileyLlama

Links mentioned:


DSPy ▷ #show-and-tell (2 messages):

  • DSPy usecase list
  • Livecoding sessions

Link mentioned: Tweet from isaac 🧩 (@isaacbmiller1): What are people building with LMs? What are they deploying in production? @lateinteraction and I want to begin to answer that question through a DSPy lens. We compiled an initial list of nearly 10...


DSPy ▷ #papers (1 messages):

batmanosama: https://huggingface.co/papers/2409.02889


DSPy ▷ #general (1 messages):

  • ColPali
  • Visual Document Retrieval Benchmark

Link mentioned: ColPaLi: Efficient Document Retrieval with Contextualized Language Model: ColPaLi, a new document retrieval system, leverages Vision Language Models (VLMs) to efficiently handle visually rich documents. By combining visual and textual information, ColPaLi outperforms existi...


DiscoResearch ▷ #general (2 messages):

  • Multimodal LLMs
  • Training/Finetuning

LLM Finetuning (Hamel + Dan) ▷ #general (1 messages):

  • Meeting Transcription
  • Agent Workflows
  • Evaluation Challenges

MLOps @Chipro ▷ #events (1 messages):

  • AI Enterprise Summit
  • San Francisco event
  • Keynote speakers
  • Networking opportunities

Link mentioned: AI Realized – The Enterprise AI Summit · Luma: Christina Ellwood & David Yakobovitch Present... AI Realized Summit 2024 For Enterprise Executives, Entrepreneurs & AI Innovators. Join us in San Francisco…


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

huanzhimao: Thanks for the issue! Will take a look




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}