Frozen AI News archive

The world's first fully autonomous AI Engineer

**Cognition Labs's Devin** is highlighted as a potentially groundbreaking AI software engineer agent capable of learning unfamiliar technologies, addressing bugs, deploying frontend apps, and fine-tuning its own AI models. It integrates **OpenAI's GPT-4** with reinforcement learning and features tools like asynchronous chat, browser, shell access, and an IDE. The system claims advanced long-term reasoning and planning abilities, attracting praise from investors like **Patrick Collison** and **Fred Ehrsam**. The technology is noted for its potential as one of the most advanced AI agents, sparking excitement about agents and AGI.

Warm welcome to the >3000 people who joined from Andrej's shoutout! As we said last time, this is a side project that we're kind of embarrassed by but we are honored and hope you find this as useful as we do. The email has gotten unwieldy (originally this was only recapping the LS discord) and the plan is to move sections of this off to a more dedicated news service + offer personalization.

Cognition Labs's Devin is the headline AI news of the day - on the surface one of many, many "AI software engineer" startups - but the difference is in the execution:

These are all very big claims, and if generally true rather than cherrypicked, would almost certainly qualify as one of the most advanced AI agents the world has ever seen. This should of course attract skepticism, especially since only prerecorded videos were released, but credible investors like Patrick Collison and Fred Ehrsam, and beta testers like Varun and Andrew, have praised the live demos.

Details are scarce:

image.png

And because the videos are all edited/sped up, it's unclear whether latency is a lasting concern or a temporary issue. Since Devin reports minutes worked, there's no real incentive to save time here apart from UX.

Overall though, people are excited about agents and AGI again, which is always cause for celebration.

image.png


Table of Contents

[TOC]

PART X: AI Twitter Recap

All recaps are done by Claude 3 Opus, lightly edited by swyx for now. We are working on antihallucination, NER, and context addition pipelines.

Advances in Language Models and Architectures

Retrieval Augmented Generation (RAG) and Tools

Multimodal AI and Video Understanding

Responsible AI and Bias

Memes and Humor


PART 0: Summary of Summaries of Summaries

Claude 3 Sonnet (14B?)

1. New AI Model Releases and Capabilities

2. Accelerating and Optimizing Large Language Models

3. Open Source AI Tools and Resources

4. Analyzing and Interpreting Large Language Models

Claude 3 Opus (>220B?)

ChatGPT (GPT4T)


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord Summary


Perplexity AI Discord Summary


Nous Research AI Discord Summary


LM Studio Discord Summary


OpenAI Discord Summary


HuggingFace Discord Summary

Hugging Face Introduces Handy New Features: Hugging Chat now lets users filter and search for Assistant names, and the Hugging Face blog includes a new "table of contents" for ease of access. All-time stats are now available in Hugging Face Spaces, enabling creators to assess their space's popularity more comprehensively.

WebGPU Poised to Accelerate In-Browser ML: @osanseviero from Hugging Face indicated that WebGPU could potentially speed up machine learning in browsers by up to 40 times.

Expanding Developer Resources and Learning: The latest releases of Transformers.js 2.16.0, Gradio 4.21, and Accelerate v0.28.0 bring developers new features. Additionally, a new course titled Machine Learning for Games was announced by @ThomasSimonini.

Cutting-Edge Tools and Model Discoveries Across Channels:

Concepts and Models Discussed for Practical AI Implementation:

Evolving the AI Discourse in Natural Language Processing:


Eleuther Discord Summary


LlamaIndex Discord Summary


LAION Discord Summary


Latent Space Discord Summary


Interconnects (Nathan Lambert) Discord Summary


CUDA MODE Discord Summary

Nvidia's Moat vs. Vulkan's Potential: Nvidia's dominance in the GPU landscape continues to be a point of fascination, with discussions describing Nvidia's competitive advantage and software edge as nearly insurmountable, even though a potential Vulkan backend for PyTorch poses a theoretical challenge. Users also noted the difficulty of working with Vulkan, whose setup and packaging problems are reminiscent of CUDA's. Meta's significant investment in AI infrastructure, with a 24k GPU cluster and a roadmap for 350,000 NVIDIA H100 GPUs, reinforces Nvidia's dominance in the field (Meta's GenAI Infrastructure Article).

Triton Community Gathers: The Triton programming language community is preparing for an upcoming meetup on 3/28 at 10 AM PT. Interaction with the community and information about the meeting can be accessed through the Triton Lang Slack channel and its GitHub discussions page.

CUDA Development Insights and Tips: Discussions related to CUDA included the benefits of thread coarsening for enhanced performance, the optimization of Visual Studio Code for CUDA development, and suggestions for learning specific CUDA data types and threads. A detailed c_cpp_properties.json configuration setup for VS Code was shared, highlighting necessary includes for CUDA toolkit and PyTorch.
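The c_cpp_properties.json that was shared is not reproduced in this recap, so the following is only a minimal sketch of what such a file might contain. The helper name `build_c_cpp_properties`, the CUDA install path, the PyTorch path, and the compiler settings are all assumptions for illustration; only the overall file layout (a `configurations` list with `includePath` entries, plus a `version` field) follows the VS Code C/C++ extension's format.

```python
import json
from pathlib import PurePosixPath


def build_c_cpp_properties(cuda_home, torch_root):
    """Return a dict laid out like VS Code's c_cpp_properties.json.

    cuda_home and torch_root are assumed install locations, not values
    taken from the discussion summarized above.
    """
    torch_include = PurePosixPath(torch_root) / "include"
    include_path = [
        "${workspaceFolder}/**",
        # CUDA toolkit headers (cuda_runtime.h and friends).
        str(PurePosixPath(cuda_home) / "include"),
        # PyTorch C++ headers (ATen, c10, torch/extension.h).
        str(torch_include),
        # Headers for the PyTorch C++ frontend (torch/torch.h).
        str(torch_include / "torch" / "csrc" / "api" / "include"),
    ]
    return {
        "configurations": [
            {
                "name": "Linux",
                "includePath": include_path,
                "defines": [],
                "compilerPath": "/usr/bin/gcc",  # assumed compiler location
                "cStandard": "c17",
                "cppStandard": "c++17",
                "intelliSenseMode": "linux-gcc-x64",
            }
        ],
        "version": 4,
    }


if __name__ == "__main__":
    # Both paths below are placeholders; substitute your own installation.
    config = build_c_cpp_properties(
        "/usr/local/cuda", "/path/to/site-packages/torch"
    )
    print(json.dumps(config, indent=4))
```

Saving the printed JSON as `.vscode/c_cpp_properties.json` in a project should let IntelliSense resolve CUDA and PyTorch headers, provided the placeholder paths are replaced with real ones.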

PyTorch Ecosystem Active Discussions: Within the PyTorch community, questions were raised regarding the performance differences between libtorch and load_inline, clarification on the role of Modular in optimizing kernel compatibility with GPU architectures, and an open call for feedback on torchao RFC #47 to simplify the integration of new quantization algorithms and data types.

NVIDIA Innovations and Training Resources: The CUDA community touched upon NVIDIA's leading-edge techniques like Stream-K and Graphene IR, which promise significant speedups and optimizations in matrix multiplication on GPUs, and shared a link to the CUTLASS repository (NVIDIA Stream-K Example). For CUDA learners, a comprehensive CUDA Training Series on YouTube, along with its associated GitHub materials, was recommended (CUDA Training Series GitHub).
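For readers unfamiliar with Stream-K, the basic idea can be sketched in plain Python: rather than giving each thread block whole output tiles, the total number of inner-loop (multiply-accumulate) iterations across all tiles is divided evenly among a fixed set of thread blocks, so a block may finish one tile partway and continue into the next. This is a simplified illustration only, not the CUTLASS implementation; the function name `stream_k_partition` and its parameters are hypothetical, and the fix-up step that combines partial tile results is left out.

```python
def stream_k_partition(num_tiles, iters_per_tile, num_ctas):
    """Split all MAC-loop iterations evenly across CTAs (basic Stream-K idea).

    Returns one list per CTA of (tile_index, k_begin, k_end) segments, where
    k_begin and k_end are iteration offsets inside that output tile.
    """
    total_iters = num_tiles * iters_per_tile
    base, extra = divmod(total_iters, num_ctas)
    schedule = []
    start = 0
    for cta in range(num_ctas):
        # The first `extra` CTAs take one additional iteration.
        end = start + base + (1 if cta < extra else 0)
        segments = []
        it = start
        while it < end:
            tile = it // iters_per_tile
            k_begin = it % iters_per_tile
            # Stop at the tile boundary or at the end of this CTA's share,
            # whichever comes first.
            k_end = min(iters_per_tile, k_begin + (end - it))
            segments.append((tile, k_begin, k_end))
            it += k_end - k_begin
        schedule.append(segments)
        start = end
    return schedule


if __name__ == "__main__":
    # 3 output tiles, 4 k-iterations each, shared by 2 CTAs.
    for cta, segments in enumerate(stream_k_partition(3, 4, 2)):
        print(cta, segments)
```

With 3 tiles and 2 thread blocks, a tile-per-block schedule would leave one block idle during the second wave, whereas here each block gets 6 iterations and the middle tile is split between them. Tiles touched by more than one block need their partial sums combined afterwards, which is the cost the technique trades for better load balance.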

PMPP and Other Learning Resources: The "Programming Massively Parallel Processors" (PMPP) book was noted for not extensively covering profiling tools, with ancillary content available through associated YouTube videos. Additional CUDA coursework concerns were addressed, including questions about spacing in CUDA C++ syntax and exercise solutions for the PMPP 2023 edition.

Ring Attention Troubleshooting and Coordination: A user offered GPU availability for stress testing ring attention code and coordinated meeting times aligned with US daylight saving changes, while seeking advice after encountering high training loss. WANDB was used as an evaluation tool for training sessions.

Off-topic Rumors and AI Developments: Speculative discussions about Inflection AI and Claude-3 led to clarification via a debunking tweet. A cryptic image sparked curiosity, and attention was drawn to a new AI software engineer named Devin, developed by Cognition Labs, which promises new benchmarks in software engineering, with a real-world test publicized by @itsandrewgao (Andrew Kean Gao's Tweet).


LangChain AI Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


OpenRouter (Alex Atallah) Discord Summary


DiscoResearch Discord Summary


Alignment Lab AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


Skunkworks AI Discord Summary


AI Engineer Foundation Discord Summary


PART 2: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (237 messages🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #welcome (12 messages🔥):


Unsloth AI (Daniel Han) ▷ #random (9 messages🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (272 messages🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (10 messages🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #suggestions (4 messages):

Links mentioned:


Perplexity AI ▷ #general (424 messages🔥🔥🔥):

Links mentioned:


Perplexity AI ▷ #sharing (17 messages🔥):


Perplexity AI ▷ #pplx-api (9 messages🔥):


Nous Research AI ▷ #off-topic (24 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (6 messages):

Links mentioned:


Nous Research AI ▷ #general (267 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (120 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #collective-cognition (3 messages):


LM Studio ▷ #💬-general (207 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (96 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🎛-hardware-discussion (63 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (6 messages):


LM Studio ▷ #autogen (1 message):


LM Studio ▷ #memgpt (1 message):


LM Studio ▷ #amd-rocm-tech-preview (14 messages🔥):


LM Studio ▷ #crew-ai (1 message):


OpenAI ▷ #ai-discussions (248 messages🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (29 messages🔥):


OpenAI ▷ #prompt-engineering (55 messages🔥🔥):

Links mentioned:


OpenAI ▷ #api-discussions (55 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #announcements (1 message):

Links mentioned:


HuggingFace ▷ #general (149 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (2 messages):

Links mentioned:

wav2vec2-codebook-indices/scripts/helpers/w2v2_codebook.py at master · fauxneticien/wav2vec2-codebook-indices: Contribute to fauxneticien/wav2vec2-codebook-indices development by creating an account on GitHub.


HuggingFace ▷ #cool-finds (11 messages🔥):

Links mentioned:

"A fun project over the last…" (https://huggingface.co/posts/chansung/716968829982789): no description found


HuggingFace ▷ #i-made-this (19 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (9 messages🔥):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (2 messages):

Links mentioned:

wav2vec2-codebook-indices/scripts/helpers/w2v2_codebook.py at master · fauxneticien/wav2vec2-codebook-indices: Contribute to fauxneticien/wav2vec2-codebook-indices development by creating an account on GitHub.


HuggingFace ▷ #computer-vision (23 messages🔥):


HuggingFace ▷ #NLP (17 messages🔥):

Links mentioned:


Eleuther ▷ #general (105 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (68 messages🔥🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (3 messages):

Links mentioned:


Eleuther ▷ #lm-thunderdome (2 messages):

Links mentioned:


Eleuther ▷ #multimodal-general (2 messages):


Eleuther ▷ #gpt-neox-dev (1 message):


LlamaIndex ▷ #announcements (1 message):

Links mentioned:

LlamaIndex Webinar: Long-Term, Self-Editing Memory with MemGPT · Zoom · Luma: Long-term memory for LLMs is an unsolved problem, and doing naive retrieval from a vector database doesn’t work. The recent iteration of MemGPT (Packer et al.) takes a big step in this...


LlamaIndex ▷ #blog (5 messages):

Links mentioned:

Local & open-source AI developer meetup (Paris) · Luma: Ollama and Friends are in Paris! Ollama and Friends will be hosting a local & open-source AI developer meetup on Thursday, March 21st at 6pm at Station F in Paris. Come gather with developers...


LlamaIndex ▷ #general (162 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (3 messages):

Links mentioned:


LAION ▷ #general (65 messages🔥🔥):

Links mentioned:


LAION ▷ #research (75 messages🔥🔥):

Links mentioned:


LAION ▷ #learning-ml (1 message):

Links mentioned:

Download & stream 400M images + text - a Lightning Studio by thomasgridai: Use, explore, & create from scratch the LAION-400-MILLION images & captions dataset.


Latent Space ▷ #ai-general-chat (117 messages🔥🔥):

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (59 messages🔥🔥):

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (34 messages🔥):


Interconnects (Nathan Lambert) ▷ #random (15 messages🔥):

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (2 messages):


CUDA MODE ▷ #general (38 messages🔥):

Links mentioned:


CUDA MODE ▷ #triton (3 messages):

Links mentioned:


CUDA MODE ▷ #cuda (13 messages🔥):


CUDA MODE ▷ #torch (4 messages):

Links mentioned:

[RFC] Plans for torchao · Issue #47 · pytorch-labs/ao: Summary Last year, we released pytorch-labs/torchao to provide acceleration of Generative AI models using native PyTorch techniques. Torchao added support for running quantization on GPUs, includin...


CUDA MODE ▷ #algorithms (4 messages):

Links mentioned:


CUDA MODE ▷ #suggestions (1 message):

Links mentioned:


CUDA MODE ▷ #beginner (8 messages🔥):


CUDA MODE ▷ #pmpp-book (12 messages🔥):


CUDA MODE ▷ #ring-attention (8 messages🔥):

Links mentioned:

iron-bound: Weights & Biases, developer tools for machine learning


CUDA MODE ▷ #off-topic (6 messages):

Links mentioned:


LangChain AI ▷ #general (67 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #langserve (5 messages):

Links mentioned:

Refactor Anthropic import to langchain_anthropic and update model to v3 by donbr · Pull Request #524 · langchain-ai/langserve: Transition Anthropic API import to the langchain_anthropic package for enhanced compatibility. Upgrade the AI model to claude-3-sonnet-20240229 for improved performance and features.


LangChain AI ▷ #langchain-templates (1 message):


LangChain AI ▷ #share-your-work (2 messages):

Links mentioned:


LangChain AI ▷ #tutorials (4 messages):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (40 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (10 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #community-showcase (2 messages):


OpenRouter (Alex Atallah) ▷ #general (63 messages🔥🔥):

Links mentioned:


DiscoResearch ▷ #disco_judge (6 messages):


DiscoResearch ▷ #general (7 messages):

Links mentioned:


DiscoResearch ▷ #benchmark_dev (3 messages):

Links mentioned:

tinyBenchmarks (tinyBenchmarks): no description found


DiscoResearch ▷ #discolm_german (2 messages):


Alignment Lab AI ▷ #general-chat (1 message):


Alignment Lab AI ▷ #oo (5 messages):

Links mentioned:

GitHub - mermaid-js/mermaid: Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown: Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown - mermaid-js/mermaid


Alignment Lab AI ▷ #oo2 (11 messages🔥):


LLM Perf Enthusiasts AI ▷ #general (10 messages🔥):

Links mentioned:

Bing (必应): no description found


LLM Perf Enthusiasts AI ▷ #gpt4 (1 message):


LLM Perf Enthusiasts AI ▷ #opensource (2 messages):

Links mentioned:

Tweet from Elon Musk (@elonmusk): This week, @xAI will open source Grok


Skunkworks AI ▷ #general (1 message):


Skunkworks AI ▷ #off-topic (2 messages):

Links mentioned:


AI Engineer Foundation ▷ #general (1 message):

Links mentioned:

Guide to Submit Projects to AI Engineer Foundation: no description found