Frozen AI News archive

GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4T version)

**OpenAI** launched **GPT-4o**, a frontier model supporting real-time reasoning across **audio, vision, and text**, now free for all ChatGPT users with enhanced coding capabilities and upcoming advanced voice and video features. Discussions cover **open-source LLMs** like **Llama 3**, fine-tuning techniques including knowledge distillation for **GPT-3.5**, and hardware optimization strategies such as quantization. Emerging architectures include multimodal integrations with ChatGPT voice and Open Interpreter API, Mixture of Experts models combining autoregressive and diffusion approaches, and novel designs like the **YOCO architecture** and **ThunderKittens DSL** for efficient GPU use. Research advances in efficient attention methods like **Conv-Basis** using FFT and model scaling techniques such as depth upscaling were also highlighted.

Canonical issue URL

AI News for 5/10/2024-5/13/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (426 channels, and 7769 messages) for you. Estimated reading time saved (at 200wpm): 763 minutes.

As is tradition on Frontier Model days on AINews, we're publishing two editions of AINews. You're currently reading the one where all Part 1 and Part 2 summaries are done by GPT4T - the previous email was done with GPT4O and has the normal commentary. We envision that you will pull them up side by side to get comparisons on discords you care about to better understand the improvements/regressions.


Table of Contents

[TOC]


AI Discord Recap

A summary of Summaries of Summaries

Claude 3 Sonnet

1. GPT-4o Launch and Capabilities

2. Open Source LLM Exploration and Fine-tuning Techniques

3. Multimodal AI and Emerging Architectures

4. Advancements in Efficient Attention and Model Scaling

Claude 3 Opus

GPT4T (gpt-4-turbo-2024-04-09)

Major Themes and Discussions:

  1. AI Model Discussions and Comparisons: Substantial discourse is observed regarding the performance and specifications of various AI models like GPT-4, GPT-4o, Llama models, and more across several Discords. Users express mixed feelings about model performance, specializing in tasks like model training, comparison between new releases, and integration.

  2. Technological Innovations and Updates: Several channels report on updates regarding new functionalities, integrations, and technological advancements such as multimodal capabilities, changes in tokenizer, and speed enhancements. Updates from tech giants and community programmers are evaluated and dissected.

  3. Community Engagement and Project Collaborations: Robust discussions are evident around engaging community in collaborative projects, contributing to open-source repositories, or sharing custom projects. Such engagements span coding practices, developing AI utilities, or solving complex AI-driven tasks.

  4. Educational Content and Tutorials: A notable amount of educational content, tutorials, and discussions aimed at disseminating knowledge about AI technologies, programming, model training, etc., are shared. Links to academic papers, YouTube videos, and detailed blog posts are common as users seek to deepen their understanding or explain concepts to peers.

  5. Privacy, Legal, and Ethical Concerns: Several discussions touch upon the privacy implications of using AI technologies, concerns about data usage, legal implications of AI-generated content, and ethical considerations. Legal discussions in particular span a range of topics from artist rights in generated content to implications of AI in existing legal frameworks.

Key Knowledge Sharing and Resources:

GPT4O (gpt-4o-2024-05-13)

1. Model Performance and Releases

2. Technical Challenges and Solutions

3. AI Integration and Enhancements

4. Industry Trends and Events

5. Ethics and Legal Concerns

6. Educational and Support Resources


Detailed by-Channel Summaries and Links:

Unsloth AI (Daniel Han) ▷ General

Stability.ai (Stable Diffusion) ▷ General-Chat

OpenAI ▷ General Discussions

LangChain AI ▷ General

OpenRouter (Alex Atallah) ▷ General

HuggingFace ▷ General

For more detailed summaries and links, refer to the full compiled guide above.


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


OpenAI Discord


Nous Research AI Discord


Latent Space Discord


Perplexity AI Discord


HuggingFace Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


CUDA MODE Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


LAION Discord


LangChain AI Discord


LlamaIndex Discord


OpenAccess AI Collective (axolotl) Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


Cohere Discord


Datasette - LLM (@SimonW) Discord


Mozilla AI Discord


DiscoResearch Discord


LLM Perf Enthusiasts AI Discord


Alignment Lab AI Discord


AI Stack Devs (Yoko Li) Discord


Skunkworks AI Discord

The provided text does not contain enough information for a meaningful summary.


YAIG (a16z Infra) Discord

Apologies, but a summarized report cannot be generated for this channel. The provided message "Agree!" from user "pranay01" lacks sufficient context and substantive content to be included in a technical summary.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (834 messages🔥🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #random (15 messages🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (312 messages🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Link mentioned: LlamaForTokenClassification - a SauravMaheshkar Collection: no description found


Stability.ai (Stable Diffusion) ▷ #general-chat (976 messages🔥🔥🔥):

Links mentioned:


OpenAI ▷ #annnouncements (2 messages):


OpenAI ▷ #ai-discussions (689 messages🔥🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (126 messages🔥🔥):


OpenAI ▷ #prompt-engineering (32 messages🔥):


OpenAI ▷ #api-discussions (32 messages🔥):


OpenAI ▷ #api-projects (2 messages):


Nous Research AI ▷ #ctx-length-research (1 messages):

king.of.kings_: i am struggling to get llama 3 70b to be coherent over 8k tokens lol


Nous Research AI ▷ #off-topic (16 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (6 messages):

Links mentioned:


Nous Research AI ▷ #general (741 messages🔥🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (48 messages🔥):

Links mentioned:


Nous Research AI ▷ #rag-dataset (5 messages):

Links mentioned:


Nous Research AI ▷ #world-sim (22 messages🔥):

Links mentioned:


Latent Space ▷ #ai-general-chat (94 messages🔥🔥):

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

Link mentioned: Join the Latent Space (née /dev/invest) Discord Server!: Check out the Latent Space (née /dev/invest) community on Discord - hang out with 3747 other members and enjoy free voice and text chat.


Latent Space ▷ #llm-paper-club-west (710 messages🔥🔥🔥):

Links mentioned:


Perplexity AI ▷ #general (674 messages🔥🔥🔥):

Links mentioned:


Perplexity AI ▷ #sharing (21 messages🔥):

Link mentioned: Alexandr Yarats, Head of Search at Perplexity – Interview Series: Alexandr Yarats is the Head of Search at Perplexity AI. He began his career at Yandex in 2017, concurrently studying at the Yandex School of Data Analysis. The initial years were intense yet rewarding...


Perplexity AI ▷ #pplx-api (4 messages):


HuggingFace ▷ #general (389 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (3 messages):

Links mentioned:


HuggingFace ▷ #cool-finds (10 messages🔥):

Links mentioned:


HuggingFace ▷ #i-made-this (7 messages):

Links mentioned:


HuggingFace ▷ #reading-group (2 messages):

Link mentioned: You Only Cache Once: Decoder-Decoder Architectures for Language Models: We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once. It consists of two components, i.e., a cross-decoder stacked upon a self-decoder. ...


HuggingFace ▷ #computer-vision (6 messages):

Links mentioned:


HuggingFace ▷ #NLP (7 messages):


HuggingFace ▷ #diffusion-discussions (14 messages🔥):

Links mentioned:


LM Studio ▷ #💬-general (185 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (92 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (4 messages):


LM Studio ▷ #⚙-configs-discussion (7 messages):

Link mentioned: Shoo Go Away GIF - Shoo Go Away Johnny Depp - Discover & Share GIFs: Click to view the GIF


LM Studio ▷ #🎛-hardware-discussion (106 messages🔥🔥):


LM Studio ▷ #🧪-beta-releases-chat (12 messages🔥):

Link mentioned: Big Code Models Leaderboard - a Hugging Face Space by bigcode: no description found


LM Studio ▷ #memgpt (4 messages):


LM Studio ▷ #amd-rocm-tech-preview (2 messages):


LM Studio ▷ #open-interpreter (4 messages):


LM Studio ▷ #model-announcements (1 messages):

Links mentioned:


LM Studio ▷ #🛠-dev-chat (19 messages🔥):

Link mentioned: Introducing lms - LM Studio's companion cli tool | LM Studio: Today, alongside LM Studio 0.2.22, we're releasing the first version of lms — LM Studio's companion cli tool.


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (254 messages🔥🔥):

Links mentioned:


Modular (Mojo 🔥) ▷ #general (65 messages🔥🔥):

Link mentioned: PEP 604 – Allow writing union types as X | Y | peps.python.org: no description found


Modular (Mojo 🔥) ▷ #💬︱twitter (1 messages):

ModularBot: From Modular: https://twitter.com/Modular/status/1790046377613144201


Modular (Mojo 🔥) ▷ #📺︱youtube (1 messages):


Modular (Mojo 🔥) ▷ #🔥mojo (85 messages🔥🔥):

Links mentioned:


Modular (Mojo 🔥) ▷ #performance-and-benchmarks (1 messages):

Link mentioned: GitHub - dorjeduck/mostring: variations over StringBuilder ideas in Mojo: variations over StringBuilder ideas in Mojo. Contribute to dorjeduck/mostring development by creating an account on GitHub.


Modular (Mojo 🔥) ▷ #nightly (64 messages🔥🔥):

Links mentioned:


CUDA MODE ▷ #general (5 messages):


CUDA MODE ▷ #triton (43 messages🔥):

Links mentioned:


CUDA MODE ▷ #cuda (9 messages🔥):

Links mentioned:


CUDA MODE ▷ #announcements (1 messages):

Link mentioned: Join our Cloud HD Video Meeting: Zoom is the leader in modern enterprise video communications, with an easy, reliable cloud platform for video and audio conferencing, chat, and webinars across mobile, desktop, and room systems. Zoom ...


CUDA MODE ▷ #algorithms (1 messages):

random_string_of_character: https://arxiv.org/abs/2405.05219


CUDA MODE ▷ #beginner (14 messages🔥):

Links mentioned:


CUDA MODE ▷ #pmpp-book (1 messages):

Link mentioned: Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


CUDA MODE ▷ #off-topic (5 messages):


CUDA MODE ▷ #irl-meetup (1 messages):

boxxy_ms: anyone in Toronto?


CUDA MODE ▷ #triton-puzzles (2 messages):


CUDA MODE ▷ #llmdotc (67 messages🔥🔥):

Links mentioned:


CUDA MODE ▷ #lecture-qa (48 messages🔥):

Links mentioned:


CUDA MODE ▷ #youtube-watch-party (5 messages):

Link mentioned: ECE408: Applied Parallel Programming, Spring 2019 ZJUI Section: no description found


Eleuther ▷ #general (61 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (79 messages🔥🔥):

Links mentioned:


Eleuther ▷ #scaling-laws (7 messages):

Links mentioned:


Eleuther ▷ #interpretability-general (3 messages):


Eleuther ▷ #gpt-neox-dev (1 messages):

oleksandr07173: Hello


Interconnects (Nathan Lambert) ▷ #news (120 messages🔥🔥):

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (1 messages):

Link mentioned: PPO / Reinforce Trainers by vwxyzjn · Pull Request #1540 · huggingface/trl: This RP supports the REINFORCE RLOO trainers in https://arxiv.org/pdf/2402.14740.pdf. Note that REINFORCE's loss is a special case of PPO, as shown below it matches the REINFORCE loss presented i...


Interconnects (Nathan Lambert) ▷ #random (5 messages):


Interconnects (Nathan Lambert) ▷ #reads (11 messages🔥):


LAION ▷ #general (109 messages🔥🔥):

Links mentioned:


LAION ▷ #research (5 messages):

Link mentioned: Tweet from LAION (@laion_ai): Wanna train transformers with audio as if it was text? - Here is how. :) https://youtu.be/NwZufAJxmMA https://discord.gg/6jWrFngyPe


LangChain AI ▷ #general (105 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #share-your-work (4 messages):

Links mentioned:


LangChain AI ▷ #tutorials (3 messages):

Links mentioned:


LlamaIndex ▷ #blog (8 messages🔥):

Links mentioned:


LlamaIndex ▷ #general (89 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (3 messages):

Link mentioned: Knowledge Distillation for Fine-Tuning a GPT-3.5 Judge: Enhancing Accuracy and Performance : no description found


OpenAccess AI Collective (axolotl) ▷ #general (30 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #general-help (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (10 messages🔥):

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search: Understand code, faster.


OpenAccess AI Collective (axolotl) ▷ #axolotl-phorm-bot (9 messages🔥):

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search: Understand code, faster.


OpenInterpreter ▷ #general (41 messages🔥):

Links mentioned:


OpenInterpreter ▷ #O1 (21 messages🔥):


OpenInterpreter ▷ #ai-content (4 messages):

Link mentioned: GitHub - a-real-ai/pywinassistant: The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.: The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualizati...


tinygrad (George Hotz) ▷ #learn-tinygrad (38 messages🔥):

Links mentioned:


Cohere ▷ #general (24 messages🔥):

Link mentioned: Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models: The disconnect between tokenizer creation and model training in language models has been known to allow for certain inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted behaviour. ...


Cohere ▷ #project-sharing (2 messages):

Link mentioned: Zindi: no description found


Datasette - LLM (@SimonW) ▷ #ai (23 messages🔥):


Datasette - LLM (@SimonW) ▷ #llm (1 messages):

simonw: https://twitter.com/simonw/status/1790121870399782987


Mozilla AI ▷ #llamafile (15 messages🔥):

Links mentioned:


DiscoResearch ▷ #general (9 messages🔥):

Links mentioned:


DiscoResearch ▷ #discolm_german (2 messages):


LLM Perf Enthusiasts AI ▷ #general (4 messages):


LLM Perf Enthusiasts AI ▷ #gpt4 (6 messages):

Link mentioned: Introducing GPT-4o: OpenAI Spring Update – streamed live on Monday, May 13, 2024. Introducing GPT-4o, updates to ChatGPT, and more.


Alignment Lab AI ▷ #general-chat (3 messages):

Link mentioned: AlphaFold3 [AF3] Federation Meet · Luma: Current Progress Update A talk by the lead developer on the current status of Alpha Fold 3 integration. Discussion of any issues encountered during the initial…


Alignment Lab AI ▷ #fasteval-dev (3 messages):


AI Stack Devs (Yoko Li) ▷ #app-showcase (1 messages):


AI Stack Devs (Yoko Li) ▷ #ai-town-dev (1 messages):


Skunkworks AI ▷ #off-topic (1 messages):

pradeep1148: https://www.youtube.com/watch?v=KQ-xGVFHDkw


YAIG (a16z Infra) ▷ #tech-discussion (1 messages):

pranay01: Agree!