Frozen AI News archive

GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)

**OpenAI** has released **GPT-4o**, a new **multimodal** model capable of reasoning across text, audio, and video in real time with low latency (~300ms). It features voice and vision capabilities, improved non-English language performance with an expanded 200k vocabulary tokenizer, and is available to all ChatGPT users including free plans. GPT-4o is half the price and twice as fast as GPT-4-turbo with 5x rate limits. The model supports real-time voice and video input/output and shows strong coding capabilities. The release includes a new desktop app that can read screen and clipboard history, challenging existing desktop agent startups. The announcement was accompanied by demos including image generation and 3D object handling, with OpenAI achieving state-of-the-art performance in ASR and vision tasks. The update was widely discussed on social media, with comparisons to GPT-4T highlighting GPT-4o's speed and versatility. *"GPT-4o is smart, fast, natively multimodal, and a step towards more natural human-computer interaction"* and *"extremely versatile and fun to play with"*.

Canonical issue URL

AI News for 5/10/2024-5/13/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (426 channels, and 7769 messages) for you. Estimated reading time saved (at 200wpm): 763 minutes.

Say hello to GPT-4O!

https://www.youtube.com/watch?v=DQacCB9tDaw

It turns out that the numerous leaks about a "Her" like-chatbot announcement were most accurate, with a surprisingly "hot" voice but also the ability to respond with (an average 300ms, down from ~3000ms) low latency, have vision, handle interruptions and sing, speak faster or in pirate/whale, and more. There's also a waitlisted new desktop app that has the ability to read from the screen and clipboard history that directly challenges the desktop agent startups like Multion/Adept.

But nobody leaked that this also comes with a new versioned model, now confirmed to be the "gpt2-chatbot" that was previewed on LMsys, that is confirmed to be substantially above all other prior frontier models:

image.png

The official blogpost has a lot more video examples demonstrating the app and model, including new versions of image output that may or may not be Dall-E or some completely new thing:

image.png

Lots of people are making noise about the 3d object demo, but we can't be sure if that's just code generation since there were hidden steps in there.

To do this, OpenAI had to beat SOTA on everything all at once, including ASR and Vision:

image.png

image.png

The tiktokenizer update revealed an expanded 200k vocab size that makes non-English cheaper/more native.

Lots more takes are flying, but as is tradition on Frontier Model days on AINews, we're publishing two editions of AINews. You're currently reading the one where all Part 1 and Part 2 summaries are done by GPT4O - the next email you get is the same but with GPT4T (update: it completed here, 74% slower than GPT4O). We envision that you will pull them up side by side (like this!) to get comparisons on discords you care about to better understand the improvements/regressions.

image.png


Table of Contents

[TOC]


AI Twitter Recap

all recaps done by Claude 3 Opus, best of 4 runs. We are working on clustering and flow engineering with Haiku.

OpenAI Releases GPT-4o, a Multimodal Model with Voice and Vision Capabilities

Key Demos and Capabilities

Reactions and Implications

Other AI News and Discussions


AI Reddit Recap

Across r/LocalLlama, r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity. Comment crawling works now but has lots to improve!

OpenAI's Upcoming Announcement

Advances in AI Capabilities

Open Source AI Developments

Optimizing AI Performance

Humor and Memes


AI Discord Recap

A summary of Summaries of Summaries

Claude 3 Sonnet

1. Efficient AI Model Training and Inference:

2. Open-Source LLM Developments:

3. Multimodal AI Capabilities:

4. Debates on AI Safety, Ethics, and Regulation:

Claude 3 Opus

Here is a high-level summary of the top 3-4 major themes across the Discord channels, with important key terms, facts, and URLs bolded and linked to sources where relevant:

Let me know if you would like me to elaborate on any part of this summary or if you have additional questions!

GPT4T (gpt-4-turbo-2024-04-09)

Major Themes:

  1. Regulatory Concerns and Monopolistic Moves: There's significant discussion and concern over OpenAI's regulatory actions, particularly around practices that may favor larger companies, potentially leading to a monopolistic environment. Members expressed mixed feelings about OpenAI's moves, with criticisms particularly about potential restrictions that harm smaller competitors.

  2. New Model Releases and Enhancements: Several discords discussed the release and capabilities of new models like GPT-4o, WizardLM, and Falcon 2. The release of these models sparked discussions about their enhanced multi-modal capabilities, performance improvements, and general excitement or skepticism about their real-world applications.

  3. Technical Tools and Innovations: Various communities delved into technical aspects, discussing new tools and updates such as ThunderKittens for optimizing CUDA kernels, stable diffusion innovations, and advancements in model training techniques. There was a strong focus on optimizing performance and integrating the latest technological advancements.

  4. Community Engagement and Speculations: Across several platforms, members engaged in forward-looking speculations about the impact of AI on various sectors. There were debates about the legal implications of deploying AI-driven services, discussions on the potential monopolistic behavior of AI giants, and the community's role in shaping the ethics and policies of AI development.

Significant Discussions Linked to URLs:

GPT4O (gpt-4o-2024-05-13)

  1. Regulatory Challenges and Platform Control:

    • OpenAI's Regulatory Moves: Discussions spanned multiple communities about OpenAI's implementation of tighter control through measures like compulsory GPU signing and collaboration with the White House, raising concerns over monopolistic tendencies (e.g., [Unsloth AI (Daniel Han)]).
    • Competitive Landscape: Concerns were also raised about how these moves could marginalize smaller competitors, favoring big tech companies, indicating a broader fear of restricted innovation in the AI space Nous Research AI.
  2. Advancements in and Deployment of New Models:

    • GPT-4o Release: Enthusiasm was noted for GPT-4o's launch, highlighting its free public access with certain limitations and multi-modal capabilities integrating audio, vision, and text reasoning OpenAI.
    • Community Response: Some noted mixed emotions about GPT-4o's performance compared to previous models, with some excitement over new features overshadowed by noted reasoning inconsistencies Perplexity AI and HuggingFace.
  3. Focus on Technical Optimization and Fine-Tuning:

    • ThunderKittens: Gained attention for its promising kernel performance improvements, suggested to outperform existing methods like Flash Attention 2 CUDA MODE and Unsloth AI (Daniel Han).
    • Fine-Tuning Issues: Multiple communities mentioned difficulties in fine-tuning models like Llama3, with discussions about specific solutions and optimization techniques HuggingFace.
  4. Application and Use-Case Innovations:

    • World Simulation and AI Agents: Platforms for running simulations like Websim and AI agents for tasks like generating PowerPoint presentations were shared. There was also notable interest in enhancing simulation capabilities, including integrating Digital Audio Workstations Nous Research AI.
    • Community Tool Sharing: Users frequently shared code examples, scripts, and tutorials to assist with setting up and configuring AI tools, emphasizing collaborative knowledge sharing across projects like LangChain AI and HuggingFace.

Important Links:

  1. WizardLM GitHub: https://huggingface.co/alpindale/WizardLM-2-8x22B
  2. ThunderKittens GitHub: https://github.com/HazyResearch/ThunderKittens
  3. OpenRouter API Watcher Demo: https://orw.karleo.net/
  4. RAG Pipeline Tutorial: https://zackproser.com/blog/langchain-pinecone-chat-with-my-blog
  5. Deep Learning Initialization Guide: https://www.deeplearning.ai/ai-notes/initialization/index.html
  6. AI Research Papers (various links):

PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


OpenAI Discord


Nous Research AI Discord


Latent Space Discord


Perplexity AI Discord


HuggingFace Discord


LM Studio Discord

GPT Agents in Learning Limbo: GPT agents' inability to assimilate new information into their base knowledge caused buzz, with clarification on how information is stored as "knowledge" files that don't update the agent's core understanding.

Hardware Hurdles for Hi-Tech Pursuits: Engineers faced challenges running advanced models like Llama 3 70B Q8 on hardware with 128GB RAM, with PCIe 3.0 causing bottlenecks remedied by switching to PCIe 4.0 motherboards. Utilizing GPUs with less than 6GB VRAM for weighty models proved futile.

Yi Models Yield Enthusiasm: Yi-1.5 models, including 9B and quantized 34B variants, received praise and recommendations for a variety of tasks, with quantized models leveraging llama.cpp for improved performance.

Tooling Up for Efficiency: LM Studio's 0.2.22 update introduced a CLI tool, lms, for model management and boasted bug fixes in llama.cpp, while the community navigated the complexities of connecting OpenInterpreter to LM Studio and configuring headless installations on Linux servers.

Quest for Research Collaboration: Dispensing with corporate vernacular, the conversation sought aid and shared experiences for running MemGPT on various setups, revealing a collective endeavor to optimize this AI model.


OpenRouter (Alex Atallah) Discord

JetMoE 8B Free Hits a Snag: The JetMoE 8B Free model is experiencing downtime due to upstream overload, returning an error (502) to all requests until further notice.

Eye on the Models—OpenRouter API Watcher: An open-source tool called OpenRouter API Watcher has been unveiled, which keeps track of changes in OpenRouter's model availability, offering hourly updates via a web interface and an RSS feed with low overhead. Check out the demo.

A Beta Tester’s Dream with Rubik's AI Pro: Users can beta test and provide feedback for Rubik's AI Pro, an advanced research assistant and search engine, with 2 months of free premium access using a RUBIX promo code. Further details can be found at Rubik's AI.

Jetmoe’s Caveat: It has been confirmed that Jetmoe lacks internet access, which restricts its use cases, but it remains useful for academic research.

GPT-4o Joins OpenRouter: GPT-4o has been added to OpenRouter’s arsenal, supporting text and image inputs, and generating buzz for its performance and competitive pricing, although it lacks support for video and audio inputs.


Modular (Mojo 🔥) Discord


CUDA MODE Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


LAION Discord


LangChain AI Discord


LlamaIndex Discord


OpenAccess AI Collective (axolotl) Discord

Tech-Savvy Inner Circle Shares AI Insights

Deploying practical solutions and seamless updates remains a collective goal in tackling emergent AI tech puzzles — updates and breakthroughs to follow.


OpenInterpreter Discord

Goofy Errors and Speedy Performances: Claude API users reported "goofy errors" impeding its use, whereas GPT-4o garnered praise for its swift performance, clocking at "minimum 100 tokens/s." Local models such as Mixtral and Llama3 were considered inferior to GPT-4.

PyWinAssistant Showcases AI Control over UI: An open-source project dubbed PyWinAssistant allows control of user interfaces through natural language, leveraging Visualization-of-Thought for spatial reasoning. Excitement grew as users shared a GitHub repo and a live YouTube demo.

Hardware Headaches and Software Solutions: Integration of LiteLLM, Groq and Llama3 successfully confirmed, while another user struggled to connect their 01-Light device. Separate issues arose with Python script execution resolved by importing OpenInterpreter correctly.

Shipment Updates and Support Channels: Queries about the 01 hardware brought news of upcoming batch shipments, and an iOS app for the hardware is in beta, shared on GitHub. Order cancellations were directed to [email protected].

Dev Discussions on Model Swapping: The 01 dev preview prompted exchanges on switching to local models using poetry run 01 --local, offering insights into model selection commands.


tinygrad (George Hotz) Discord


Cohere Discord


Datasette - LLM (@SimonW) Discord


Mozilla AI Discord


DiscoResearch Discord

Searching for German Content: A pursuit for diverse German YouTube channels to train a Text-to-Speech model led to suggestions such as using Mediathekview to download content. The Mediathekview's JSON API was also highlighted as a resourceful tool, as seen in the GitHub repository.

Keep It English: A reminder was issued within the discussions to ensure that English remains the primary language for communication, possibly to maintain the accessibility of discussions.

Demo Status Check: An inquiry about the status of a unidentified demo received no response, indicating either a lack of information or attention to the query.

Thumbs Up for... Something: Positive feedback was expressed with a brief "It's really nice," comment, though the context of this satisfaction wasn't expanded upon.

Curiosity for RT Audio Interface: There's evident curiosity and excitement about the "RT Audio interface" in applications beyond chat, but experiences or results have not yet been shared in the discussions.


LLM Perf Enthusiasts AI Discord


Alignment Lab AI Discord

AlphaFold Goes Social: The AlphaFold3 Federation has sprung into action, inviting participants to a meet on May 12th at 9pm EST focusing on updates and pipeline development, with an open invitation link here.

Fasteval on the Brink: The fasteval project seems to be ending, but hope remains for someone to assume the helm; the current maintainers are open to transferring the project found on GitHub, or else they suggest archiving it.


AI Stack Devs (Yoko Li) Discord


Skunkworks AI Discord


YAIG (a16z Infra) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (833 messages🔥🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #random (15 messages🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (312 messages🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Link mentioned: LlamaForTokenClassification - a SauravMaheshkar Collection: no description found


Stability.ai (Stable Diffusion) ▷ #general-chat (976 messages🔥🔥🔥):

Links mentioned:


OpenAI ▷ #annnouncements (2 messages):


OpenAI ▷ #ai-discussions (684 messages🔥🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (126 messages🔥🔥):

- **Issues Passing Files to GPT Actions**: A member asked if anyone figured out how to pass uploaded files to a GPT action. There wasn't a clear resolution provided in the discussion.

- **GPT-4T API Provides Higher Context**: Discussion highlighted that the API for GPT-4T is less restrained and currently allows a 128k context. Members discussed the nuances of this capability.

- **Random Output with High Temperature Settings**: A member experienced random outputs when setting the temperature above 1.5. Another advised keeping the temperature below 1 for stable and coherent responses.

- **Fetching OpenAI Model Pricing**: Members shared that OpenAI pricing is static and can be reviewed on the [OpenAI pricing page](https://openai.com/api/pricing/). There are no alerts for pricing changes, so users need to monitor the page manually.

- **Custom GPTs and Cross-Session Memory**: There was confusion about custom GPTs' cross-session memory capabilities, clarified by a member noting that per-GPT memories have not rolled out yet. More details about this can be found in the [OpenAI Memory FAQ](https://help.openai.com/en/articles/8590148-memory-faq).

OpenAI ▷ #prompt-engineering (32 messages🔥):


OpenAI ▷ #api-discussions (32 messages🔥):


OpenAI ▷ #api-projects (2 messages):


Nous Research AI ▷ #ctx-length-research (1 messages):

king.of.kings_: i am struggling to get llama 3 70b to be coherent over 8k tokens lol


Nous Research AI ▷ #off-topic (15 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (6 messages):

Links mentioned:


Nous Research AI ▷ #general (741 messages🔥🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (48 messages🔥):

Links mentioned:


Nous Research AI ▷ #rag-dataset (5 messages):

Links mentioned:


Nous Research AI ▷ #world-sim (22 messages🔥):

Links mentioned:


Latent Space ▷ #ai-general-chat (93 messages🔥🔥):

Links mentioned:


Latent Space ▷ #ai-announcements (1 messages):

Link mentioned: Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


Latent Space ▷ #llm-paper-club-west (710 messages🔥🔥🔥):

Links mentioned:


Perplexity AI ▷ #general (658 messages🔥🔥🔥):

- **Cheerio Library Alternatives**: A user asked if there's a faster way than the Cheerio library to extract content from HTML strings. Another user provided a link to [Perplexity's AI search](https://www.perplexity.ai/search/Is-there-a-xOtvxOveTGSfbae88ElQMA) for further exploration.

- **ChatGPT Plus vs. Perplexity Pro**: Discussions highlighted the comparative advantages of ChatGPT Plus and Perplexity Pro, including context window sizes and general AI capabilities. Users shared their experiences, stating Perplexity as more focused on being an AI search engine with specific features such as collections and model flexibility.

- **Claude 3 Opus Limits**: Users frequently mentioned dissatisfaction with the imposed limits on Claude 3 Opus usage in Perplexity Pro. One user suggested considering YesChat as an alternative, which offers more generous usage quotas.

- **GPT-4o Release Buzz**: Conversations were abuzz with the release of GPT-4o, noting its improved speed and capabilities. There was anticipation for when Perplexity would integrate GPT-4o, with comparisons to how it might outclass existing models like Claude 3 Opus.

- **Perplexity's Context Handling**: Users discussed the effectiveness of Perplexity in handling context windows and RAG (retrieval-augmented generation). The consensus was that while 32k tokens seem standard, there is uncertainty and a desire for greater context capabilities.

Links mentioned:


Perplexity AI ▷ #sharing (21 messages🔥):

Link mentioned: Alexandr Yarats, Head of Search at Perplexity – Interview Series: Alexandr Yarats is the Head of Search at Perplexity AI. He began his career at Yandex in 2017, concurrently studying at the Yandex School of Data Analysis. The initial years were intense yet rewarding...


Perplexity AI ▷ #pplx-api (4 messages):


HuggingFace ▷ #general (389 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (3 messages):

Links mentioned:


HuggingFace ▷ #cool-finds (10 messages🔥):

Links mentioned:


HuggingFace ▷ #i-made-this (7 messages):

Links mentioned:


HuggingFace ▷ #reading-group (2 messages):

Link mentioned: You Only Cache Once: Decoder-Decoder Architectures for Language Models: We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once. It consists of two components, i.e., a cross-decoder stacked upon a self-decoder. ...


HuggingFace ▷ #computer-vision (6 messages):

Links mentioned:


HuggingFace ▷ #NLP (7 messages):

Link mentioned: Building a new tokenizer: Learn how to use the 🤗 Tokenizers library to build your own tokenizer, train it, then how to use it in the 🤗 Transformers library.This video is part of the...


HuggingFace ▷ #diffusion-discussions (14 messages🔥):

Links mentioned:


LM Studio ▷ #💬-general (183 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (92 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (4 messages):


LM Studio ▷ #⚙-configs-discussion (7 messages):

Link mentioned: Shoo Go Away GIF - Shoo Go Away Johnny Depp - Discover & Share GIFs: Click to view the GIF


LM Studio ▷ #🎛-hardware-discussion (106 messages🔥🔥):


LM Studio ▷ #🧪-beta-releases-chat (12 messages🔥):

Link mentioned: Big Code Models Leaderboard - a Hugging Face Space by bigcode: no description found


LM Studio ▷ #memgpt (4 messages):


LM Studio ▷ #amd-rocm-tech-preview (2 messages):


LM Studio ▷ #open-interpreter (4 messages):


LM Studio ▷ #model-announcements (1 messages):

Links mentioned:


LM Studio ▷ #🛠-dev-chat (19 messages🔥):

Link mentioned: Introducing lms - LM Studio's companion cli tool | LM Studio: Today, alongside LM Studio 0.2.22, we're releasing the first version of lms — LM Studio's companion cli tool.


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (251 messages🔥🔥):

- **Jetmoe lacks online access**: When asked if **Jetmoe** has online access, the response was clear, *“No, it doesn’t.”* Jetmoe is considered good for academic research despite this limitation.
  
- **OpenRouter tackles anti-fraud measures aggressively**: Discussion on anti-fraud updates revealed that **OpenRouter** has implemented measures to combat fraud due to losses from credit card skimming. Users can opt for crypto transactions to avoid providing personal information.

- **Embedding models support in consideration**: When asked about embedding models support, it was mentioned that **OpenRouter** is working on improving the backend and has embedding models in the queue, but there is no immediate roadmap yet.

- **Inconsistent prompt formatting issues**: Users discussed how models like **Claude** handle instructions differently than models focused on RP (role-playing) or generic tasks. The need for trial and error in crafting effective prompts for different models was highlighted.

- **OpenRouter adds GPT-4o**: Excitement surrounded the addition of **GPT-4o** to OpenRouter, with users noting its competitive pricing and high performance in benchmarks. OpenRouter will support text and image inputs for GPT-4o, although video and audio are not available.

Links mentioned:


Modular (Mojo 🔥) ▷ #general (65 messages🔥🔥):

Link mentioned: PEP 604 – Allow writing union types as X | Y | peps.python.org: no description found


Modular (Mojo 🔥) ▷ #💬︱twitter (1 messages):

ModularBot: From Modular: https://twitter.com/Modular/status/1790046377613144201


Modular (Mojo 🔥) ▷ #📺︱youtube (1 messages):


Modular (Mojo 🔥) ▷ #🔥mojo (85 messages🔥🔥):

Links mentioned:


Modular (Mojo 🔥) ▷ #performance-and-benchmarks (1 messages):

Link mentioned: GitHub - dorjeduck/mostring: variations over StringBuilder ideas in Mojo: variations over StringBuilder ideas in Mojo. Contribute to dorjeduck/mostring development by creating an account on GitHub.


Modular (Mojo 🔥) ▷ #nightly (64 messages🔥🔥):

Links mentioned:


CUDA MODE ▷ #general (5 messages):


CUDA MODE ▷ #triton (43 messages🔥):

Links mentioned:


CUDA MODE ▷ #cuda (9 messages🔥):

Links mentioned:


CUDA MODE ▷ #announcements (1 messages):

Link mentioned: Join our Cloud HD Video Meeting: Zoom is the leader in modern enterprise video communications, with an easy, reliable cloud platform for video and audio conferencing, chat, and webinars across mobile, desktop, and room systems. Zoom ...


CUDA MODE ▷ #algorithms (1 messages):

random_string_of_character: https://arxiv.org/abs/2405.05219


CUDA MODE ▷ #beginner (14 messages🔥):

Links mentioned:


CUDA MODE ▷ #pmpp-book (1 messages):

Link mentioned: Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


CUDA MODE ▷ #off-topic (5 messages):


CUDA MODE ▷ #irl-meetup (1 messages):

boxxy_ms: anyone in Toronto?


CUDA MODE ▷ #triton-puzzles (2 messages):


CUDA MODE ▷ #llmdotc (67 messages🔥🔥):

- **ZeRO-1 empowers VRAM battle**: ZeRO-1 integration was discussed, with benchmarks showing a 54% training throughput improvement by optimizing VRAM usage, allowing batch size increase from 4 to 10, maxing out the A100's 40GB VRAM capacity. Catch more details [here](https://github.com/karpathy/llm.c/pull/309).
- **Optimization insights on GPU workloads**: Members discussed the benefit of performing calculations outside of CUDA kernels to optimize integer divisions and memory-bound kernels. Perspectives were shared on using 2D/3D grids and thread coarsening for efficiency, backed by detailed [code discussions](https://github.com/karpathy/llm.c/blob/master/train_gpt2.cu#L689).
- **ThunderKittens catches interest**: The potential of HazyResearch's [ThunderKittens](https://github.com/HazyResearch/ThunderKittens) for H100 llm.c optimization sparked excitement. Members see it as a lower-level abstraction than CUTLASS for managing tensor core layouts.
- **Efforts to improve CI with GPU support**: Talks revolved around the lack of GPUs in llm.c’s CI and ways to bridge this gap, noting GitHub Actions' recent GPU runner beta. Suggestions included upgrading GitHub plans and references to current pricing [details](https://docs.github.com/en/billing/managing-billing-for-github-actions/about-billing-for-github-actions#per-minute-rates-for-larger-runners).

Links mentioned:


CUDA MODE ▷ #lecture-qa (48 messages🔥):

Links mentioned:


CUDA MODE ▷ #youtube-watch-party (5 messages):

Links mentioned:


Eleuther ▷ #general (61 messages🔥🔥):

Link mentioned: No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image gener...


Eleuther ▷ #research (79 messages🔥🔥):

Links mentioned:


Eleuther ▷ #scaling-laws (7 messages):

Links mentioned:


Eleuther ▷ #interpretability-general (3 messages):


Eleuther ▷ #gpt-neox-dev (1 messages):

oleksandr07173: Hello


Interconnects (Nathan Lambert) ▷ #news (120 messages🔥🔥):

- **First Look at VideoFX Generations**: A user shared a [link to VideoFX footage](https://fxtwitter.com/bedros_p/status/1789256595123179701?s=46), stating there are more examples but it's still a WIP. The shared footage demonstrates early capabilities of VideoFX generations.
  
- **GPT-4o Steals the Spotlight**: [Liam Fedus announced](https://x.com/liamfedus/status/1790064963966370209?s=46) GPT-4o as the new state-of-the-art model. Users discussed its superior performance in coding compared to older versions and speculated about its potential in MATH and other benchmarks.

- **OpenAI's New Tokenizer**: A member shared a [GitHub commit](https://github.com/openai/tiktoken/commit/9d01e5670ff50eb74cdb96406c7f3d9add0ae2f8) for the new OpenAI tokenizer. The update appears to improve processing speeds by utilizing a larger vocabulary.

- **OpenAI's Latest Demo Reaction**: Although a user found the demo impressive, they didn't see anything fundamentally new beyond UI improvements. Other discussions included speculation around GPT-4o's capabilities and its availability, with questions about OpenAI’s data strategies.

- **GPT-4o Dominates on LMSys Arena**: LMSys org [shared exciting news](https://x.com/lmsysorg/status/1790097588399779991?s=46) that GPT-4o has surpassed all models on the LMSys Arena with a significant Elo increase. The model's enhancements in reasoning and coding were particularly highlighted by users.

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (1 messages):

Link mentioned: PPO / Reinforce Trainers by vwxyzjn · Pull Request #1540 · huggingface/trl: This RP supports the REINFORCE RLOO trainers in https://arxiv.org/pdf/2402.14740.pdf. Note that REINFORCE's loss is a special case of PPO, as shown below it matches the REINFORCE loss presented i...


Interconnects (Nathan Lambert) ▷ #random (5 messages):


Interconnects (Nathan Lambert) ▷ #reads (11 messages🔥):


LAION ▷ #general (109 messages🔥🔥):

Links mentioned:


LAION ▷ #research (5 messages):

Link mentioned: Tweet from LAION (@laion_ai): Wanna train transformers with audio as if it was text? - Here is how. :) https://youtu.be/NwZufAJxmMA https://discord.gg/6jWrFngyPe


LangChain AI ▷ #general (105 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #share-your-work (4 messages):

Links mentioned:


LangChain AI ▷ #tutorials (3 messages):

Link mentioned: Build a RAG pipeline for your blog with LangChain, OpenAI and Pinecone: You can chat with my writing and ask me questions I've already answered even when I'm not around


LlamaIndex ▷ #blog (8 messages🔥):

Links mentioned:


LlamaIndex ▷ #general (87 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (3 messages):

Link mentioned: Knowledge Distillation for Fine-Tuning a GPT-3.5 Judge: Enhancing Accuracy and Performance : no description found


OpenAccess AI Collective (axolotl) ▷ #general (30 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #general-help (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (10 messages🔥):

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search: Understand code, faster.


OpenAccess AI Collective (axolotl) ▷ #axolotl-phorm-bot (9 messages🔥):

Link mentioned: OpenAccess-AI-Collective/axolotl | Phorm AI Code Search: Understand code, faster.


OpenInterpreter ▷ #general (41 messages🔥):

Links mentioned:


OpenInterpreter ▷ #O1 (21 messages🔥):


OpenInterpreter ▷ #ai-content (4 messages):

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (38 messages🔥):

Links mentioned:


Cohere ▷ #general (24 messages🔥):

Link mentioned: Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models: The disconnect between tokenizer creation and model training in language models has been known to allow for certain inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted behaviour. ...


Cohere ▷ #project-sharing (2 messages):

Link mentioned: Zindi: no description found


Datasette - LLM (@SimonW) ▷ #ai (23 messages🔥):


Datasette - LLM (@SimonW) ▷ #llm (1 messages):

simonw: https://twitter.com/simonw/status/1790121870399782987


Mozilla AI ▷ #llamafile (15 messages🔥):

Links mentioned:


DiscoResearch ▷ #general (9 messages🔥):

Links mentioned:


DiscoResearch ▷ #discolm_german (2 messages):

- **Demo status inquiry**: A user asked, *"Is the demo down?"* but there was no response to this query.
- **Positive feedback**: Another user remarked, *"It's really nice,"* expressing satisfaction without further elaboration.

LLM Perf Enthusiasts AI ▷ #general (4 messages):


LLM Perf Enthusiasts AI ▷ #gpt4 (6 messages):

Watch the full update here

Link mentioned: Introducing GPT-4o: OpenAI Spring Update – streamed live on Monday, May 13, 2024. Introducing GPT-4o, updates to ChatGPT, and more.


Alignment Lab AI ▷ #general-chat (3 messages):

Link mentioned: AlphaFold3 [AF3] Federation Meet · Luma: Current Progress Update A talk by the lead developer on the current status of Alpha Fold 3 integration. Discussion of any issues encountered during the initial…


Alignment Lab AI ▷ #fasteval-dev (3 messages):


AI Stack Devs (Yoko Li) ▷ #app-showcase (1 messages):


AI Stack Devs (Yoko Li) ▷ #ai-town-dev (1 messages):


Skunkworks AI ▷ #off-topic (1 messages):

pradeep1148: https://www.youtube.com/watch?v=KQ-xGVFHDkw