Frozen AI News archive

Sora pushes SOTA

Analysis of **Discord communities** spanning **20 guilds**, **312 channels**, and **10550 messages** reveals intense discussions on AI developments. Key highlights include the **Dungeon Master AI assistant** for Dungeons and Dragons using models like **H2O GPT**, GPU power supply debates involving **3090** and **3060 GPUs**, and excitement around **Google's Gemini 1.5** with its **1 million token context window** and **OpenAI's Sora** model. Challenges with **large world models (LWM)** multimodality, **GPT-assisted coding**, and **role-play model optimization** with **Yi models** and **Mixtral Instruct** were discussed. Technical issues like **model merging errors** with **MistralCasualML**, fine-tuning scripts like **AutoFineTune**, and cross-language engineering via **JSPyBridge** were also prominent. NVIDIA's **Chat with RTX** feature leveraging **retrieval-augmented generation (RAG)** on 30-series and newer GPUs was compared to LMStudio's support for **Mistral 7b** and **Llama 13b** models. The community is cautiously optimistic about these frontier models' applications in media and coding.

If you're reading this, you are probably aware of the absolute mayhem unleashed the day after Valentine's. We covered Gemini 1.5 and Sora on a live ThursdAI podcast, so you can get our takes there, and we have also been tracking the must-see and must-know Sora takes on the Latent Space discord. Of course, we weren't alone.

This was a rough day to launch anything if you aren't a frontier model lab.


PART 1: High level Discord summaries

TheBloke Discord Summary


LM Studio Discord Summary

Chat with RTX Generates RAG Excitement: NVIDIA's "Chat with RTX" feature, which uses retrieval-augmented generation (RAG) on NVIDIA 30-series and newer GPUs, has been contrasted with LMStudio, which supports RAG but is currently limited to Mistral 7b and Llama 13b models.

Gemini's Giant Leap in Context: Conversations are abuzz with Google's Gemini 1.5 model boasting a 1 million token context window; access remains invite-only, underscoring the gap between proprietary and open-source AI tools.

Sora's Synthetic Cinema: OpenAI's Sora model, capable of generating videos up to a minute long from text, is on engineers' radars. With availability initially limited to a select group, its implications for the credibility of video evidence are under scrutiny.

Model Support and LM Studio Features in Spotlight: Yi-VL models are pending an update to be compatible with LMStudio due to new llama.cpp requirements. Meanwhile, users discuss LMStudio features ranging from enabling function calling to overcoming model and software restrictions.

RAM Bug Uncovered in LMStudio: An acknowledged bug in LMStudio misreports system RAM, misleading users like @pdx_, who saw no change in the software's reported RAM after a hardware upgrade to 64 GB.

Cost and Compatibility Guide the Hardware Debate: Discussions around hardware for LLM tasks involve detailed cost comparisons for high-end builds, potential GPU mixing for optimizing performance, and overclocking intricacies.

Quantum Compression and AVX Instructions: A new development in model compression, specifically 1.5-bit quantization, is expected to greatly improve efficiency, allowing large models to run on more modest hardware. In the meantime, users are advised to use an AVX beta release for CPUs lacking AVX2 support.

Humorous Take on AI Work Ethic and Errors: @wolfspyre brought levity to the conversation with a comical inquiry into whether bots need to work and a playful depiction of bots stuck in a repetitive output loop.
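
To put the 1.5-bit quantization item above in rough numbers, here is a minimal back-of-the-envelope sketch. It assumes memory is dominated by the weights and ignores activations, the KV cache, and any per-block scale factors a real quantization format would add. The 70B parameter count is a hypothetical example, not a model named in the discussion.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in decimal gigabytes."""
    total_bits = num_params * bits_per_weight
    return total_bits / 8 / 1e9  # bits -> bytes -> GB


if __name__ == "__main__":
    # Hypothetical 70B-parameter model, chosen only for illustration.
    for bits in (16, 4, 1.5):
        size = weight_memory_gb(70e9, bits)
        print(f"{bits:>4} bits/weight: {size:.1f} GB")
```

Under these assumptions the weights shrink in direct proportion to the bit width, which is why lower-bit formats are attractive for consumer GPUs with limited VRAM.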


OpenAI Discord Summary


Nous Research AI Discord Summary


Eleuther Discord Summary


Mistral Discord Summary


LAION Discord Summary


HuggingFace Discord Summary


Perplexity AI Discord Summary


LlamaIndex Discord Summary


LangChain AI Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


CUDA MODE Discord Summary


LLM Perf Enthusiasts AI Discord Summary


Alignment Lab AI Discord Summary


Skunkworks AI Discord Summary


AI Engineer Foundation Discord Summary

Weekly Sync-Up Time: The weekly meeting was initiated with an announcement tagging @._z.

Hackathon Hosting Huddle: An invitation was extended to co-host an AI developers hackathon by @caramelchameleon, considering the proximity to the Game Developers Conference and inviting both online and onsite participation.

Hackathon Experience on the Table: @yikesawjeez indicated interest in the hackathon opportunity, drawing from their background in organizing such events in the Bay Area.

Investor Matchmaking Event: @atalovesyou publicized a chance for startup founders to engage with over 30 venture capital firms at an investor matchmaking session; additional slots are available at Founders x VC Event for interested individuals.


Datasette - LLM (@SimonW) Discord Summary


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1263 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (301 messages🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (58 messages🔥🔥):

Links mentioned:

Tweet from Yohei (@yoheinakajima): Just made a lil' script to easily fine-tune a small model with synthetically generated data... ...calling it "AutoFineTune"! (~110 lines of code) Generates 100+ synthetic message pairs w...


TheBloke ▷ #model-merging (3 messages):


TheBloke ▷ #coding (6 messages):

Links mentioned:

GitHub - extremeheat/JSPyBridge: 🌉. Bridge to interoperate Node.js and Python: 🌉. Bridge to interoperate Node.js and Python . Contribute to extremeheat/JSPyBridge development by creating an account on GitHub.


LM Studio ▷ #💬-general (410 messages🔥🔥🔥):

Links mentioned:

https://www.squibler.io/pricing: no description found


LM Studio ▷ #🤖-models-discussion-chat (125 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (3 messages):


LM Studio ▷ #🎛-hardware-discussion (200 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (19 messages🔥):

Links mentioned:


LM Studio ▷ #langchain (1 messages):

.ben.com: markdown has linebreaks end your line with two spaces the carriage returns


LM Studio ▷ #avx-beta (7 messages):

Links mentioned:

LM Studio Beta Releases: no description found


LM Studio ▷ #crew-ai (1 messages):


OpenAI ▷ #annnouncements (2 messages):

Links mentioned:


OpenAI ▷ #ai-discussions (338 messages🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (131 messages🔥🔥):

Links mentioned:


OpenAI ▷ #prompt-engineering (86 messages🔥🔥):

Links mentioned:


OpenAI ▷ #api-discussions (86 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ctx-length-research (5 messages):

Links mentioned:

World Model on Million-Length Video And Language With RingAttention: Current language models fall short in understanding aspects of the world not easily described in words, and struggle with complex, long-form tasks. Video sequences offer valuable temporal information ...


Nous Research AI ▷ #off-topic (5 messages):

Links mentioned:

Rock Cat Eyebrow Cat GIF - Rock cat Eyebrow cat Meme - Discover & Share GIFs: Click to view the GIF


Nous Research AI ▷ #interesting-links (37 messages🔥):

Links mentioned:


Nous Research AI ▷ #general (536 messages🔥🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (53 messages🔥):

Links mentioned:


Nous Research AI ▷ #collective-cognition (3 messages):


Eleuther ▷ #announcements (1 messages):

Links mentioned:


Eleuther ▷ #general (228 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (218 messages🔥🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (8 messages🔥):


Eleuther ▷ #lm-thunderdome (17 messages🔥):

Links mentioned:


Eleuther ▷ #gpt-neox-dev (4 messages):


Mistral ▷ #general (282 messages🔥🔥):

Links mentioned:


Mistral ▷ #models (38 messages🔥):

Links mentioned:


Mistral ▷ #deployment (7 messages):

Links mentioned:


Mistral ▷ #finetuning (10 messages🔥):

Links mentioned:


Mistral ▷ #showcase (5 messages):

Links mentioned:


Mistral ▷ #random (19 messages🔥):

Links mentioned:


Mistral ▷ #la-plateforme (113 messages🔥🔥):

Links mentioned:


LAION ▷ #general (427 messages🔥🔥🔥):

Links mentioned:


LAION ▷ #research (41 messages🔥):

Links mentioned:


HuggingFace ▷ #announcements (2 messages):

Links mentioned:


HuggingFace ▷ #general (227 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (11 messages🔥):

<ul>
  <li><strong>Merging Sheets with Caution</strong>: `@lunarflu` discussed the challenges of merging two Google Sheets, emphasizing the need to avoid duplicate records and maintain unique keys. They highlighted the importance of creating distinct records to prevent data issues.</li>
  <li><strong>A Melody of Learning</strong>: `@neuralink` expressed their progress in learning about DoReMi reproduction and training with FP8 3D parallelism, achieving a remarkable 99% and 32% respectively.</li>
  <li><strong>End-to-End Learning Spree</strong>: `@sardarkhan_` engaged in a deep dive into diffusers and transformers before switching gears back to rigorous coursework preparation.</li>
  <li><strong>Face Swapping Exploration</strong>: `@virtual_josh` shared their experience exploring different programs for deep faking videos and asked for recommendations on services for swapping faces in videos.</li>
  <li><strong>Custom Labels in NER</strong>: `@jakemorrison` inquired about the flexibility of `ner_tags` labels in token classification, sparking a discussion where `@cubietom` pointed to custom label usage with references to the CoNLL2003 and Few-NERD datasets.</li>
</ul>
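
The custom-label question in the last item above can be sketched in plain Python. This is a minimal illustration, assuming a BIO tagging scheme like the one CoNLL2003 uses, where each tag is stored as an integer id. The entity types `GPU` and `MODEL` and both helper functions are hypothetical and do not come from the discussion or from any library.

```python
def build_label_maps(entity_types):
    """Build BIO label names plus id lookups for custom entity types."""
    labels = ["O"]  # "O" marks tokens outside any entity
    for entity in entity_types:
        labels.append(f"B-{entity}")  # first token of an entity
        labels.append(f"I-{entity}")  # continuation token of an entity
    label2id = {label: i for i, label in enumerate(labels)}
    id2label = {i: label for label, i in label2id.items()}
    return labels, label2id, id2label


def encode_tags(tags, label2id):
    """Convert string tags into integer ids, one per token."""
    return [label2id[tag] for tag in tags]


if __name__ == "__main__":
    # Hypothetical custom entity types, chosen only for illustration.
    labels, label2id, id2label = build_label_maps(["GPU", "MODEL"])
    tokens = ["Run", "Mistral", "7b", "locally"]
    tags = ["O", "B-MODEL", "I-MODEL", "O"]
    print(list(zip(tokens, encode_tags(tags, label2id))))
```

Mappings like `label2id` and `id2label` are what a token-classification model typically needs in order to size its output layer and decode predictions, so swapping in a custom label set mostly comes down to keeping these mappings consistent with the dataset.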

Links mentioned:


HuggingFace ▷ #cool-finds (12 messages🔥):

Links mentioned:

New paper by DeepMind: Buffer Overflow in… (https://huggingface.co/posts/osanseviero/980907000007376): no description found


HuggingFace ▷ #i-made-this (25 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (40 messages🔥):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (17 messages🔥):

Links mentioned:


HuggingFace ▷ #computer-vision (8 messages🔥):

Links mentioned:


HuggingFace ▷ #NLP (6 messages):

Links mentioned:



Perplexity AI ▷ #announcements (1 messages):


Perplexity AI ▷ #general (256 messages🔥🔥):

Links mentioned:


Perplexity AI ▷ #sharing (22 messages🔥):

Links mentioned:


Perplexity AI ▷ #pplx-api (60 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #announcements (1 messages):

Links mentioned:

LlamaIndex Webinar: Build No-Code RAG · Zoom · Luma: Flowise is one of the leading no-code tools for building LLM-powered workflows. Instead of learning how to code in a framework / programming language, users can drag and drop the components...


LlamaIndex ▷ #blog (7 messages):


LlamaIndex ▷ #general (285 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #announcements (10 messages🔥):

Links mentioned:


LangChain AI ▷ #general (64 messages🔥🔥):

Links mentioned:

Pinecone | 🦜️🔗 Langchain: You can use Pinecone vectorstores with LangChain.


LangChain AI ▷ #langserve (64 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #share-your-work (4 messages):

Links mentioned:


LangChain AI ▷ #tutorials (1 messages):

Links mentioned:

Multi Document RAG using LangChain codes explained: This tutorial explains how to use multiple diverse files with a single RAG agent for querying your data. This tutorial is a part of my newly launched book "L...


OpenAccess AI Collective (axolotl) ▷ #general (69 messages🔥🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (20 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (11 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #runpod-help (15 messages🔥):


CUDA MODE ▷ #general (9 messages🔥):


CUDA MODE ▷ #cuda (12 messages🔥):

Links mentioned:


CUDA MODE ▷ #algorithms (9 messages🔥):


CUDA MODE ▷ #beginner (9 messages🔥):

Links mentioned:

The Book of Shaders: Gentle step-by-step guide through the abstract and complex universe of Fragment Shaders.


CUDA MODE ▷ #pmpp-book (7 messages):


CUDA MODE ▷ #youtube-recordings (4 messages):

Links mentioned:

Going Further with CUDA for Python Programmers: This technical talk by Jeremy Howard explores advanced programming techniques for maximizing performance when using CUDA with Python. The focus is on optimiz...


CUDA MODE ▷ #jax (2 messages):


LLM Perf Enthusiasts AI ▷ #general (10 messages🔥):

Links mentioned:

Tweet from Jeff Dean (@🏡) (@JeffDean): Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pr...


LLM Perf Enthusiasts AI ▷ #gpt4 (1 messages):

robotums: yeah


LLM Perf Enthusiasts AI ▷ #offtopic (15 messages🔥):

Links mentioned:

Tweet from Vik Paruchuri (@VikParuchuri): Announcing surya OCR - text recognition in 93 languages. It outperforms tesseract in almost all languages, often by large margins. Find it here - https://github.com/VikParuchuri/surya .


LLM Perf Enthusiasts AI ▷ #irl (1 messages):

Links mentioned:

AI Wednesdays · Luma: Let's hang out and build! 🛠️ 🔥 📍 Location: Near Funan Mall (Exact location will be provided to registered attendees) ⏰ Doors open at 5.30pm, and feel free to join any time. 🍕 Pizza, 📶...


LLM Perf Enthusiasts AI ▷ #openai (8 messages🔥):

Links mentioned:


Alignment Lab AI ▷ #ai-and-ml-discussion (4 messages):


Alignment Lab AI ▷ #general-chat (1 messages):


Alignment Lab AI ▷ #oo (2 messages):


Alignment Lab AI ▷ #qa (1 messages):

daydream.nation: o sh


Skunkworks AI ▷ #general (1 messages):


Skunkworks AI ▷ #datasets (1 messages):


Skunkworks AI ▷ #finetuning (1 messages):


Skunkworks AI ▷ #papers (4 messages):


AI Engineer Foundation ▷ #events (7 messages):

Links mentioned:

Founder x Investor Matchmaking · Luma: LIMITED SPOTS REMAINING. We have received interest from over 600+ Pre-Seed, Seed, Series A+ Founders. We are at capacity but opened a few more slots for founders on a ticket...


Datasette - LLM (@SimonW) ▷ #ai (2 messages):

Links mentioned: