Frozen AI News archive

Karpathy emerges from stealth?

**Andrej Karpathy** released a comprehensive 2-hour tutorial on **tokenization**, detailing techniques up to **GPT-4**'s tokenizer and noting the complexity of **Llama 2** tokenization with SentencePiece. Discussions in AI Discord communities covered **model optimization and efficiency**, focusing on **quantization** of models like **Mistral 7B** and **Zephyr-7B** to reduce memory usage on consumer GPUs, including Intel's new weight-only quantization algorithm. Efforts to improve computational efficiency included selective augmentation, reported to reduce costs by 57.76%, and a comparison of memory tokens with kNN for Transformers. Members shared hardware compatibility challenges and software issues, alongside fine-tuning techniques such as LoRA and model merging. Applications of LLMs in retrieval-augmented generation (RAG), multi-model learning, and meta-reasoning were explored. The community emphasized dataset sharing, open-source releases like SDXL VAE encoded datasets and Audiogen AI codecs, and ethical AI use with censorship and guardrails. Collaboration and resource sharing remain strong in these AI communities.

As mentioned in yesterday's recap, Andrej shipped his tokenization tutorial with an accompanying GitHub repo (tweet):

https://www.youtube.com/watch?v=zduSFxRajkE

It is sobering that a 2-hour tutorial is necessary to fully understand tokenization up to the regex patterns used in GPT-4's tokenizer. As Andrej notes, even that is far from enough to cover Llama 2 tokenization with SentencePiece. And yet tokenization has been at the core of many LLM failure modes, at least from GPT-2 through GPT-4.
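For readers who want a feel for the core idea before watching, here is a minimal sketch of byte-level byte-pair encoding (BPE), the family of algorithm behind the GPT tokenizers. It is not Andrej's code: the function names (`pair_counts`, `merge`, `train_bpe`, `decode`) are made up for this illustration, and it deliberately leaves out the regex pre-splitting step and special tokens that the real GPT-2/GPT-4 tokenizers use.

```python
from collections import Counter


def pair_counts(ids):
    """Count how often each adjacent pair of token ids occurs."""
    return Counter(zip(ids, ids[1:]))


def merge(ids, pair, new_id):
    """Replace each non-overlapping occurrence of `pair` with `new_id`, left to right."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` merges over the raw UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))  # start from byte values 0..255
    merges = {}                       # (left_id, right_id) -> new token id
    for step in range(num_merges):
        counts = pair_counts(ids)
        if not counts:
            break                     # fewer than two tokens left
        pair = counts.most_common(1)[0][0]
        new_id = 256 + step           # new ids start after the byte range
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges


def decode(ids, merges):
    """Map token ids back to text by expanding merges in training order."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")


if __name__ == "__main__":
    text = "aaabdaaabac"
    ids, merges = train_bpe(text, 3)
    print(len(text.encode("utf-8")), "bytes ->", len(ids), "tokens")
    print(decode(ids, merges) == text)
```

Even this toy version hints at why tokenization causes odd model behavior: which byte sequences become single tokens depends entirely on the training text, so spelling, whitespace, and non-English text can be split in surprising ways.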

--

Table of Contents

PART 0: SuperSummary


PART 1: High level Discord summaries

TheBloke Discord Summary


Eleuther Discord Summary


OpenAI Discord Summary


LM Studio Discord Summary


Mistral Discord Summary


LlamaIndex Discord Summary


HuggingFace Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


LAION Discord Summary


Latent Space Discord Summary


CUDA MODE Discord Summary


Perplexity AI Discord Summary


LangChain AI Discord Summary


DiscoResearch Discord Summary


Alignment Lab AI Discord Summary


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1195 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (90 messages🔥🔥):

Links mentioned:

MinervaAI/Aesir-Preview · Datasets at Hugging Face: no description found


TheBloke ▷ #training-and-fine-tuning (13 messages🔥):

Links mentioned:

The LEAKED GPT-4 system prompt is Insane!: 🚨BUY or GIFT Beginners course of Generative AI (with 34% Discount) - https://bit.ly/3HQXsQd (Coupon: LETSGO) 🎉🔗 Links 🔗ChatGPT History - https://chat.op...


TheBloke ▷ #model-merging (1 messages):


TheBloke ▷ #coding (1 messages):


Eleuther ▷ #announcements (1 messages):

Links mentioned:

Tweet from Kyle O'Brien (@KyleDevinOBrien): How can we make classifiers more robust when we can't modify the weights or assume its architecture — effectively making it a black box? In our preprint, we demonstrate that we can improve robust...


Eleuther ▷ #general (160 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (173 messages🔥🔥):

Links mentioned:


Eleuther ▷ #lm-thunderdome (1 messages):

Links mentioned:

gpqa/prompts/chain_of_thought.txt at main · idavidrein/gpqa: Baselines and analysis for the Google-proof Q&A (GPQA) dataset - idavidrein/gpqa


Eleuther ▷ #multimodal-general (5 messages):

Links mentioned:

GitHub - AudiogenAI/agc: Audiogen Codec: Audiogen Codec. Contribute to AudiogenAI/agc development by creating an account on GitHub.


Eleuther ▷ #gpt-neox-dev (2 messages):


OpenAI ▷ #ai-discussions (103 messages🔥🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


OpenAI ▷ #gpt-4-discussions (115 messages🔥🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


OpenAI ▷ #prompt-engineering (44 messages🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


OpenAI ▷ #api-discussions (44 messages🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


LM Studio ▷ #💬-general (141 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (23 messages🔥):

Links mentioned:


LM Studio ▷ #🎛-hardware-discussion (63 messages🔥🔥):

Links mentioned:


LM Studio ▷ #crew-ai (3 messages):


Mistral ▷ #general (104 messages🔥🔥):

Links mentioned:

Chat with Open Large Language Models: no description found


Mistral ▷ #models (22 messages🔥):


Mistral ▷ #deployment (4 messages):


Mistral ▷ #finetuning (22 messages🔥):

Links mentioned:

base_model: mistralai/Mistral-7B-v0.1 model_type: MistralForCausalLM tokenizer - Pastebin.com: Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.


Mistral ▷ #showcase (2 messages):

Links mentioned:

The AI Info Diet ™️: no description found


LlamaIndex ▷ #announcements (1 messages):

Links mentioned:

LlamaIndex Webinar: RAG Beyond Basic Chatbots · Zoom · Luma: RAG is one of the main use cases for LLMs, but many developers are using RAG to build basic Q&A chatbots over simple, static datasets. What are use cases for RAG beyond basic chatbots? We're....


LlamaIndex ▷ #blog (3 messages):


LlamaIndex ▷ #general (118 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (3 messages):


HuggingFace ▷ #general (63 messages🔥🔥):

Links mentioned:

Tweet from Borriss (@Borriss): The Sora videos posted by the OpenAI team are getting wilder.. (Part 2) 7 new ones:


HuggingFace ▷ #cool-finds (4 messages):

Links mentioned:


HuggingFace ▷ #i-made-this (8 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (6 messages):

Links mentioned:

Mamba: The Hard Way: no description found


HuggingFace ▷ #diffusion-discussions (3 messages):

Links mentioned:

Change Clothes on Photo Using AI - Pincel: Change clothes on a photo effortlessly with Pincel AI, the best online app for fast and easy outfit changes using instant AI magic.


HuggingFace ▷ #NLP (4 messages):


OpenAccess AI Collective (axolotl) ▷ #general (71 messages🔥🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (5 messages):

Links mentioned:

axolotl/src/axolotl/train.py at main · OpenAccess-AI-Collective/axolotl: Go ahead and axolotl questions. Contribute to OpenAccess-AI-Collective/axolotl development by creating an account on GitHub.


OpenAccess AI Collective (axolotl) ▷ #runpod-help (6 messages):

Links mentioned:

RunPod template not working with network volumes, /workspace/axolotl empty · Issue #813 · OpenAccess-AI-Collective/axolotl: Please check that this issue hasn't been reported before. I searched previous Bug Reports didn't find any similar reports. Expected Behavior Other users also encountered this: #467 According t...


LAION ▷ #general (67 messages🔥🔥):

Links mentioned:


LAION ▷ #research (5 messages):

Links mentioned:

Reddit - Dive into anything: no description found


Latent Space ▷ #ai-general-chat (70 messages🔥🔥):

Links mentioned:


CUDA MODE ▷ #general (6 messages):

Links mentioned:

Mamba: The Hard Way: no description found


CUDA MODE ▷ #triton (3 messages):

Links mentioned:

Mamba: The Hard Way: no description found


CUDA MODE ▷ #torch (1 messages):

Links mentioned:

Accelerating Generative AI with PyTorch II: GPT, Fast: This post is the second part of a multi-series blog focused on how to accelerate generative AI models with pure, native PyTorch. We are excited to share a breadth of newly released PyTorch performance...


CUDA MODE ▷ #algorithms (6 messages):

Links mentioned:

Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation: Griewank, Andreas, Walther, Andrea: 9780898716597: Amazon.com: Books: no description found


CUDA MODE ▷ #beginner (10 messages🔥):

Links mentioned:


CUDA MODE ▷ #pmpp-book (4 messages):


CUDA MODE ▷ #jax (10 messages🔥):


CUDA MODE ▷ #ring-attention (28 messages🔥):

Links mentioned:


Perplexity AI ▷ #general (34 messages🔥):

Links mentioned:


Perplexity AI ▷ #sharing (2 messages):


Perplexity AI ▷ #pplx-api (2 messages):


LangChain AI ▷ #general (18 messages🔥):

Links mentioned:


LangChain AI ▷ #share-your-work (1 messages):

Links mentioned:

Langchain: This playlist includes all tutorials around LangChain, a framework for building generative AI applications using LLMs


LangChain AI ▷ #tutorials (2 messages):

Links mentioned:


DiscoResearch ▷ #general (5 messages):

Links mentioned:

GitHub - hiyouga/LLaMA-Factory: Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM): Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM) - hiyouga/LLaMA-Factory


DiscoResearch ▷ #benchmark_dev (3 messages):


DiscoResearch ▷ #discolm_german (4 messages):

Links mentioned:

DiscoLM German 7b Demo: no description found


Alignment Lab AI ▷ #general-chat (1 messages):

Links mentioned:

The AI Info Diet ™️: no description found


Skunkworks AI ▷ #off-topic (1 messages):

pradeep1148: https://www.youtube.com/watch?v=DFT0tMBwh04


LLM Perf Enthusiasts AI ▷ #general (1 messages):

jeffreyw128: how do you access it? i can't for the life of me figure it out in the console lol