Frozen AI News archive

The Core Skills of AI Engineering

**AI Discords for 2/2/2024** analyzed **21 guilds**, **312 channels**, and **4782 messages**, saving an estimated **382 minutes** of reading time. Discussions included **Eugene Yan** initiating a deep dive into **AI engineering** challenges, highlighting overlaps between software engineering and data science skills. The **TheBloke Discord** featured talks on **MiquMaid**, **OLMo** (an open-source 65B LLM by **AI2** under Apache 2.0), **Aphrodite** model batching, **AWQ** quantization, and **LoRA** fine-tuning techniques like **QLoRA** and **LoftQ**. The **LAION Discord** discussed **SSD-1B** distillation issues, data quality optimization with captioning datasets like **BLIP**, **COCO**, and **LLaVA**, and tokenization strategies for prompt adherence in image generation. Other topics included AI security with watermarking, superconductors and carbon nanotubes for hardware, and deployment of LLMs via **Hugging Face** tools.



We really tried to avoid featuring Latent Space twice in a row, but Eugene Yan kicked off a discussion on AI Engineering:

image.png

This resulted in the longest-ever thread on the topic:

image.png

The central confusion is twofold: the high degree of overlap between what are traditionally software engineering skills and data science skills, and the difficulty software engineers have when dealing with probabilistic, data-driven systems. Do they need to read papers? Do they need to write CUDA kernels?

Some mental models were created:

image.png

as well as a progression path for skill development:

image.png


PART 1: High level Discord summaries

TheBloke Discord Summary


LAION Discord Summary


Latent Space Discord Summary


Eleuther Discord Summary


Nous Research AI Discord Summary


OpenAI Discord Summary


LM Studio Discord Summary


HuggingFace Discord Summary


Mistral Discord Summary


Perplexity AI Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


LlamaIndex Discord Summary


CUDA MODE (Mark Saroufim) Discord Summary


LangChain AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


Alignment Lab AI Discord Summary


Datasette - LLM (@SimonW) Discord Summary


DiscoResearch Discord Summary


Skunkworks AI Discord Summary


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1441 messages🔥🔥🔥):



TheBloke ▷ #characters-roleplay-stories (227 messages🔥🔥):



TheBloke ▷ #training-and-fine-tuning (8 messages🔥):

Links mentioned:

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models: Quantization is an indispensable technique for serving Large Language Models (LLMs) and has recently found its way into LoRA fine-tuning. In this work we focus on the scenario where quantization and L...


TheBloke ▷ #model-merging (1 message):

kquant: internLM is a solid recommendation.


TheBloke ▷ #coding (2 messages):


LAION ▷ #general (380 messages🔥🔥):



LAION ▷ #research (24 messages🔥):



Latent Space ▷ #ai-general-chat (158 messages🔥🔥):



Latent Space ▷ #ai-announcements (2 messages):



Latent Space ▷ #llm-paper-club-east (63 messages🔥🔥):



Latent Space ▷ #ai-in-action-club (134 messages🔥🔥):



Eleuther ▷ #general (161 messages🔥🔥):



Eleuther ▷ #research (16 messages🔥):

Links mentioned:

Efficient Exploration for LLMs: We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fi...


Eleuther ▷ #lm-thunderdome (22 messages🔥):



Nous Research AI ▷ #off-topic (11 messages🔥):



Nous Research AI ▷ #interesting-links (17 messages🔥):

Links mentioned:

Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.: A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team


Nous Research AI ▷ #general (114 messages🔥🔥):



Nous Research AI ▷ #ask-about-llms (16 messages🔥):

Links mentioned:

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO · Hugging Face: no description found


OpenAI ▷ #ai-discussions (25 messages🔥):

Note: Other participant messages were casual greetings or undetailed mentions and do not contribute substantive discussion points to summarize.


OpenAI ▷ #gpt-4-discussions (117 messages🔥🔥):



OpenAI ▷ #prompt-engineering (6 messages):


OpenAI ▷ #api-discussions (6 messages):


LM Studio ▷ #💬-general (58 messages🔥🔥):

Links mentioned:

TheBloke/Mixtral_34Bx2_MoE_60B-GGUF · Hugging Face: no description found


LM Studio ▷ #🤖-models-discussion-chat (19 messages🔥):



LM Studio ▷ #🎛-hardware-discussion (32 messages🔥):


LM Studio ▷ #🧪-beta-releases-chat (8 messages🔥):


LM Studio ▷ #autogen (6 messages):


LM Studio ▷ #langchain (2 messages):


HuggingFace ▷ #general (45 messages🔥):



HuggingFace ▷ #cool-finds (6 messages):



HuggingFace ▷ #i-made-this (6 messages):



HuggingFace ▷ #reading-group (51 messages🔥):



HuggingFace ▷ #core-announcements (10 messages🔥):

Links mentioned:

Release v0.26.0: New video pipelines, single-file checkpoint revamp, multi IP-Adapter inference with multiple images · huggingface/diffusers: This new release comes with two new video pipelines, a more unified and consistent experience for single-file checkpoint loading, support for multiple IP-Adapters’ inference with multiple reference...


HuggingFace ▷ #computer-vision (2 messages):


HuggingFace ▷ #NLP (4 messages):

Links mentioned:

Convert tiktoken tokenizers to the Hugging Face tokenizers format: Convert tiktoken tokenizers to the Hugging Face tokenizers format - tiktoken-to-hf.ipynb


Mistral ▷ #general (28 messages🔥):



Mistral ▷ #models (7 messages):


Mistral ▷ #deployment (5 messages):

Links mentioned:

👾 LM Studio - Discover and run local LLMs: Find, download, and experiment with local LLMs


Mistral ▷ #showcase (15 messages🔥):



Perplexity AI ▷ #general (43 messages🔥):



Perplexity AI ▷ #sharing (6 messages):

Links mentioned:

I use Perplexity MORE than Google and ChatGPT: Main Takeaways From this Video: "I use Perplexity more than ChatGPT, BARD, and Microsoft Copilots for five main reasons, including its use in content creation...


Perplexity AI ▷ #pplx-api (3 messages):


OpenAccess AI Collective (axolotl) ▷ #general (46 messages🔥):

Links mentioned:

Tweet from Damien C. Tanner (@dctanner): The AI SuperServer is live!


OpenAccess AI Collective (axolotl) ▷ #general-help (3 messages):


OpenAccess AI Collective (axolotl) ▷ #datasets (1 message):

Links mentioned:

PJMixers/Math-Multiturn-100K-ShareGPT · Datasets at Hugging Face: no description found


LlamaIndex ▷ #blog (1 message):


LlamaIndex ▷ #general (35 messages🔥):



LlamaIndex ▷ #ai-discussion (6 messages):



CUDA MODE (Mark Saroufim) ▷ #cuda (32 messages🔥):


CUDA MODE (Mark Saroufim) ▷ #beginner (9 messages🔥):

Links mentioned:

How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog: In this post, I’ll iteratively optimize an implementation of matrix multiplication written in CUDA. My goal is not to build a cuBLAS replacement, but to deepl...


LangChain AI ▷ #general (20 messages🔥):


LangChain AI ▷ #langserve (1 message):

rebelsandrobots_97106: Thanks!


LangChain AI ▷ #share-your-work (5 messages):



LangChain AI ▷ #tutorials (3 messages):



LLM Perf Enthusiasts AI ▷ #embeddings (1 message):

natureplayer: https://huggingface.co/spaces/mteb/leaderboard


LLM Perf Enthusiasts AI ▷ #feedback-meta (1 message):


LLM Perf Enthusiasts AI ▷ #openai (8 messages🔥):


LLM Perf Enthusiasts AI ▷ #prompting (7 messages):


Alignment Lab AI ▷ #general-chat (6 messages):


Alignment Lab AI ▷ #oo (1 message):

cryptossssun: 🤔


Alignment Lab AI ▷ #looking-for-work (1 message):


Datasette - LLM (@SimonW) ▷ #ai (4 messages):

Links mentioned:

Infinite Craft: A game about crafting


DiscoResearch ▷ #general (4 messages):



Skunkworks AI ▷ #off-topic (1 message):

pradeep1148: https://www.youtube.com/watch?v=SEavari8xaU


Skunkworks AI ▷ #bakklava-1 (1 message):

.mrfoo: LLaVA 1.6 dropped : https://llava-vl.github.io/blog/2024-01-30-llava-1-6/