Frozen AI News archive

1/8/2024: The Four Wars of the AI Stack

The **Nous Research AI Discord** discussions covered several topics, including the use of **DINO**, **CLIP**, and **CNNs** in the **Obsidian Project**. A research paper on **DistAttention** and **DistKV-LLM**, distributed techniques aimed at the challenges of cloud-based **LLM** serving, was shared. Another paper, 'Self-Extend LLM Context Window Without Tuning', argued that existing **LLMs** can inherently handle long contexts. The community also discussed models such as **Mixtral**, favored for its **32k context window**, and compared it with **Mistral** and **Marcoroni**. Other topics included hierarchical embeddings, agentic retrieval-augmented generation (**RAG**), synthetic data for fine-tuning, and applications of **LLMs** in the oil & gas industry. The launch of the **AgentSearch-V1** dataset, with one billion embedding vectors, was also announced, and the discussions touched on **mixture-of-experts (MoE)** implementations and the performance of smaller models.

Canonical issue URL

https://www.latent.space/p/dec-2023

Enjoy!


Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #ctx-length-research (2 messages):

▷ #off-topic (5 messages):

▷ #interesting-links (5 messages):

▷ #general (79 messages🔥🔥):

Links mentioned:

Tweet from Owen Colegrove (@ocolegro): The full dataset for AgentSearch-V1 is now available on HF!! Recommended: @qdrant_engine - for indexing and search @nomic_ai - for visualization I'm looking to expand what is indexed - agent spe...

▷ #ask-about-llms (54 messages🔥):

Links mentioned:

GitHub - cg123/mergekit: Tools for merging pretrained large language models.

▷ #project-obsidian (1 message):

qnguyen3: DINO, CLIP and CNNs for now


Eleuther Discord Summary

Eleuther Channel Summaries

▷ #general (29 messages🔥):

▷ #research (33 messages🔥):

▷ #scaling-laws (5 messages):

▷ #lm-thunderdome (16 messages🔥):


OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (30 messages🔥):

▷ #gpt-4-discussions (26 messages🔥):

▷ #prompt-engineering (1 message):

▷ #api-discussions (1 message):


Perplexity AI Discord Summary

Perplexity AI Channel Summaries

▷ #general (23 messages🔥):

▷ #sharing (4 messages):

▷ #pplx-api (12 messages🔥):


Mistral Discord Summary

Mistral Channel Summaries

▷ #general (9 messages🔥):

▷ #models (2 messages):

▷ #deployment (1 message):

joselolol.: Hello good sir, consider using the MLX framework!

▷ #ref-implem (1 message):

akshay_1: https://docs.mistral.ai/platform/guardrailing/

▷ #finetuning (2 messages):

▷ #showcase (4 messages):

▷ #random (5 messages):

Links mentioned:

The Origin of Consciousness in the Breakdown of the Bicameral Mind - Wikipedia

▷ #la-plateforme (3 messages):


DiscoResearch Discord Summary

Only 1 channel had activity, so no need to summarize...


Latent Space Discord Summary

Only 1 channel had activity, so no need to summarize...


LAION Discord Summary

LAION Channel Summaries

▷ #general (8 messages🔥):

▷ #research (2 messages):