Frozen AI News archive

12/9/2023: The Mixtral Rush

**Mixtral's weights** were released without code, prompting the **Disco Research community** and **Fireworks AI** to implement it rapidly. Despite efforts, no significant benchmark improvements were reported, limiting its usefulness for local LLM usage but marking progress for the **small models community**. Discussions in the DiscoResearch Discord covered **Mixtral's performance** compared to models like **Hermes 2.5** and **Hermes 2**, with evaluations on benchmarks such as **winogrande**, **truthfulqa_mc2**, and **arc_challenge**. Technical topics included GPU requirements, multi-GPU setups, and quantization via **GPTQ**. Benchmarking strategies like grammar-based evaluation, chain of thought (CoT), and min_p sampling were explored, alongside model sampling techniques like Min P and Top P to enhance response stability and creativity. Users also discussed GPTs' learning limitations and the adaptability of models under varying conditions, emphasizing min_p sampling's role in enabling higher temperature settings for creativity.

Canonical issue URL

image.png

We also saw similar efforts from Fireworks AI:

image.png

Unfortunately nobody has reported significant benchmark improvements and it is not likely to be useful for local LLM usage. Still, great progress for the smol models community.

[TOC]

DiscoResearch Discord Summary

DiscoResearch Channel Summaries

▷ #disco_judge (1 messages):

cryptossssun: is there any plan of dev the Mixtral Model?

▷ #mixtral_implementation (651 messages🔥🔥🔥):

▷ #general (222 messages🔥🔥):

▷ #benchmark_dev (111 messages🔥🔥):


Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #off-topic (7 messages):

▷ #benchmarks-log (7 messages):

▷ #interesting-links (34 messages🔥):

▷ #general (667 messages🔥🔥🔥):

▷ #ask-about-llms (40 messages🔥):


OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (54 messages🔥):

▷ #openai-chatter (106 messages🔥🔥):

▷ #openai-questions (55 messages🔥🔥):

▷ #gpt-4-discussions (32 messages🔥):

▷ #prompt-engineering (40 messages🔥):

▷ #api-discussions (40 messages🔥):


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (47 messages🔥):

▷ #axolotl-dev (135 messages🔥🔥):

▷ #other-llms (1 messages):

▷ #general-help (11 messages🔥):

▷ #datasets (10 messages🔥):

▷ #runpod-help (1 messages):


HuggingFace Discord Discord Summary

HuggingFace Discord Channel Summaries

▷ #general (57 messages🔥🔥):

▷ #today-im-learning (7 messages):

▷ #cool-finds (1 messages):

fblgit: Introducing.. Xaberius 34B, the #1 LLM 🙂 And its just a beta... weakest checkpoint 🙂

▷ #i-made-this (7 messages):

▷ #diffusion-discussions (3 messages):

▷ #computer-vision (3 messages):

▷ #NLP (4 messages):

▷ #diffusion-discussions (3 messages):


Alignment Lab AI Discord Summary

Alignment Lab AI Channel Summaries

▷ #general-chat (4 messages):

▷ #oo (18 messages🔥):

▷ #oo2 (5 messages):


LangChain AI Discord Summary

LangChain AI Channel Summaries

▷ #announcements (1 messages):

▷ #general (9 messages🔥):

▷ #langserve (1 messages):

▷ #langchain-templates (1 messages):

▷ #share-your-work (1 messages):

▷ #tutorials (1 messages):


LLM Perf Enthusiasts AI Discord Summary

LLM Perf Enthusiasts AI Channel Summaries

▷ #opensource (4 messages):

▷ #offtopic (2 messages):

▷ #speed (5 messages):

Azure vs GPT-4 Performance:

Optimum-NVIDIA on Hugging Face For Fast LLM Inference:

▷ #rag (3 messages):


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (6 messages):

▷ #llm-paper-club (1 messages):

swyxio: it’s literally “[INST]”, part of


Ontocord (MDEL discord) Discord Summary

Only 1 channel had activity, so no need to summarize...

xa9ax: Who all are heading to NeurIPS?


The Skunkworks AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI Engineer Foundation Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Perplexity AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.