Frozen AI News archive

12/11/2023: Mixtral beats GPT3.5 and Llama2-70B

**Mistral AI** announced the **Mixtral 8x7B** model featuring a Sparse Mixture of Experts (SMoE) architecture, sparking discussions on its potential to rival **GPT-4**. The community debated GPU hardware options for training and fine-tuning transformer models, including **RTX 4070s**, **A4500**, **RTX 3090s with nvlink**, and **A100 GPUs**. Interest was expressed in fine-tuning Mixtral and generating quantized versions, alongside curating high-quality coding datasets. Resources shared include a YouTube video on open-source model deployment, an Arxiv paper, GitHub repositories, and a blog post on Mixture-of-Experts. Discussions also touched on potential open-source releases of **GPT-3.5 Turbo** and **llama-3**, and running **OpenHermes 2.5** on Mac M3 Pro with VRAM considerations.

Canonical issue URL

image.png

And people are rightfully cheering. They also announced their API platform today.

[TOC]

Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #off-topic (13 messages🔥):

▷ #interesting-links (13 messages🔥):

▷ #general (545 messages🔥🔥🔥):

▷ #ask-about-llms (50 messages🔥):


OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (69 messages🔥🔥):

▷ #openai-chatter (131 messages🔥🔥):

▷ #openai-questions (97 messages🔥🔥):

▷ #gpt-4-discussions (69 messages🔥🔥):

▷ #prompt-engineering (93 messages🔥🔥):

▷ #api-discussions (93 messages🔥🔥):


DiscoResearch Discord Summary

DiscoResearch Channel Summaries

▷ #disco_judge (1 messages):

nagaraj_arvind: They are the same

▷ #mixtral_implementation (322 messages🔥🔥):

▷ #general (15 messages🔥):

▷ #benchmark_dev (2 messages):


HuggingFace Discord Discord Summary

HuggingFace Discord Channel Summaries

▷ #general (70 messages🔥🔥):

▷ #today-im-learning (1 messages):

neuralink: the last three days i learned: implemented 0.01% of DiLoCo decentralized pre-training

▷ #cool-finds (1 messages):

▷ #i-made-this (6 messages):

▷ #reading-group (2 messages):

▷ #diffusion-discussions (1 messages):

▷ #computer-vision (15 messages🔥):

▷ #NLP (34 messages🔥):

▷ #diffusion-discussions (1 messages):


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (64 messages🔥🔥):

▷ #axolotl-dev (16 messages🔥):

▷ #general-help (9 messages🔥):

▷ #datasets (1 messages):


LangChain AI Discord Summary

LangChain AI Channel Summaries

▷ #general (69 messages🔥🔥):

▷ #share-your-work (1 messages):


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (8 messages🔥):

▷ #ai-event-announcements (1 messages):

swyxio: Nov recap here! https://fxtwitter.com/latentspacepod/status/1734245367817093479


Alignment Lab AI Discord Summary

Alignment Lab AI Channel Summaries

▷ #oo (5 messages):

▷ #open-orca-community-chat (1 messages):


Skunkworks AI Discord Summary

Skunkworks AI Channel Summaries

▷ #finetune-experts (1 messages):

zq_dev: Anybody attempting to instruction tune mixtral yet?

▷ #moe-main (1 messages):

▷ #off-topic (1 messages):

pradeep1148: https://www.youtube.com/watch?v=yKwRf8IwTNI


MLOps @Chipro Discord Summary

MLOps @Chipro Channel Summaries

▷ #events (1 messages):

ty.x.202.: @everyone https://discord.gg/FKYww6Fn?event=1183435830711296032

▷ #general-ml (1 messages):

fehir: New EU legislation in nutshell


The Ontocord (MDEL discord) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI Engineer Foundation Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Perplexity AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it