Frozen AI News archive

12/8/2023 - Mamba v Mistral v Hyena

Three new AI models are highlighted: **Mistral's 8x7B MoE model (Mixtral)**, **Mamba models** up to 3B by Together, and **StripedHyena 7B**, a competitive subquadratic attention model from Stanford's Hazy Research. Discussions on **Anthropic's Claude 2.1** focus on its prompting technique and alignment challenges. The **Gemini AI** from Google is noted as potentially superior to **GPT-4**. The community also explores **Dreambooth** for image training and shares resources like the **DialogRPT-human-vs-machine** model on Hugging Face. Deployment challenges for large language models, including CPU performance and GPU requirements, are discussed with references to **Falcon 180B** and transformer batching techniques. User engagement includes meme sharing and humor.

Canonical issue URL

This is all very substantial and shows what happens when you ship model weights instead of heavily edited marketing videos.

[TOC]

Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #ctx-length-research (8 messages🔥):

▷ #off-topic (8 messages🔥):

▷ #benchmarks-log (1 messages):

nonameusr: https://huggingface.co/allenai/tulu-2-dpo-70b

▷ #interesting-links (5 messages):

▷ #general (375 messages🔥🔥):

▷ #ask-about-llms (80 messages🔥🔥):

▷ #memes (2 messages):


OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (92 messages🔥🔥):

▷ #openai-chatter (246 messages🔥🔥):

▷ #openai-questions (69 messages🔥🔥):

▷ #gpt-4-discussions (31 messages🔥):

▷ #prompt-engineering (11 messages🔥):

▷ #api-discussions (11 messages🔥):


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (179 messages🔥🔥):

▷ #axolotl-dev (56 messages🔥🔥):

▷ #general-help (12 messages🔥):

▷ #rlhf (3 messages):

▷ #community-showcase (2 messages):

▷ #runpod-help (10 messages🔥):

▷ #docs (1 messages):

le_mess: Bro stop advertising this. I've said it before 😅


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (45 messages🔥):

▷ #ai-event-announcements (2 messages):

▷ #llm-paper-club (8 messages🔥):


LangChain AI Discord Summary

LangChain AI Channel Summaries

▷ #general (40 messages🔥):

▷ #share-your-work (2 messages):


LLM Perf Enthusiasts AI Discord Summary

LLM Perf Enthusiasts AI Channel Summaries

▷ #gpt4 (8 messages🔥):

▷ #finetuning (1 messages):

robhaisfield: Anyone have a JSON I can use to fine-tune 3.5-turbo so I can just see how it works?

▷ #opensource (3 messages):

▷ #irl (4 messages):


Skunkworks AI Discord Summary

Skunkworks AI Channel Summaries

▷ #general (2 messages):

▷ #papers (3 messages):

▷ #moe-main (2 messages):

▷ #off-topic (1 messages):

pradeep1148: https://youtu.be/mAGLD5598cs


MLOps @Chipro Discord Summary

MLOps @Chipro Channel Summaries

▷ #events (2 messages):

▷ #general-ml (2 messages):


Alignment Lab AI Discord Summary

Alignment Lab AI Channel Summaries

▷ #open-orca-community-chat (1 messages):

▷ #looking-for-workers (1 messages):


The Ontocord (MDEL discord) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI Engineer Foundation Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Perplexity AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it