Frozen AI News archive

The Dissection of Smaug (72B)

**Abacus AI** launched **Smaug 72B**, a large finetune of **Qwen 1.0**, which remains unchallenged on the **Hugging Face Open LLM Leaderboard** despite skepticism from **Nous Research**. **LAION** introduced a local voice assistant model named **Bud-E** with a notable demo. The **TheBloke Discord** community discussed model performance trade-offs between large models like **GPT-4** and smaller quantized models, fine-tuning techniques using datasets like **WizardLM_evol_instruct_V2_196k** and **OpenHermes-2.5**, and challenges in web UI development and model merging involving **Mistral-7b** and **MiquMaid**. The **LM Studio Discord** highlighted issues with model conversion from PyTorch to gguf, hardware setups involving **Intel Xeon CPUs** and **Nvidia P40 GPUs**, privacy concerns, and limitations in image generation and web UI availability.

Canonical issue URL


It's now the Chinese year of the Dragon, and Abacus AI appropriately rung it in making a lot of noise about Smaug 72B, their latest and largest finetune of Qwen (1.0... badly timed since 1.5 just came up, but you can be sure they will update it with more noise)

image.png

Typical skepticism aside, it is still standing unchallenged after a week on the HF Open LLM Leaderboard, and with published contamination results, which is a good sign. However the Nous people are skeptical:

image.png

In other news, LAION popped up with an adorably named local voice assistant model with a great demo.


Table of Contents

[TOC]

PART 1: High level Discord summaries

TheBloke Discord Summary


LM Studio Discord Summary


HuggingFace Discord Summary


OpenAI Discord Summary


Nous Research AI Discord Summary


Eleuther Discord Summary


LAION Discord Summary


Perplexity AI Discord Summary


CUDA MODE Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


LlamaIndex Discord Summary


LangChain AI Discord Summary


Mistral Discord Summary

One Size Fits All with Mistral's Subscription: Users discussed the subscription model for the Mistral Discord chatbot, confirming it is a unified model with payment per token and scalable deployment, highlighted by @mrdragonfox; quantized models, such as those found on Hugging Face, were also mentioned as requiring less RAM.

GPU Economics: Rent vs. Own: @i_am_dom analyzed the cost-effectiveness of Google GPU rentals versus owning hardware like A100s 40GB, suggesting that after 70000 computational units or about half a year of use, owning GPUs could be more economical.

Docker Deployment Discussion: A request for docker_compose.yml for deploying Mistral AI indicates ongoing discussions about streamlining Mistral AI setups as REST APIs in Docker environments.

Fine-Tuning for Self-Awareness and Personal Assistants: Fine-tuning topics ranged from installation success on Cloudfare AI maker to a lack of self-awareness in models, as noted by @dawn.dusk in relation to GPT-4 and Mistral; a Datacamp tutorial was recommended for learning use cases and prompts.

Showcasing Mistral’s Capabilities: @jakobdylanc’s Discord chatbot with collaborative LLM prompting feature supports multiple models including Mistral with a lean 200-line implementation, available on GitHub; additionally, Mistral 7b's note-taking prowess was spotlighted in an article at Hacker Noon, outperforming higher-rated models.


Latent Space Discord Summary


DiscoResearch Discord Summary


LLM Perf Enthusiasts AI Discord Summary

Please note that the other message from rabiat did not contain sufficient context or information relevant for a technical, detail-oriented engineer audience and thus was omitted from the summary.


Alignment Lab AI Discord Summary


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1251 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (535 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (30 messages🔥):

Links mentioned:


TheBloke ▷ #coding (10 messages🔥):

Links mentioned:


LM Studio ▷ #💬-general (399 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (92 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (1 messages):

Links mentioned:

Model Object Teardowns - HackMD: Model File Formats


LM Studio ▷ #🎛-hardware-discussion (197 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (1 messages):

ramendraws: :1ski_smug:


LM Studio ▷ #autogen (2 messages):


LM Studio ▷ #avx-beta (1 messages):


HuggingFace ▷ #announcements (1 messages):


HuggingFace ▷ #general (603 messages🔥🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (8 messages🔥):

Links mentioned:

HTTPX WS: no description found


HuggingFace ▷ #cool-finds (5 messages):

Links mentioned:


HuggingFace ▷ #i-made-this (10 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (20 messages🔥):

Links mentioned:

Mamba Presentation - When2meet: no description found


HuggingFace ▷ #computer-vision (1 messages):


HuggingFace ▷ #NLP (20 messages🔥):


OpenAI ▷ #ai-discussions (234 messages🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (37 messages🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


OpenAI ▷ #prompt-engineering (63 messages🔥🔥):


OpenAI ▷ #api-discussions (63 messages🔥🔥):


Nous Research AI ▷ #off-topic (11 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (22 messages🔥):

Links mentioned:


Nous Research AI ▷ #general (228 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (30 messages🔥):


Nous Research AI ▷ #project-obsidian (8 messages🔥):


Eleuther ▷ #general (162 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (66 messages🔥🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (7 messages):

Links mentioned:


Eleuther ▷ #lm-thunderdome (11 messages🔥):


LAION ▷ #general (166 messages🔥🔥):

Links mentioned:


LAION ▷ #announcements (1 messages):

Links mentioned:


LAION ▷ #research (5 messages):

Links mentioned:


Perplexity AI ▷ #general (77 messages🔥🔥):

Links mentioned:


Perplexity AI ▷ #sharing (11 messages🔥):


Perplexity AI ▷ #pplx-api (9 messages🔥):

Links mentioned:

Feature Roadmap: no description found


CUDA MODE ▷ #general (9 messages🔥):

Links mentioned:


CUDA MODE ▷ #triton (1 messages):

Links mentioned:

ACO: no description found


CUDA MODE ▷ #cuda (38 messages🔥):

Links mentioned:


CUDA MODE ▷ #torch (11 messages🔥):

Links mentioned:

Tweet from jack morris (@jxmnop): welp. this is what happened when i tried to use torch compile


CUDA MODE ▷ #announcements (2 messages):

Links mentioned:

Join the CUDA MODE Discord Server!: CUDA reading group | 4068 members


CUDA MODE ▷ #algorithms (1 messages):

ericauld: Very interested, though I just realized I'm like a month late


CUDA MODE ▷ #youtube-recordings (3 messages):


OpenAccess AI Collective (axolotl) ▷ #general (28 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (10 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (27 messages🔥):

Links mentioned:


LlamaIndex ▷ #announcements (1 messages):

Links mentioned:

LLMs for Advanced Question-Answering over Tabular/CSV/SQL Data (Building Advanced RAG, Part 2): In the second video of this series we show you how to compose an simple-to-advanced query pipeline over tabular data. This includes using LLMs to infer both ...


LlamaIndex ▷ #blog (5 messages):

Links mentioned:

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding: Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with g...


LlamaIndex ▷ #general (53 messages🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (2 messages):

Links mentioned:

Video Revolution: GPT4V and LlamaIndex Unleashed: Ankush k Singal


LangChain AI ▷ #general (23 messages🔥):

Links mentioned:


LangChain AI ▷ #langserve (1 messages):


LangChain AI ▷ #share-your-work (8 messages🔥):

Links mentioned:


LangChain AI ▷ #tutorials (2 messages):

Links mentioned:


Mistral ▷ #general (18 messages🔥):

Links mentioned:

TheBloke/Mistral-7B-Instruct-v0.2-GGUF · Hugging Face: no description found


Mistral ▷ #models (2 messages):


Mistral ▷ #deployment (1 messages):


Mistral ▷ #finetuning (3 messages):

Links mentioned:

Mistral 7B Tutorial: A Step-by-Step Guide to Using and Fine-Tuning Mistral 7B: The tutorial covers accessing, quantizing, fine-tuning, merging, and saving this powerful 7.3 billion parameter open-source language model.


Mistral ▷ #showcase (4 messages):

Links mentioned:


Mistral ▷ #la-plateforme (2 messages):


Latent Space ▷ #ai-general-chat (10 messages🔥):

Links mentioned:


DiscoResearch ▷ #general (8 messages🔥):

Links mentioned:

‎Gemini - chat to supercharge your ideas: Bard is now Gemini. Get help with writing, planning, learning, and more from Google AI.


DiscoResearch ▷ #discolm_german (1 messages):

Links mentioned:

GitHub - uclaml/SPIN: The official implementation of Self-Play Fine-Tuning (SPIN): The official implementation of Self-Play Fine-Tuning (SPIN) - uclaml/SPIN


LLM Perf Enthusiasts AI ▷ #speed (1 messages):

rabiat: Interesting thought 🙂


LLM Perf Enthusiasts AI ▷ #openai (3 messages):


Alignment Lab AI ▷ #oo (2 messages):


Skunkworks AI ▷ #off-topic (1 messages):

pradeep1148: https://www.youtube.com/watch?v=W4T7zHluzaM