Frozen AI News archive

Miqu confirmed to be an early Mistral-medium checkpoint

**Miqu**, an open access model, scores **74 on MMLU** and **84.5 on EQ-Bench**, sparking debates about its performance compared to **Mistral Medium**. The **CEO of Mistral** confirmed these results. Discussions in the **TheBloke Discord** highlight **Miqu's** superiority in instruction-following and sampling methods like dynatemp and min-p. Developers also explore browser preferences and Discord UI themes. Role-playing with models like **BagelMistery Tour v2** and **Psyfighter v2** is popular, alongside technical talks on **fp16 quantization** of **Miqu-1-70b**. Training and fine-tuning tips for models like **Unsloth** and **Mistral 7B** are shared. In the **Nous Research AI Discord**, the **Activation Beacon** method is discussed for extending LLM context length from 4K to 400K tokens. **SQLCoder-70B**, fine-tuned on **CodeLlama-70B**, leads in text-to-SQL generation and is available on Hugging Face. The **Miqu model** also impresses with an **83.5 EQ-Bench score**, fueling speculation about its capabilities.

Canonical issue URL

There's been a lot of speculation about the surprisingly good open access (not open source, because no license) model Miqu - scoring 74 on MMLU (vs 75 for mistral-medium) and 84.5 on EQ-bench, a subjectively better version of MMLU. There've been a lot of debates both for and against this fact - but the CEO of Mistral has now come out and confirmed it.

image.png

So technically we can't use this model but it's an interesting leak for sure.


Table of Contents

[TOC]

PART 1: High level Discord summaries

TheBloke Discord Summary


Nous Research AI Discord Summary


LM Studio Discord Summary

These summaries highlight key discussions on technical challenges, model performance, and innovative applications within the LM Studio community, reflecting a vibrant dialogue on AI technology's frontiers among practitioners.


OpenAI Discord Summary


Eleuther Discord Summary


Mistral Discord Summary


HuggingFace Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


Perplexity AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


LangChain AI Discord Summary


LlamaIndex Discord Summary


LAION Discord Summary


DiscoResearch Discord Summary


Latent Space Discord Summary


Alignment Lab AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


AI Engineer Foundation Discord Summary


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1209 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (285 messages🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (17 messages🔥):


TheBloke ▷ #model-merging (4 messages):

Links mentioned:

k-NN In Mixture of Experts: no description found


TheBloke ▷ #coding (19 messages🔥):


Nous Research AI ▷ #ctx-length-research (1 messages):

dreamgen: "global state" tokens sound similar to attention sinks, did not read the paper though


Nous Research AI ▷ #off-topic (26 messages🔥):

Links mentioned:


Nous Research AI ▷ #benchmarks-log (14 messages🔥):


Nous Research AI ▷ #interesting-links (15 messages🔥):

Links mentioned:


Nous Research AI ▷ #general (502 messages🔥🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (58 messages🔥🔥):

Links mentioned:


LM Studio ▷ #💬-general (184 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (61 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (5 messages):


LM Studio ▷ #🎛-hardware-discussion (99 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (18 messages🔥):

Links mentioned:

TheBloke/Dr_Samantha-7B-GGUF · Hugging Face: no description found


LM Studio ▷ #autogen (1 messages):


LM Studio ▷ #langchain (2 messages):


OpenAI ▷ #ai-discussions (3 messages):


OpenAI ▷ #gpt-4-discussions (170 messages🔥🔥):


OpenAI ▷ #prompt-engineering (32 messages🔥):


OpenAI ▷ #api-discussions (32 messages🔥):


Eleuther ▷ #general (74 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (96 messages🔥🔥):

Links mentioned:


Eleuther ▷ #lm-thunderdome (3 messages):

Links mentioned:


Eleuther ▷ #multimodal-general (2 messages):


Eleuther ▷ #gpt-neox-dev (7 messages):


Mistral ▷ #general (147 messages🔥🔥):

Links mentioned:


Mistral ▷ #finetuning (2 messages):


Mistral ▷ #showcase (1 messages):

Links mentioned:

GitHub - brett-baudin-consulting/uMdali: Enterprise Chat Front End: Enterprise Chat Front End. Contribute to brett-baudin-consulting/uMdali development by creating an account on GitHub.


Mistral ▷ #la-plateforme (4 messages):

Links mentioned:

GitHub - mistralai/platform-docs-public: Contribute to mistralai/platform-docs-public development by creating an account on GitHub.


HuggingFace ▷ #announcements (1 messages):

Links mentioned:


HuggingFace ▷ #general (97 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #cool-finds (1 messages):

Links mentioned:

Multimodal Malaysian LLM dataset - a mesolitica Collection: no description found


HuggingFace ▷ #i-made-this (3 messages):

Links mentioned:


HuggingFace ▷ #reading-group (3 messages):


HuggingFace ▷ #diffusion-discussions (5 messages):

Links mentioned:

works 徳永明正-航空イラストなど-: no description found


HuggingFace ▷ #NLP (7 messages):

Links mentioned:

LLama cpp problem ( gpu support) · Issue #509 · abetlen/llama-cpp-python: Hello, I am completly newbie, when it comes to the subject of llms I install some ggml model to oogabooga webui And I try to use it. It works fine, but only for RAM. For VRAM only uses 0.5gb, and I...


HuggingFace ▷ #diffusion-discussions (5 messages):

Links mentioned:

works 徳永明正-航空イラストなど-: no description found


OpenAccess AI Collective (axolotl) ▷ #general (23 messages🔥):

Links mentioned:

152334H/miqu-1-70b-sf · Hugging Face: no description found


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (7 messages):


OpenAccess AI Collective (axolotl) ▷ #general-help (65 messages🔥🔥):


OpenAccess AI Collective (axolotl) ▷ #community-showcase (1 messages):

Links mentioned:

Tweet from Fabrizio Milo (@fabmilo): Had fun experimenting on function calling outside #OpenAI API. Sharing my #colab [1] that leverages @ggerganov's llama.cpp[2] / @abetlen llamacpp python wrapper [3] + @LangChainAI wrapper + @ca...


OpenAccess AI Collective (axolotl) ▷ #runpod-help (9 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #deployment-help (1 messages):

yamashi: Parallel req


Perplexity AI ▷ #general (12 messages🔥):


Perplexity AI ▷ #sharing (10 messages🔥):

Links mentioned:


Perplexity AI ▷ #pplx-api (61 messages🔥🔥):

Links mentioned:


LLM Perf Enthusiasts AI ▷ #triton (5 messages):

Links mentioned:

triton.language.debug_barrier — Triton documentation: no description found


LLM Perf Enthusiasts AI ▷ #cuda (24 messages🔥):

Links mentioned:


LLM Perf Enthusiasts AI ▷ #torch (1 messages):

andreaskoepf: https://x.com/pytorch/status/1752406904809341165


LLM Perf Enthusiasts AI ▷ #algorithms (2 messages):


LLM Perf Enthusiasts AI ▷ #suggestions (1 messages):

vim410: Thanks for sharing, i am one of the person who wrote the article. 🙂


LLM Perf Enthusiasts AI ▷ #jobs (2 messages):

Links mentioned:

Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.: A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team


LLM Perf Enthusiasts AI ▷ #beginner (8 messages🔥):

Links mentioned:


LLM Perf Enthusiasts AI ▷ #pmpp-book (6 messages):


LangChain AI ▷ #announcements (2 messages):

Links mentioned:

GitHub Incident: Forks not being recognized, PRs automatically closed · langchain-ai/langchain · Discussion #16796: As of Jan 30, 2024 9:30am PST we're aware that most LangChain forks have stopped being recognized as forks, and the corresponding PRs have automatically been closed. We're in contact with the ...


LangChain AI ▷ #general (35 messages🔥):

Links mentioned:


LangChain AI ▷ #langserve (3 messages):


LangChain AI ▷ #share-your-work (5 messages):

Links mentioned:


LangChain AI ▷ #tutorials (3 messages):

Links mentioned:


LlamaIndex ▷ #blog (2 messages):


LlamaIndex ▷ #general (37 messages🔥):


LlamaIndex ▷ #ai-discussion (1 messages):

andysingal: hallucination-leaderboard. https://github.com/vectara/hallucination-leaderboard


LAION ▷ #general (17 messages🔥):

Links mentioned:

Imperium Of Man - Warhammer 40k: Imperium Of Man - Warhammer 40k is a fan-made (unofficial) trailer by JustMovies, produced using various AI generative tools. What started as a project a few...


LAION ▷ #research (17 messages🔥):

Links mentioned:


DiscoResearch ▷ #disco_judge (1 messages):

huunguyen: <@213644857309134849> - any luck on the prometheus mistral model for en?


DiscoResearch ▷ #general (21 messages🔥):

Links mentioned:


DiscoResearch ▷ #embedding_dev (3 messages):

Links mentioned:


DiscoResearch ▷ #discolm_german (1 messages):

Links mentioned:

DiscoResearch/DiscoLM_German_7b_v1 · Hugging Face: no description found


Latent Space ▷ #ai-general-chat (21 messages🔥):

Links mentioned:


Latent Space ▷ #llm-paper-club (2 messages):

Links mentioned:


Alignment Lab AI ▷ #general-chat (3 messages):

Links mentioned:


Alignment Lab AI ▷ #looking-for-workers (1 messages):


LLM Perf Enthusiasts AI ▷ #general (2 messages):


AI Engineer Foundation ▷ #general (1 messages):

Links mentioned:

Guide to Submit Projects to AI Engineer Foundation: no description found