Frozen AI News archive

RWKV "Eagle" v5: Your move, Mamba

**RWKV v5 Eagle** was released with better-than-**mistral-7b** evaluation results, trading some English performance for multilingual capabilities. The mysterious **miqu-1-70b** model sparked debate about its origins, possibly a leak or distillation of **Mistral Medium** or a fine-tuned **Llama 2**. Discussions highlighted fine-tuning techniques, including the effectiveness of **1,000 high-quality prompts** over larger mixed-quality datasets, and tools like **Deepspeed**, **Axolotl**, and **QLoRA**. The **Nous Research AI** community emphasized the impact of **Rotary Position Embedding (RoPE) theta settings** on LLM extrapolation, improving models like **Mistral Instruct v0.2**. Speed improvements in **Mistral Tuna** kernels reduced token processing costs, enhancing efficiency. The launch of **Eagle 7B** with 7.52B parameters showcased strong multilingual performance, surpassing other 7B class models.

Canonical issue URL

RWKV v5 ("Eagle") was released this weekend, with better-than-mistral-7b-size evals, and an acknowledgement that it trades off English performance for multilingual capabilities. Stella from EleutherAI (who has supported RWKV from the beginning - see the RWKV pod on Latent Space) put it best:

image.png

In other news, there's much speculation about miqu-1-70b, which could be a leak or distillation of Mistral-Medium (not proven either way). There's also more discussion about the Bard upset on the LMsys board.

--

Table of Contents

[TOC]

PART 1: High level Discord summaries

TheBloke Discord Summary

Noteworthy Projects and Resources:


Nous Research AI Discord Summary


OpenAI Discord Summary

Key Document Mentioned: The community shared OpenAI's December 17th, 2023 Prompt Engineering Guide, a resource loaded into GPT for those exploring advanced prompt engineering strategies.


LM Studio Discord Summary


Eleuther Discord Summary


Mistral Discord Summary


HuggingFace Discord Summary


LangChain AI Discord Summary


LAION Discord Summary


Perplexity AI Discord Summary


LlamaIndex Discord Summary


Latent Space Discord Summary


DiscoResearch Discord Summary


Alignment Lab AI Discord Summary


AI Engineer Foundation Discord Summary


The Skunkworks AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Perf Enthusiasts AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Datasette - LLM (@SimonW) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Ontocord (MDEL discord) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1395 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (578 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (71 messages🔥🔥):


TheBloke ▷ #coding (27 messages🔥):

Links mentioned:


Nous Research AI ▷ #ctx-length-research (10 messages🔥):

Links mentioned:

Scaling Laws of RoPE-based Extrapolation: The extrapolation capability of Large Language Models (LLMs) based on Rotary Position Embedding is currently a topic of considerable interest. The mainstream approach to addressing extrapolation with ...


Nous Research AI ▷ #off-topic (187 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (57 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #general (607 messages🔥🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (51 messages🔥):

Links mentioned:


OpenAI ▷ #ai-discussions (164 messages🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (75 messages🔥🔥):


OpenAI ▷ #prompt-engineering (299 messages🔥🔥):

Links mentioned:

OpenAI's Dec 17th, 2023 Prompt Engineering Guide: OpenAI dropped the Prompt Engineering guide today. Guide: https://platform.openai.com/docs/guides/prompt-engineering It is loaded into this GPT if you don’t want to do that yourself. This GPT also h...


OpenAI ▷ #api-discussions (299 messages🔥🔥):

Links mentioned:

OpenAI's Dec 17th, 2023 Prompt Engineering Guide: OpenAI dropped the Prompt Engineering guide today. Guide: https://platform.openai.com/docs/guides/prompt-engineering It is loaded into this GPT if you don’t want to do that yourself. This GPT also h...


LM Studio ▷ #💬-general (316 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (74 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (5 messages):


LM Studio ▷ #🎛-hardware-discussion (144 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (6 messages):

Links mentioned:

no title found: no description found


LM Studio ▷ #autogen (7 messages):

Links mentioned:

GitHub - nexusflowai/NexusRaven-V2: Contribute to nexusflowai/NexusRaven-V2 development by creating an account on GitHub.


Eleuther ▷ #general (108 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (131 messages🔥🔥):

Links mentioned:


Eleuther ▷ #lm-thunderdome (29 messages🔥):

Links mentioned:


Eleuther ▷ #gpt-neox-dev (34 messages🔥):

Links mentioned:


Mistral ▷ #general (92 messages🔥🔥):

Links mentioned:


Mistral ▷ #deployment (3 messages):


Mistral ▷ #finetuning (16 messages🔥):


Mistral ▷ #showcase (18 messages🔥):

Links mentioned:


Mistral ▷ #random (3 messages):

Links mentioned:

I asked XBOX's CFO about the Metaverse, XBOX in 2030, VR, & tech's future: in this mess of a video I chat with Kevin about the future of human-computer-interaction and nerd out about virtual reality, the metaverse, and some other st...


Mistral ▷ #la-plateforme (17 messages🔥):

Links mentioned:

Endpoints | Mistral AI Large Language Models): We provide different endpoints with different price/performance tradeoffs. Our endpoints depend on internal models.


HuggingFace ▷ #general (70 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (7 messages):

Links mentioned:


HuggingFace ▷ #cool-finds (1 messages):

Links mentioned:

Meta's Text to Audio is INSANE - MAGNet, Moondream & ZeroShape!: A brief video about some of the trending huggingfac spaces of the past weeks. In this video, we explore 3-4 different AI apps and validate their functionalit...


HuggingFace ▷ #i-made-this (10 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (8 messages🔥):

Links mentioned:

Literature Review on AI in Law: This blog was inspired by Owl from the Laion Discord server. Thanks for the discussions! In this blog, my main goal is to go through why…


HuggingFace ▷ #diffusion-discussions (6 messages):

Links mentioned:


HuggingFace ▷ #computer-vision (9 messages🔥):

Links mentioned:

Depth Anything: no description found


HuggingFace ▷ #NLP (22 messages🔥):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (6 messages):

Links mentioned:


LangChain AI ▷ #general (77 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #langserve (8 messages🔥):


LangChain AI ▷ #langchain-templates (1 messages):


LangChain AI ▷ #share-your-work (8 messages🔥):

Links mentioned:


LangChain AI ▷ #tutorials (1 messages):

Links mentioned:

Create Chat UI Using ChainLit, LangChain, Ollama & Mistral 🧠: In this video, I am demonstrating how you can create a simple ChatGPT like UI in locally in your computer. You can follow along with me by cloning the repo l...


LAION ▷ #general (89 messages🔥🔥):

Links mentioned:


LAION ▷ #research (5 messages):

Links mentioned:


Perplexity AI ▷ #general (52 messages🔥):

Links mentioned:


Perplexity AI ▷ #sharing (15 messages🔥):

Links mentioned:

Tutorial: Perplexity Collections: Uncover the power of 'Collections' in Perplexity, a top-tier AI research tool. This tutorial guides you through effectively grouping threads around specific ...


Perplexity AI ▷ #pplx-api (14 messages🔥):


LlamaIndex ▷ #blog (3 messages):


LlamaIndex ▷ #general (57 messages🔥🔥):


Latent Space ▷ #ai-general-chat (35 messages🔥):

Links mentioned:


Latent Space ▷ #llm-paper-club (2 messages):

Links mentioned:

LLM Paper Club (Asia Edition!) · Luma: Asia-timezone friendly version of the Latent.Space x EugeneYan.com LLM Paper Club! This week we'll be covering the new Self-Rewarding Language Models paper (...


DiscoResearch ▷ #general (4 messages):

Links mentioned:


DiscoResearch ▷ #embedding_dev (1 messages):

sebastian.bodza: >80k


DiscoResearch ▷ #discolm_german (10 messages🔥):


Alignment Lab AI ▷ #general-chat (1 messages):


Alignment Lab AI ▷ #oo (5 messages):


AI Engineer Foundation ▷ #general (1 messages):