Frozen AI News archive

1/16/2024: ArtificialAnalysis - a new model/host benchmark site

**Artificial Analysis** launched a new models and hosts comparison site, highlighted by **swyx**. **Nous Research AI** Discord discussed innovative summarization techniques using **NVIDIA 3090 and 2080ti GPUs** for processing around **100k tokens**, and adapting prompts for smaller models like **OpenChat 7B**. The availability of **Hermes 2 Mixtral** on **Huggingface's HuggingChat** was noted, alongside fine-tuning challenges with **Mixtral** using Axolotl. Discussions included byte-level tokenization experiments with **Byte Mistral**, multimodal training on **COCO image bytes**, and inference speed improvements using **vllm** and **llama.cpp**. Calls for transparency in data sharing and open-sourcing the **Hermes 2 Mixtral** dataset were emphasized, with comparisons of **dpo** and **sft** methods and quantized LLM use on **M1 MacBook Pro**.

Canonical issue URL

Artificial Analysis: this gem of a models and hosts comparison site was just launched:

image.png

swyx's tweet on this here:

image.png

--

Table of Contents

[TOC]

Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #ctx-length-research (1 messages):

gabriel_syme: Dang this looks great https://fxtwitter.com/_akhaliq/status/1747515567492174185

▷ #off-topic (78 messages🔥🔥):

Links mentioned:

▷ #interesting-links (2 messages):

Links mentioned:

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models: Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources. Existing long-context extension methods usually need additional tra...

▷ #general (224 messages🔥🔥):

Links mentioned:

▷ #ask-about-llms (33 messages🔥):

Links mentioned:


HuggingFace Discord Discord Summary

HuggingFace Discord Channel Summaries

▷ #announcements (4 messages):

Links mentioned:

▷ #general (141 messages🔥🔥):

Links mentioned:

▷ #today-im-learning (11 messages🔥):

▷ #cool-finds (2 messages):

▷ #i-made-this (14 messages🔥):

Links mentioned:

▷ #reading-group (6 messages):

▷ #diffusion-discussions (6 messages):

Links mentioned:

▷ #computer-vision (4 messages):

Links mentioned:

Akshay's Personal Website: I am a Machine Learning Enthusiast. Check out my Projects and Blogs

▷ #NLP (5 messages):

Links mentioned:

▷ #diffusion-discussions (6 messages):

Links mentioned:


OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (71 messages🔥🔥):

Links mentioned:

Salazar Introduces the No AI Fraud Act: WASHINGTON, D.C. – Today, Reps. María Elvira Salazar (R-FL) and Madeleine Dean (D-PA) introduced the No Artificial Intelligence Fake Replicas And Unauthorized Duplications (No AI FRAUD) Act. The bill ...

▷ #gpt-4-discussions (76 messages🔥🔥):

▷ #prompt-engineering (23 messages🔥):

▷ #api-discussions (23 messages🔥):


LM Studio Discord Summary

LM Studio Channel Summaries

▷ #💬-general (143 messages🔥🔥):

Links mentioned:

▷ #🤖-models-discussion-chat (35 messages🔥):

Links mentioned:

Mastering Model Merging: A Deep Dive into TIES-Merging Technique: TIES-Merging is a groundbreaking method for merging model checkpoints, enabling seamless multitasking. These robust model merging techniques can greatly enha...

▷ #🧠-feedback (5 messages):

▷ #🎛-hardware-discussion (8 messages🔥):

Links mentioned:

Yay Kitty GIF - Yay Kitty Cat - Discover & Share GIFs: Click to view the GIF

▷ #autogen (2 messages):


Mistral Discord Summary

The above summaries are based on discussions that included engineers and engaged with technical, hands-on aspects of working with the mentioned AI models. Links to additional resources or examples provided by community members have been included for further reference.

Mistral Channel Summaries

▷ #general (51 messages🔥):

Links mentioned:

▷ #models (46 messages🔥):

Links mentioned:

▷ #deployment (1 messages):

▷ #ref-implem (1 messages):

Links mentioned:

client-python/examples/async_embeddings.py at main · mistralai/client-python: Python client library for Mistral AI platform. Contribute to mistralai/client-python development by creating an account on GitHub.

▷ #finetuning (28 messages🔥):

Links mentioned:

▷ #la-plateforme (23 messages🔥):

Links mentioned:

Open-weight models | Mistral AI Large Language Models): We open-source both pre-trained models and fine-tuned models. These models are not tuned for safety as we want to empower users to test and refine moderation based on their use cases. For safer models...


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (40 messages🔥):

Links mentioned:

▷ #ai-event-announcements (1 messages):

Links mentioned:

▷ #llm-paper-club-chat (1 messages):

Links mentioned:

Tweet from Allen (Simian) Luo (@SimianLuo): Best joke of my life. We are happy to announce that LCM get rejected by ICLR🤣🤣 lol. Question:Should i continue to do research in school?😁


LlamaIndex Discord Discord Summary

LlamaIndex Discord Channel Summaries

▷ #blog (3 messages):

▷ #general (30 messages🔥):

Links mentioned:

Embeddings - LlamaIndex 🦙 0.9.33


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (10 messages🔥):

Links mentioned:

A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA: As large language models (LLMs) have become increasingly compute and memory intensive, parameter-efficient fine-tuning (PEFT) methods are now a common strategy to fine-tune LLMs. A popular PEFT method...

▷ #axolotl-dev (21 messages🔥):

Links mentioned:


DiscoResearch Discord Summary

DiscoResearch Channel Summaries

▷ #mixtral_implementation (10 messages🔥):

Links mentioned:

▷ #general (7 messages):

Links mentioned:

▷ #embedding_dev (8 messages🔥):


LLM Perf Enthusiasts AI Discord Summary

LLM Perf Enthusiasts AI Channel Summaries

▷ #opensource (10 messages🔥):

▷ #openai (4 messages):


Skunkworks AI Discord Summary

Skunkworks AI Channel Summaries

▷ #general (10 messages🔥):

Links mentioned:

Tweet from Teknium (e/λ) (@Teknium1): Personally I dont see a problem with adding their work to "related works" or citations - I am not sure what the complaint about citing a blog or non-paper is? Do you see it as a negative in so...

▷ #papers (1 messages):

Links mentioned:

Tweet from Prateek Yadav (@prateeky2806): 🎉 Thrilled to announce our MOE Expert Merging paper has been accepted to @iclr_conf as a SpotLight paper. ! We reduce the inference memory cost of MOE models by utilizing routing statistics-based me...


Datasette - LLM (@SimonW) Discord Summary

Datasette - LLM (@SimonW) Channel Summaries

▷ #ai (2 messages):

Links mentioned:

GitHub - dbreunig/emoji-suggest: A microservice to suggest an emoji given a string.: A microservice to suggest an emoji given a string. - GitHub - dbreunig/emoji-suggest: A microservice to suggest an emoji given a string.

▷ #llm (3 messages):

Links mentioned:

Finding Bathroom Faucets with Embeddings: Using embeddings to navigate impenetrable domains


Alignment Lab AI Discord Summary

Only 1 channel had activity, so no need to summarize...

Links mentioned:

Tweet from Prateek Yadav (@prateeky2806): 🎉 Thrilled to announce our MOE Expert Merging paper has been accepted to @iclr_conf as a SpotLight paper. ! We reduce the inference memory cost of MOE models by utilizing routing statistics-based me...


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.