Frozen AI News archive

12/13/2023 SOLAR10.7B upstages Mistral7B?

**Upstage** released the **SOLAR-10.7B** model, which uses a novel Depth Up-Scaling technique built on the **llama-2** architecture and integrates **mistral-7b** weights, followed by continued pre-training. The **Nous** community finds it promising but not exceptional. Additionally, weights for the **phi-2** base model were released, trained on **1.4 trillion tokens** including synthetic texts created by GPT-3 and websites filtered by GPT-4, using **96 A100 GPUs** over 14 days. On **OpenAI's** Discord, users discussed challenges with various **GPT** models, including incoherent outputs, API usage limitations, and issues with the **GPT-4 Vision API**. Conversations also covered understanding **AGI** and **ASI**, concerns about OpenAI's partnership with Axel Springer, and pricing changes for GPT Plus. Discussions also touched on the **Gemini** chat model integrated into Bard and how it compares with GPT-4.

We developed the Depth Up-Scaling technique. Built on the Llama2 architecture, SOLAR-10.7B incorporates the innovative Upstage Depth Up-Scaling. We then integrated Mistral 7B weights into the upscaled layers, and finally, continued pre-training for the entire model.
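The layer arithmetic behind this description can be sketched in a few lines. This is a minimal illustration, not Upstage's implementation: the function name `depth_up_scale` is hypothetical, and the 32-layer base with 8 layers dropped per copy are assumed values that this issue does not state. It only shows how two trimmed copies of a layer stack are concatenated into a deeper one; the weight initialization and continued pre-training are not modeled.

```python
def depth_up_scale(layers, drop):
    """Return the layer order of a depth-up-scaled stack.

    Hypothetical helper: two copies of `layers` are made, the last `drop`
    layers are removed from the first copy, the first `drop` layers are
    removed from the second copy, and the two are concatenated.
    """
    if not 0 <= drop < len(layers):
        raise ValueError("drop must be in [0, len(layers))")
    first_copy = layers[: len(layers) - drop]   # keeps layers 0 .. n-drop-1
    second_copy = layers[drop:]                 # keeps layers drop .. n-1
    return first_copy + second_copy


# Assumed figures (not from this issue): 32 base layers, 8 dropped per copy.
base_layers = list(range(32))
scaled_layers = depth_up_scale(base_layers, drop=8)
print(len(scaled_layers))
```

Under these assumed values the scaled stack is deeper than the base while every layer still starts from existing weights, which is why continued pre-training of the whole model follows.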

The Nous community thinks it's good but not great.

In other news, weights for the Phi-2 base model were released. Its training mix is described as 1.4T tokens of Phi 1.5 data plus 250B tokens of new GPT-3-created synthetic texts and GPT-4-filtered websites, trained on 96 A100s for 14 days.

OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (55 messages🔥🔥):

▷ #openai-chatter (270 messages🔥🔥):

▷ #openai-questions (168 messages🔥🔥):

▷ #gpt-4-discussions (21 messages🔥):

▷ #prompt-engineering (45 messages🔥):

▷ #api-discussions (45 messages🔥):


Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #off-topic (9 messages🔥):

▷ #benchmarks-log (19 messages🔥):

▷ #interesting-links (103 messages🔥🔥):

▷ #general (382 messages🔥🔥):

▷ #ask-about-llms (72 messages🔥🔥):


DiscoResearch Discord Summary

DiscoResearch Channel Summaries

▷ #disco_judge (3 messages):

▷ #mixtral_implementation (160 messages🔥🔥):

▷ #general (18 messages🔥):

▷ #benchmark_dev (6 messages):


OpenAccess AI Collective (axolotl) Discord Summary




OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (71 messages🔥🔥):

▷ #axolotl-dev (82 messages🔥🔥):

▷ #general-help (14 messages🔥):

▷ #datasets (6 messages):


Mistral Discord Summary

Mistral Channel Summaries

▷ #general (74 messages🔥🔥):

▷ #models (21 messages🔥):

▷ #deployment (6 messages):

▷ #ref-implem (8 messages🔥):

▷ #finetuning (1 message):

▷ #showcase (8 messages🔥):

▷ #random (3 messages):

▷ #la-plateforme (52 messages🔥):


HuggingFace Discord Summary

HuggingFace Discord Channel Summaries

▷ #announcements (1 message):

▷ #general (57 messages🔥🔥):

▷ #today-im-learning (4 messages):

▷ #cool-finds (7 messages):

▷ #i-made-this (7 messages):

▷ #reading-group (5 messages):

▷ #diffusion-discussions (3 messages):

▷ #computer-vision (3 messages):

▷ #NLP (9 messages🔥):


LangChain AI Discord Summary

Only 1 channel had activity, so no need to summarize...


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (9 messages🔥):

▷ #ai-event-announcements (1 message):

▷ #llm-paper-club (7 messages):


LLM Perf Enthusiasts AI Discord Summary

LLM Perf Enthusiasts AI Channel Summaries

▷ #general (1 message):

dongdong0755: Anyone trying out gemini pro api?

▷ #gpt4 (1 message):

.psychickoala: Any update here? Anyone have examples of this?

▷ #finetuning (2 messages):

▷ #opensource (1 message):

▷ #resources (2 messages):

▷ #speed (1 message):

rabiat: Are there any latency stats on googles gemini?

▷ #eval (1 message):

▷ #prompting (1 message):

jeffreyw128: Anyone find a good way to understand


Alignment Lab AI Discord Summary

Alignment Lab AI Channel Summaries

▷ #general-chat (4 messages):

▷ #oo (1 message):

▷ #phi-tuning (3 messages):


AI Engineer Foundation Discord Summary

AI Engineer Foundation Channel Summaries

▷ #general (2 messages):

▷ #events (1 message):


MLOps @Chipro Discord Summary

MLOps @Chipro Channel Summaries

▷ #events (1 message):

▷ #general-ml (1 message):


Skunkworks AI Discord Summary

Only 1 channel had activity, so no need to summarize...


The Ontocord (MDEL discord) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Perplexity AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.