Frozen AI News archive

1/6-7/2024: LlaMA Pro - an alternative to PEFT/RAG??

New research papers introduce promising **Llama Extensions** including **TinyLlama**, a compact **1.1B** parameter model pretrained on about **1 trillion tokens** for 3 epochs, and **LLaMA Pro**, an **8.3B** parameter model expanding **LLaMA2-7B** with additional training on **80 billion tokens** of code and math data. LLaMA Pro adds layers to avoid catastrophic forgetting and balances language and code tasks but faces scrutiny for not using newer models like **Mistral** or **Qwen**. Meanwhile, **OpenAI** Discord discussions reveal insights on **GPT-4** token limits, privacy reassurances, fine-tuning for GPT-3.5, challenges with multi-language image recognition, custom GPT creation requiring **ChatGPT Plus**, and security concerns in GPT deployment. Users also share tips on dynamic image generation with **DALL-E** and logo creation.

Canonical issue URL

New papers released show very promising Llama Extensions:

But it is getting some scrutiny already for basing on LlaMA and not using Mistral/Qwen/etc:

image.png

Yannic Kilcher already has a great Llama Pro explainer out:

https://www.youtube.com/watch?v=hW3OVWfndLw

In other news, LangChain is planning to promote their recent v0.1 next week.


Table of Contents

[TOC]

OpenAI Discord Summary

OpenAI Channel Summaries

▷ #ai-discussions (241 messages🔥🔥):

Links mentioned:

▷ #gpt-4-discussions (97 messages🔥🔥):

Links mentioned:

▷ #prompt-engineering (48 messages🔥):

Links mentioned:

Usage policies

▷ #api-discussions (48 messages🔥):

Links mentioned:

Usage policies


Eleuther Discord Summary

Eleuther Channel Summaries

▷ #general (161 messages🔥🔥):

Links mentioned:

▷ #research (64 messages🔥🔥):

Links mentioned:

▷ #interpretability-general (2 messages):

Links mentioned:

An explanation for every token: using an LLM to sample another LLM — LessWrong: Introduction Much has been written about the implications and potential safety benefits of building an AGI based on one or more Large Language Models…

▷ #lm-thunderdome (13 messages🔥):

Links mentioned:

LLM-Benchmark-Logs/benchmark-logs/Mixtral-7x8-Base.md at main · teknium1/LLM-Benchmark-Logs: Just a bunch of benchmark logs for different LLMs. Contribute to teknium1/LLM-Benchmark-Logs development by creating an account on GitHub.


Perplexity AI Discord Summary

Perplexity AI Channel Summaries

▷ #general (180 messages🔥🔥):

Links mentioned:

▷ #sharing (15 messages🔥):

Links mentioned:

▷ #pplx-api (5 messages):

Links mentioned:

Chat Completions


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (57 messages🔥🔥):

Links mentioned:

▷ #axolotl-dev (43 messages🔥):

Links mentioned:

▷ #general-help (61 messages🔥🔥):

Links mentioned:

▷ #shearedmistral (14 messages🔥):

Links mentioned:


HuggingFace Discord Discord Summary

HuggingFace Discord Channel Summaries

▷ #general (85 messages🔥🔥):

Links mentioned:

▷ #today-im-learning (26 messages🔥):

Links mentioned:

▷ #cool-finds (6 messages):

Links mentioned:

▷ #i-made-this (7 messages):

Links mentioned:

▷ #reading-group (29 messages🔥):

Links mentioned:

MC-JEPA neural model: Unlock the power of motion recognition & generative ai on videos and images: 🌟 Unlock the Power of AI Learning from Videos ! 🎬 Watch a deep dive discussion on the MC-JEPA approach with Oliver, Nevil, Ojasvita, Shashank and Srikanth....

▷ #core-announcements (1 messages):

Links mentioned:

diffusers/examples/research_projects/diffusion_dpo at main · huggingface/diffusers: 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - huggingface/diffusers

▷ #computer-vision (3 messages):

Links mentioned:

Tweet from merve (@mervenoyann): DINOv2 is the king for self-supervised learning in images 🦖🦕 But how does it work? I've tried to explain how it works but let's expand on it 🧶

▷ #NLP (3 messages):

Links mentioned:

Omaratef3221/flan-t5-base-dialogue-generator · Hugging Face


LAION Discord Summary

Links mentioned:

LAION Channel Summaries

▷ #general (103 messages🔥🔥):

Links mentioned:

▷ #research (24 messages🔥):

Links mentioned:

▷ #learning-ml (3 messages):


LangChain AI Discord Summary

LangChain AI Channel Summaries

▷ #announcements (1 messages):

▷ #general (33 messages🔥):

Links mentioned:

casa-bot/services/api/main.py at dev · ai-ponx/casa-bot: Agentive real estate sms assistant. Contribute to ai-ponx/casa-bot development by creating an account on GitHub.

▷ #langserve (17 messages🔥):

Links mentioned:

▷ #langchain-templates (1 messages):

▷ #share-your-work (3 messages):

Links mentioned:

Neutrino AI

▷ #tutorials (5 messages):


Mistral Discord Summary

Mistral Channel Summaries

▷ #general (25 messages🔥):

Additional Notes: Conversation is majorly around problems and potential solutions related to fine tuning and testing AI models, with some users suggesting possible workarounds to the ongoing issues. There is also a discussion on the practicality and accuracy of testing methods. Several users also express interest in testing specific functionalities, such as 'if-else' conditions.

Links mentioned:

▷ #models (1 messages):

10anant10: Hey anyone wanna build something together

▷ #deployment (2 messages):

▷ #ref-implem (1 messages):

productiondown: Hey folks, https://docs.mistral.ai/usage/guardrailing this link is broken

▷ #finetuning (3 messages):

Links mentioned:

▷ #showcase (1 messages):

pradeep1148: https://www.youtube.com/watch?v=aXeU6mVRgiA

▷ #random (3 messages):

▷ #la-plateforme (21 messages🔥):

Links mentioned:

▷ #office-hour (1 messages):


Datasette/LLM (@SimonW) Discord Summary

Datasette/LLM (@SimonW) Channel Summaries

▷ #ai (3 messages):

Links mentioned:

It’s OK to call it Artificial Intelligence: We need to be having high quality conversations about AI: what it can and can’t do, its many risks and pitfalls and how to integrate it into society in the …

▷ #llm (2 messages):


Alignment Lab AI Discord Summary

Alignment Lab AI Channel Summaries

▷ #general-chat (2 messages):

Links mentioned:

When "Everything" Becomes Too Much: The npm Package Chaos of 2024 - Socket: An NPM user named PatrickJS launched a troll campaign with a package called "everything," which depends on all public npm packages.

▷ #oo (1 messages):

teknium: Hi all <a:waveyboy:507416520788279297>


Skunkworks AI Discord Summary

Only 1 channel had activity, so no need to summarize...

Links mentioned:


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.