Frozen AI News archive

1/17/2024: Help crowdsource function calling datasets

**LM Studio** updated its FAQ to clarify that it is **closed-source**, will remain free for personal use, and collects no data. The new beta release includes fixes and hints at upcoming **2-bit quantization** support. For gaming, models such as **Dolphin 2.7 Mixtral 8x7B**, **MegaDolphin**, and **Dolphin 2.6 Mistral 7B DPO** with **Q4_K_M** quantization were recommended. Discussions highlighted that single powerful GPUs outperform multi-GPU setups due to bottlenecks, and that older GPUs such as the Tesla P40 are cost-effective. **Microsoft's AutoGen Studio** was introduced, but it has issues and requires **API fees** for open-source models. Linux users were advised to use **llama.cpp** rather than LM Studio because LM Studio lacks a headless mode. Additional tools such as **LLMFarm** for iOS and various Hugging Face repositories were also mentioned. Notable points included *"LM Studio must be running to use the local inference server as there is no headless mode available"* and *"matching model size to GPU memory is key for performance"*.
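The point about matching model size to GPU memory can be made concrete with a back-of-the-envelope estimate. The sketch below is only illustrative: the function names are hypothetical, the figure of roughly 4.5 bits per weight for a 4-bit K-quant is an assumption (effective bits vary by format), the 2 GiB overhead allowance is a guess, and the 46.7B total parameter count for Mixtral 8x7B is approximate.

```python
# Back-of-the-envelope VRAM check for a quantized model.
# All numeric defaults below are illustrative assumptions, not measured values.

def estimate_weights_gib(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits_in_vram(
    n_params_billion: float,
    bits_per_weight: float,
    vram_gib: float,
    overhead_gib: float = 2.0,  # assumed allowance for KV cache and buffers
) -> bool:
    """Return True if weights plus the assumed overhead fit in the given VRAM."""
    needed = estimate_weights_gib(n_params_billion, bits_per_weight) + overhead_gib
    return needed <= vram_gib


if __name__ == "__main__":
    # Assumed inputs: ~4.5 bits per weight, and a card with about 24 GiB of VRAM.
    for name, params in [("7B model", 7.0), ("Mixtral 8x7B (~46.7B total)", 46.7)]:
        size = estimate_weights_gib(params, 4.5)
        ok = fits_in_vram(params, 4.5, 24.0)
        print(f"{name}: ~{size:.1f} GiB of weights, fits in 24 GiB: {ok}")
```

Under these assumptions a 7B model fits comfortably on a 24 GB card such as the Tesla P40, while a 4-bit Mixtral 8x7B does not fit entirely and would need partial offloading or a smaller quantization.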

Skunkworks is working on collating function calling datasets - key to turning everything into functions!

It's also important to become familiar with the underlying data formats and sources.

What other datasets are out there for tuning function calls? Can we synthesize some?
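One simple way to explore the synthesis question is template-based generation: sample arguments from a tool specification and pair a templated user prompt with the matching call. The sketch below is only an assumption about what such a pipeline could look like. The `get_weather` tool, the record fields (`prompt`, `call`), and the JSONL layout are all hypothetical and are not taken from the Skunkworks effort or any existing dataset.

```python
import json
import random

# Hypothetical tool specification with a small set of allowed argument values.
WEATHER_TOOL = {
    "name": "get_weather",
    "parameters": {
        "city": ["Paris", "Tokyo", "Lima"],
        "unit": ["celsius", "fahrenheit"],
    },
}


def synthesize_example(tool: dict, rng: random.Random) -> dict:
    """Sample one argument set and pair a templated prompt with the call."""
    args = {name: rng.choice(values) for name, values in tool["parameters"].items()}
    prompt = f"What is the weather in {args['city']} in {args['unit']}?"
    return {"prompt": prompt, "call": {"name": tool["name"], "arguments": args}}


def to_jsonl(examples: list) -> str:
    """Serialize examples as one JSON object per line."""
    return "\n".join(json.dumps(example, sort_keys=True) for example in examples)


rng = random.Random(0)  # fixed seed so the run is repeatable
examples = [synthesize_example(WEATHER_TOOL, rng) for _ in range(3)]
print(to_jsonl(examples))
```

A single prompt template yields very uniform data, so a real pipeline would likely need many paraphrased templates or model-generated prompts, plus validation that each call matches the tool's schema.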


LM Studio Discord Summary

Shared links pointed to a variety of projects, including Microsoft's AutoGen Studio, LLMFarm (an LM Studio alternative for iOS), and various Hugging Face model repositories. Two items lacked enough context to summarize: a GitHub link to NexusRaven-V2 and a single remark about local models struggling with memory.

LM Studio Channel Summaries

▷ #💬-general (62 messages🔥🔥):

▷ #🤖-models-discussion-chat (80 messages🔥🔥):

▷ #🧠-feedback (5 messages):

▷ #🎛-hardware-discussion (81 messages🔥🔥):

▷ #🧪-beta-releases-chat (8 messages🔥):

▷ #autogen (6 messages):

Links mentioned:

autogen/samples/apps/autogen-studio at main · microsoft/autogen: Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ - microsoft/autogen

▷ #langchain (1 messages):

sublimatorniq: https://github.com/nexusflowai/NexusRaven-V2

▷ #memgpt (1 messages):

pefortin: Yeah, local models struggle on how and when to use memory.


Eleuther Discord Summary

Eleuther Channel Summaries

▷ #general (119 messages🔥🔥):

▷ #research (69 messages🔥🔥):

▷ #interpretability-general (1 messages):

▷ #lm-thunderdome (7 messages):

▷ #gpt-neox-dev (9 messages🔥):

Links mentioned:

Python version update by segyges · Pull Request #1122 · EleutherAI/gpt-neox: Don't know if this is ready or not; in my local testing it fails some of the pytest tests, but it's plausible to likely it was doing so before. Bumps image to ubuntu 22.04 and uses the system ...


Nous Research AI Discord Summary

Nous Research AI Channel Summaries

▷ #off-topic (27 messages🔥):

▷ #interesting-links (15 messages🔥):

▷ #general (110 messages🔥🔥):

▷ #ask-about-llms (40 messages🔥):


Mistral Discord Summary

Mistral Channel Summaries

▷ #general (76 messages🔥🔥):

▷ #models (71 messages🔥🔥):

▷ #finetuning (33 messages🔥):

▷ #showcase (6 messages):

▷ #la-plateforme (5 messages):

Links mentioned:

Mistral 7B - Host Analysis | ArtificialAnalysis.ai: Analysis of Mistral 7B Instruct across metrics including quality, latency, throughput, price and others.


HuggingFace Discord Summary

HuggingFace Discord Channel Summaries

▷ #general (84 messages🔥🔥):

▷ #today-im-learning (4 messages):

Links mentioned:

Webinar "A Whirlwind Tour of ML Model Serving Strategies (Including LLMs)" · Luma: Data Phoenix team invites you all to our upcoming webinar that’s going to take place on January 25th, 10 am PST. Topic: A Whirlwind Tour of ML Model Serving Strategies (Including...

▷ #cool-finds (8 messages🔥):

▷ #i-made-this (6 messages):

▷ #reading-group (2 messages):

▷ #diffusion-discussions (12 messages🔥):

▷ #computer-vision (1 messages):

▷ #NLP (53 messages🔥):


OpenAI Discord Summary

GPT's Negation Challenge Sparks Discussion at Dev Event: During an event, a developer acknowledged that the AI struggles with negation prompts, tending to ignore the negation and thereby produce errors. The acknowledgement resonated with @darthgustav.

Could GPT Assistant Join the Free Tier?: @mischasimpson hinted, based on a live tutorial they watched, that the GPT assistant may soon be accessible without cost, which indicates a possible shift towards making advanced AI tools available on OpenAI's free tier.

Customizing Education with GPT: Users @mischasimpson and @darthgustav. discussed using GPT to generate personalized reading exercises for children, touching on the simplicity of the process and the potential to track completion and performance.

The Curious Case of the Mythical GPT-4.5 Turbo: In a conversation full of speculation, @okint believed they had encountered a version of the AI called "gpt-4.5-turbo." However, others, including @7877 and @luarstudios, were quick to remind the community to be wary of AI fabrications, since such a version might not exist.

Managing Expectations of GPT's Capabilities: Users @solbus and @.bren_._ provided clarity on the actual workings of Custom GPTs, dispelling misconceptions that they can be trained directly on knowledge files and explaining that true model training requires OpenAI services or building a large language model from scratch.

OpenAI Channel Summaries

▷ #ai-discussions (47 messages🔥):

Links mentioned:

How to Mass Delete Discord Messages: In this video I will be showing you how to mass delete discord messages in dms, channels, servers, ect with UnDiscord which is a easy extension that allows y...

▷ #gpt-4-discussions (43 messages🔥):

▷ #prompt-engineering (31 messages🔥):

▷ #api-discussions (31 messages🔥):


Latent Space Discord Summary

Latent Space Channel Summaries

▷ #ai-general-chat (17 messages🔥):

▷ #ai-event-announcements (1 messages):

▷ #llm-paper-club (32 messages🔥):

▷ #llm-paper-club-chat (65 messages🔥🔥):


Perplexity AI Discord Summary

Perplexity AI Channel Summaries

▷ #general (31 messages🔥):

▷ #sharing (13 messages🔥):

Links mentioned:

Tweet from Riley Brown (@rileybrown_ai): I use @perplexity_ai more than chatgpt & google. Their collections feature is very underrated. And it gets better every month.

▷ #pplx-api (10 messages🔥):


Skunkworks AI Discord Summary

Only 1 channel had activity, so no need to summarize...


OpenAccess AI Collective (axolotl) Discord Summary

OpenAccess AI Collective (axolotl) Channel Summaries

▷ #general (22 messages🔥):

Links mentioned:

Kquant03/FrankenDPO-4x7B-bf16 · Hugging Face

▷ #axolotl-dev (4 messages):

▷ #general-help (6 messages):

▷ #runpod-help (7 messages):

▷ #replicate-help (1 messages):

hamelh: 〰️


LlamaIndex Discord Summary

LlamaIndex Discord Channel Summaries

▷ #blog (3 messages):

▷ #general (31 messages🔥):

▷ #ai-discussion (1 messages):

Links mentioned:

Unleashing the Power of Semantic Chunking: A Journey with LlamaIndex: Ankush k Singal


DiscoResearch Discord Summary

DiscoResearch Channel Summaries

▷ #mixtral_implementation (2 messages):

▷ #general (10 messages🔥):

▷ #benchmark_dev (2 messages):

Links mentioned:

EQ-Bench Leaderboard

▷ #embedding_dev (10 messages🔥):


LLM Perf Enthusiasts AI Discord Summary

LLM Perf Enthusiasts AI Channel Summaries

▷ #gpt4 (2 messages):

▷ #opensource (11 messages🔥):

Links mentioned:

Mixtral 8x7B - Host Analysis | ArtificialAnalysis.ai: Analysis of Mixtral 8x7B Instruct across metrics including quality, latency, throughput, price and others.

▷ #speed (1 messages):

rabiat: Azure pretty slow for us across different regions. Anyone expierenceing the same?


LangChain AI Discord Summary

LangChain AI Channel Summaries

▷ #general (7 messages):

▷ #langserve (2 messages):

▷ #share-your-work (4 messages):


LAION Discord Summary

LAION Channel Summaries

▷ #general (5 messages):

▷ #research (4 messages):


Alignment Lab AI Discord Summary

Only 1 channel had activity, so no need to summarize...


YAIG (a16z Infra) Discord Summary

Only 1 channel had activity, so no need to summarize...

Links mentioned:

A look back at CNCF, Linux Foundation, and top 30 open source project velocity in 2023: By Chris Aniszczyk. We have been tracking open source project velocity over the last several years and wanted to share the latest update highlighting open source project velocity over the last 12...