Frozen AI News archive

AI gets Memory

**AI Discords** analysis covered **20 guilds**, **312 channels**, and **6901 messages**. The report highlights the divergence of RAG style operations for context and memory, with implementations like **MemGPT** rolling out in **ChatGPT** and **LangChain**. The **TheBloke Discord** discussed **open-source large language models** such as the **Large World Model** with contexts up to **1 million tokens**, and the **Cohere aya model** supporting **101 languages**. Roleplay-focused models like **MiquMaid-v2-70B** were noted for performance improvements with enhanced hardware. Finetuning techniques like **Sequential Fine-Tuning (SFT)** and **Direct Preference Optimization (DPO)** were explained, with tools like **Unsloth AI's apply_chat_template** preferred over Alpaca. Integration of JavaScript and Python via **JSPyBridge** in the **SillyTavern** project was also discussed. Training challenges with **Mixtral 8x7b qlora** versus **Mistral 7b** were noted. The **LM Studio Discord** focused on hardware limitations affecting large model loading, medical LLMs like **medAlpaca**, and hardware discussions around GPU upgrades and overclocking. Anticipation for **IQ3_XSS** 1.5 bit quantization support in LM Studio was expressed.

Canonical issue URL

We have long contended that the RAG style operations have been used for context (knowledge base, facts about the world) and memory (running list of facts about you) will diverge. The leading implementation was MemGPT and now it seems to have rolled out in both ChatGPT (with a weirdly roon-y tweet. more details from Joanne Jang) and LangChain.

OpenAI:

image.png

LangChain:

image.png

In some sense this is just a crossing over of something the LMstudio/Sillytavern roleplay people have had for a while now. Expectation is that it will mildly improve UX but not lead to a big wow moment since the memory modeling is quite crude at the moment, not humanlike, and subject to context limits.


Table of Contents

[TOC]

PART 1: High level Discord summaries

TheBloke Discord Summary


LM Studio Discord Summary


LAION Discord Summary


Eleuther Discord Summary


LlamaIndex Discord Summary

LlamaIndex v0.10 Marks Major Milestone: LlamaIndex v0.10 has been released, presenting notable advancements including a new llama-index-core package and PyPi packages for every integration/template. Detailed information on migration is accessible through their comprehensive blog post and documentation.

Webinar on No-Code RAG with LlamaIndex: A webinar demonstrating the creation of no-code Retrieve and Generate (RAG) apps using LlamaIndex.TS is set up with Flowise co-founder Henry Heng. Registration for the Friday event is available here.

Troubleshooting LlamaIndex: Engineers faced challenges with migration following LlamaIndex's update and were pointed to a Notion migration guide for assistance. Furthermore, for configuration queries like chunk_size post-ServiceContext depreciation, engineers are advised to refer to the new Settings documentation and relevant LlamaIndex GitHub resources.

RAG App Building with Dewy Tremendously Simplified: A comprehensive guide to building a full-stack RAG app using NextJS, OpenAI, and the open-source knowledge base Dewy has been shared. The tutorial is aimed at grounding language models in precise, reliable data and can be studied in detail here.

Handling Document Complexity and Enhancing Enterprise with LlamaIndex: Users engaged in discussions about filtering complex documents and integrating LlamaIndex to enhance enterprise efficiency with tools such as Slack, Jira, and GDrive. Also, creating multiple agents for merging different document sources was considered, referencing the possibility of using traditional indexing techniques instead of high-cost LLMs for dynamic filtering.


HuggingFace Discord Summary


Nous Research AI Discord Summary


Mistral Discord Summary


Perplexity AI Discord Summary


OpenAI Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


LangChain AI Discord Summary


DiscoResearch Discord Summary


CUDA MODE Discord Summary


Latent Space Discord Summary


LLM Perf Enthusiasts AI Discord Summary


AI Engineer Foundation Discord Summary


Skunkworks AI Discord Summary


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Datasette - LLM (@SimonW) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1460 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (154 messages🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (43 messages🔥):

Links mentioned:

GitHub - e-p-armstrong/augmentoolkit: Convert Compute And Books Into Instruct-Tuning Datasets: Convert Compute And Books Into Instruct-Tuning Datasets - e-p-armstrong/augmentoolkit


TheBloke ▷ #coding (8 messages🔥):

Links mentioned:

GitHub - extremeheat/JSPyBridge: 🌉. Bridge to interoperate Node.js and Python: 🌉. Bridge to interoperate Node.js and Python . Contribute to extremeheat/JSPyBridge development by creating an account on GitHub.


LM Studio ▷ #💬-general (202 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (75 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🎛-hardware-discussion (140 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (21 messages🔥):

Links mentioned:


LM Studio ▷ #avx-beta (2 messages):

Links mentioned:

LM Studio Beta Releases: no description found


LM Studio ▷ #crew-ai (3 messages):


LAION ▷ #general (361 messages🔥🔥):

Links mentioned:


LAION ▷ #research (48 messages🔥):

Links mentioned:


Eleuther ▷ #general (179 messages🔥🔥):

Links mentioned:


Eleuther ▷ #research (208 messages🔥🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (1 messages):


Eleuther ▷ #lm-thunderdome (4 messages):


Eleuther ▷ #gpt-neox-dev (4 messages):


LlamaIndex ▷ #announcements (2 messages):

Links mentioned:


LlamaIndex ▷ #blog (5 messages):


LlamaIndex ▷ #general (303 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (1 messages):

Links mentioned:

Building a RAG chatbot with NextJS, OpenAI & Dewy | Dewy: This guide will walk you through building a RAG application using NextJS for the web framework, the OpenAI API for the language model, and Dewy as your knowledge base.


HuggingFace ▷ #announcements (1 messages):

Links mentioned:


HuggingFace ▷ #general (192 messages🔥🔥):

<ul>
  <li><strong>Search Engine Development Struggles</strong>: <code>@spidy___</code> discussed challenges in developing a search engine and extracting keywords with <code>@vipitis</code>, <code>@cubietom</code>, and others. The conversation explored the limitations of NER and alternatives like keyword extraction, TF-IDF, BM25, and the use of spaCy for Part of Speech tagging.</li>
  <li><strong>Hosting and Inferencing Challenges</strong>: Users like <code>@sullynaj</code> and <code>@ram1428</code> enquired about hosting custom models and whether serverless inferencing is available, with pointers to server-less or affordable solutions discussed.</li>
  <li><strong>Tackling Model Scale</strong>: Conversations with users like <code>@zorian_93363</code> and <code>@xacer_</code> revolved around the feasibility and usefulness of running very large models (100B+ parameters) on typical "open source enthusiast" hardware.</li>
  <li><strong>Valentine's Day Vibes</strong>: <code>@not_lain</code> spread love and joy on Valentine's Day, encouraging the community to hug their loved ones.</li>
  <li><strong>Discussion on Running Models Locally</strong>: <code>@aj_0003</code> asked about running machine learning models locally while <code>@pierrunoyt</code> discussed using Hugging Face to clone and run a model.</li>
</ul>

Links mentioned:


HuggingFace ▷ #today-im-learning (9 messages🔥):

Links mentioned:


HuggingFace ▷ #cool-finds (8 messages🔥):

Links mentioned:

New paper by DeepMind: Buffer Overflow in…"](https://huggingface.co/posts/osanseviero/980907000007376): no description found


HuggingFace ▷ #i-made-this (10 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (32 messages🔥):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (10 messages🔥):


HuggingFace ▷ #computer-vision (7 messages):


HuggingFace ▷ #NLP (4 messages):


HuggingFace ▷ #diffusion-discussions (10 messages🔥):


Nous Research AI ▷ #ctx-length-research (3 messages):

Links mentioned:

DAMO-NLP-SG/LongCorpus-2.5B · Datasets at Hugging Face: no description found


Nous Research AI ▷ #off-topic (8 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (17 messages🔥):

Links mentioned:


Nous Research AI ▷ #general (180 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (38 messages🔥):

Links mentioned:


Nous Research AI ▷ #collective-cognition (3 messages):


Mistral ▷ #general (43 messages🔥):


Mistral ▷ #models (20 messages🔥):

Links mentioned:


Mistral ▷ #deployment (46 messages🔥):

Links mentioned:


Mistral ▷ #finetuning (76 messages🔥🔥):


Mistral ▷ #showcase (2 messages):


Mistral ▷ #random (15 messages🔥):

Links mentioned:

Ergonomie socio-cognitive des systèmes intelligents - Classique et alternance - Ametys Campus - Université Savoie Mont Blanc: no description found


Mistral ▷ #la-plateforme (2 messages):


Perplexity AI ▷ #general (149 messages🔥🔥):

Links mentioned:


Perplexity AI ▷ #sharing (13 messages🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


Perplexity AI ▷ #pplx-api (24 messages🔥):

Links mentioned:


OpenAI ▷ #annnouncements (1 messages):

Links mentioned:

Memory and new controls for ChatGPT: We’re testing the ability for ChatGPT to remember things you discuss to make future chats more helpful. You’re in control of ChatGPT’s memory.


OpenAI ▷ #ai-discussions (83 messages🔥🔥):

Links mentioned:

Discord - A New Way to Chat with Friends & Communities: Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.


OpenAI ▷ #gpt-4-discussions (49 messages🔥):

Links mentioned:


OpenAI ▷ #prompt-engineering (23 messages🔥):

Links mentioned:


OpenAI ▷ #api-discussions (23 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (66 messages🔥🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (8 messages🔥):

Links mentioned:

@dctanner on Hugging Face: "As the amount of datasets for fine tuning chat models has grown, there's been…": no description found


OpenAccess AI Collective (axolotl) ▷ #general-help (26 messages🔥):

Links mentioned:

GitHub - OpenAccess-AI-Collective/axolotl: Go ahead and axolotl questions: Go ahead and axolotl questions. Contribute to OpenAccess-AI-Collective/axolotl development by creating an account on GitHub.


OpenAccess AI Collective (axolotl) ▷ #datasets (1 messages):


OpenAccess AI Collective (axolotl) ▷ #runpod-help (1 messages):


LangChain AI ▷ #announcements (1 messages):

Links mentioned:


LangChain AI ▷ #general (59 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #langserve (8 messages🔥):

Links mentioned:

LangSmith: no description found


LangChain AI ▷ #share-your-work (1 messages):

Links mentioned:

Building a RAG chatbot with NextJS, OpenAI & Dewy | Dewy: This guide will walk you through building a RAG application using NextJS for the web framework, the OpenAI API for the language model, and Dewy as your knowledge base.


LangChain AI ▷ #tutorials (2 messages):


DiscoResearch ▷ #general (29 messages🔥):

Links mentioned:


DiscoResearch ▷ #embedding_dev (5 messages):


DiscoResearch ▷ #discolm_german (1 messages):

thomasrenkert: Is there an ETA for v2 of the German model? Or for the Mixtral variant?


CUDA MODE ▷ #general (1 messages):


CUDA MODE ▷ #cuda (9 messages🔥):

Links mentioned:


CUDA MODE ▷ #algorithms (3 messages):


CUDA MODE ▷ #jobs (3 messages):

Links mentioned:

no title found: no description found


CUDA MODE ▷ #beginner (9 messages🔥):

Links mentioned:

The Book of Shaders: Gentle step-by-step guide through the abstract and complex universe of Fragment Shaders.


CUDA MODE ▷ #pmpp-book (7 messages):


CUDA MODE ▷ #smol-hw (1 messages):

Links mentioned:

GitHub - tlkh/asitop: Perf monitoring CLI tool for Apple Silicon: Perf monitoring CLI tool for Apple Silicon. Contribute to tlkh/asitop development by creating an account on GitHub.


Latent Space ▷ #ai-general-chat (24 messages🔥):

Links mentioned:


LLM Perf Enthusiasts AI ▷ #opensource (2 messages):


LLM Perf Enthusiasts AI ▷ #openai (4 messages):

Links mentioned:

Memory and new controls for ChatGPT: We’re testing the ability for ChatGPT to remember things you discuss to make future chats more helpful. You’re in control of ChatGPT’s memory.


AI Engineer Foundation ▷ #events (6 messages):


Skunkworks AI ▷ #general (2 messages):