Frozen AI News archive

CodeLLama 70B beats GPT4 on HumanEval

**Meta AI** surprised the community with the release of **CodeLlama**, an open-source model now available on platforms like **Ollama** and **MLX** for local use. The **Miqu model** sparked debate over its origins, possibly linked to **Mistral Medium** or a fine-tuned **Llama-2-70b**, alongside discussions on **AI ethics** and alignment risks. The **Aphrodite engine** showed strong performance on **A6000 GPUs** with specific configurations. Role-playing AI models such as **Mixtral** and **Flatdolphinmaid** faced challenges with repetitiveness, while **Noromaid** and **Rpcal** performed better, with **ChatML** and **DPO** recommended for improved responses. Learning resources like fast.ai's course were highlighted for ML/DL beginners, and fine-tuning techniques with optimizers like *Paged 8bit lion* and *adafactor* were discussed.

Canonical issue URL

The surprise release of CodeLlama from Meta AI is an incredible gift to open source AI:

image.png

As can be expected, the community has already got to work putting it on Ollama and MLX for you to run locally.


Table of Contents

[TOC]

PART 1: High level Discord summaries

TheBloke Discord Summary


Nous Research AI Discord Summary


LM Studio Discord Summary


OpenAI Discord Summary


OpenAccess AI Collective (axolotl) Discord Summary


Mistral Discord Summary


Eleuther Discord Summary


HuggingFace Discord Summary


Latent Space Discord Summary


DiscoResearch Discord Summary


LangChain AI Discord Summary


LlamaIndex Discord Summary


LAION Discord Summary


Perplexity AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


LLM Perf Enthusiasts AI Discord Summary


Skunkworks AI Discord Summary


Datasette - LLM (@SimonW) Discord Summary


Alignment Lab AI Discord Summary


AI Engineer Foundation Discord Summary


The Ontocord (MDEL discord) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

TheBloke ▷ #general (1403 messages🔥🔥🔥):

Links mentioned:


TheBloke ▷ #characters-roleplay-stories (184 messages🔥🔥):

Links mentioned:


TheBloke ▷ #training-and-fine-tuning (21 messages🔥):

Links mentioned:


TheBloke ▷ #coding (29 messages🔥):

Links mentioned:


Nous Research AI ▷ #ctx-length-research (6 messages):

Links mentioned:


Nous Research AI ▷ #off-topic (44 messages🔥):

Links mentioned:


Nous Research AI ▷ #interesting-links (27 messages🔥):

Links mentioned:


Nous Research AI ▷ #general (389 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (37 messages🔥):

Links mentioned:


LM Studio ▷ #💬-general (177 messages🔥🔥):

Links mentioned:

Friends Phoebe GIF - Friends Phoebe Rachel - Discover & Share GIFs: Click to view the GIF


LM Studio ▷ #🤖-models-discussion-chat (92 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (14 messages🔥):


LM Studio ▷ #🎛-hardware-discussion (38 messages🔥):

Links mentioned:


LM Studio ▷ #🧪-beta-releases-chat (11 messages🔥):

Links mentioned:

no title found: no description found


LM Studio ▷ #memgpt (1 messages):


OpenAI ▷ #ai-discussions (3 messages):


OpenAI ▷ #gpt-4-discussions (114 messages🔥🔥):


OpenAI ▷ #prompt-engineering (44 messages🔥):


OpenAI ▷ #api-discussions (44 messages🔥):


OpenAccess AI Collective (axolotl) ▷ #general (139 messages🔥🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (2 messages):


OpenAccess AI Collective (axolotl) ▷ #general-help (30 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #datasets (6 messages):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #community-showcase (2 messages):


OpenAccess AI Collective (axolotl) ▷ #deployment-help (15 messages🔥):


Mistral ▷ #general (137 messages🔥🔥):

Links mentioned:


Mistral ▷ #finetuning (8 messages🔥):

Links mentioned:


Mistral ▷ #showcase (1 messages):

nk_pas: Write prompts everywhere and run Mistral plateforme in a single key stroke


Mistral ▷ #la-plateforme (28 messages🔥):

Links mentioned:


Eleuther ▷ #general (8 messages🔥):

Links mentioned:


Eleuther ▷ #research (119 messages🔥🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (12 messages🔥):

Links mentioned:


Eleuther ▷ #lm-thunderdome (2 messages):


Eleuther ▷ #gpt-neox-dev (6 messages):


HuggingFace ▷ #general (94 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (10 messages🔥):


HuggingFace ▷ #cool-finds (3 messages):

Links mentioned:


HuggingFace ▷ #i-made-this (7 messages):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (11 messages🔥):

Links mentioned:

How to WIN an Election | Ordinary Guide: Follow me on twitter: https://twitter.com/ordinarytingsSupport the Channel on Patreon: https://www.patreon.com/ordinarythingsHow do you win an election? By l...


HuggingFace ▷ #computer-vision (1 messages):

swetha98: I am getting this error while running code for fine tuning donut docvqa .


HuggingFace ▷ #NLP (3 messages):


HuggingFace ▷ #diffusion-discussions (11 messages🔥):

Links mentioned:

How to WIN an Election | Ordinary Guide: Follow me on twitter: https://twitter.com/ordinarytingsSupport the Channel on Patreon: https://www.patreon.com/ordinarythingsHow do you win an election? By l...


Latent Space ▷ #ai-general-chat (92 messages🔥🔥):

Links mentioned:


DiscoResearch ▷ #mixtral_implementation (5 messages):


DiscoResearch ▷ #general (6 messages):

Links mentioned:


DiscoResearch ▷ #embedding_dev (55 messages🔥🔥):

Links mentioned:


LangChain AI ▷ #general (46 messages🔥):

Links mentioned:


LangChain AI ▷ #share-your-work (3 messages):

Links mentioned:


LangChain AI ▷ #tutorials (3 messages):

Links mentioned:


LlamaIndex ▷ #blog (6 messages):

Links mentioned:

LlamaIndex RAG Hackathon (in-person only): Think Beyond Chatbots: Unleashing the Potential of AI Agents


LlamaIndex ▷ #general (36 messages🔥):

Links mentioned:


LAION ▷ #general (31 messages🔥):


LAION ▷ #research (7 messages):

Links mentioned:


Perplexity AI ▷ #general (32 messages🔥):

Links mentioned:


Perplexity AI ▷ #sharing (5 messages):

Links mentioned:

🔥 New Gemini Pro Better than GP-4? Huge Performance Boost on ⚔️ Chatbot Arena ⚔️: New Bard has surpassed GPT-4 on Chatbot Arena. 🦾 Discord: https://discord.com/invite/t4eYQRUcXB☕ Buy me a Coffee: https://ko-fi.com/promptengineering|🔴 Pat...


Perplexity AI ▷ #pplx-api (1 messages):


LLM Perf Enthusiasts AI ▷ #triton (1 messages):


LLM Perf Enthusiasts AI ▷ #cuda (29 messages🔥):

Links mentioned:

CUDA Pro Tip: Increase Performance with Vectorized Memory Access | NVIDIA Technical Blog: This post demonstrates the use of vectorized memory access in CUDA C/C++ to increase bandwidth utilization while decreasing instruction count.


LLM Perf Enthusiasts AI ▷ #pmpp-book (3 messages):


LLM Perf Enthusiasts AI ▷ #youtube-recordings (1 messages):

andreaskoepf: New video link: https://youtu.be/4sgKnKbR-WE?si=J-B0kHqknRXhE7e_


LLM Perf Enthusiasts AI ▷ #general (9 messages🔥):

Links mentioned:

AI startup program | Google Cloud: Tap into the best of Google’s infrastructure, AI products, and foundation models. Get up to $250,000 USD in Google Cloud credits, training, and more.


LLM Perf Enthusiasts AI ▷ #gpt3-5 (1 messages):

Links mentioned:


LLM Perf Enthusiasts AI ▷ #gpt4 (1 messages):


LLM Perf Enthusiasts AI ▷ #opensource (1 messages):

thebaghdaddy: do we believe the mistral medium hype about becoming 2nd only to GPT4?


Skunkworks AI ▷ #general (7 messages):

Links mentioned:

nisten/BigCodeLlama-169b · Hugging Face: no description found


Skunkworks AI ▷ #off-topic (2 messages):

Links mentioned:


Datasette - LLM (@SimonW) ▷ #ai (3 messages):

Links mentioned:

Emojify: no description found


Datasette - LLM (@SimonW) ▷ #llm (1 messages):


Alignment Lab AI ▷ #general-chat (2 messages):

Links mentioned:

GPT-5: Everything You Need to Know So Far: Was yesterday the day GPT-5 actually started training? This video has everything we think we know so far about GPT-5, drawing on exclusive interviews, OpenAI...


AI Engineer Foundation ▷ #general (1 messages):