Frozen AI News archive

Common Corpus: 2T Open Tokens with Provenance

**Pleias** via **Hugging Face** released **Common Corpus**, the largest fully open multilingual dataset with over **2 trillion tokens** including detailed **provenance information**. They also introduced **OCRonos-Vintage**, a **124M-parameter OCR correction model** that efficiently fixes digitization errors on CPU and GPU, unlocking knowledge from PDFs. On AI tools, **LangChainAI** launched **Prompt Canvas** for collaborative **prompt engineering**, while **DeepSeek** released **JanusFlow 1.3B**, a unified multimodal LLM integrating autoregressive and rectified flow models for enhanced **image understanding** and **generation**. **Alibaba Cloud** announced **Qwen2.5-Coder**, a code-focused LLM with advanced coding capabilities, and **Claude 3.5 Sonnet** was highlighted for superior code generation. Discussions on **quantization challenges** and **scaling laws for precision** by **Tim Dettmers** and others emphasized the impact of low-precision training on model scalability and inference efficiency. Insights from the *"Scaling Laws for Precision"* paper and alternative efficiency methods were also noted.


AI News for 11/12/2024-11/13/2024. We checked 7 subreddits, 433 Twitters and 30 Discords (217 channels, and 2494 messages) for you. Estimated reading time saved (at 200wpm): 274 minutes. You can now tag @smol_ai for AINews discussions!

Great dataset releases always precede great models. The last time we covered FineWeb (our coverage here), a generation of GPT-2 speedruns ensued. Today, Pleias (via Hugging Face) is here with an updated Common Corpus: "the largest fully open multilingual dataset for training LLMs, containing over 2 trillion tokens of permissibly licensed content with provenance information (2,003,039,184,047 tokens)."


Apart from the meticulous provenance, the team also used OCRonos-Vintage, "a lightweight but powerful OCR correction model that fixes digitization errors at scale. Running efficiently on both CPU and GPU, this 124M-parameter model corrects spacing issues, replaces incorrect words, and repairs broken text structures." This unlocks a whole lot of knowledge in PDFs.

Common Corpus was first released in March with 500B tokens, so it is nice to see this work grow.




AI Twitter Recap

All recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Tools and Development

AI Model Releases and Updates

AI Research and Technical Insights

Developer Tips and Tools

AI Adoption and Impact

Memes/Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Qwen 2.5 Coder Improves 128K Context but Faces Usability Challenges

Theme 2. Scaling Laws in Precision and CPU Inference Tests

Theme 3. Largest Mixture of Expert Models: Analysis and Performance

Theme 4. Unreliable Responses in Qwen 2.5: Self-Identification Issues

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

Theme 1. AI Video Gen Evolution: CogVideoX 5B & DimensionX Release

Theme 2. Claude's Performance Issues & Rate Limits Spark User Frustration

Theme 3. Gemini Now Accessible via OpenAI API Library

Theme 4. Greg Brockman Returns to OpenAI Amid Leadership Changes

Theme 5. Major AI Companies Hitting Scaling Challenges


AI Discord Recap

A summary of Summaries of Summaries by o1-preview

Theme 1. New AI Models Shake Up the Landscape

Theme 2. AI Tools and Integrations Enhance Workflows

Theme 3. Benchmark Showdowns: Models Put to the Test

Theme 4. Community Voices Concerns Over AI Trends

Theme 5. Tech Challenges Push Developers to Innovate


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


HuggingFace Discord


Perplexity AI Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


LM Studio Discord


Stability.ai (Stable Diffusion) Discord


Interconnects (Nathan Lambert) Discord


Notebook LM Discord Discord


Latent Space Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


OpenAI Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


Cohere Discord


OpenAccess AI Collective (axolotl) Discord


DSPy Discord


OpenInterpreter Discord


LAION Discord


LLM Agents (Berkeley MOOC) Discord


Alignment Lab AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


AI21 Labs (Jamba) Discord


The LLM Finetuning (Hamel + Dan) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (160 messages🔥🔥):

  • Qwen Coder Models
  • Multi-GPU Training
  • Dataset Formatting
  • Finetuning VRAM Requirements
  • Inference System Tokens



Unsloth AI (Daniel Han) ▷ #off-topic (2 messages):

  • User reactions
  • Discord interactions

Unsloth AI (Daniel Han) ▷ #help (178 messages🔥🔥):

  • Gemma 2B RAM usage
  • Flash Attention installation issues
  • RAG experience and usage
  • Ollama model management
  • Training on responses only



Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

  • Optillm release
  • Hyperstack legitimacy

Link mentioned: GitHub - codelion/optillm: Optimizing inference proxy for LLMs: Optimizing inference proxy for LLMs. Contribute to codelion/optillm development by creating an account on GitHub.


HuggingFace ▷ #general (270 messages🔥🔥):

  • Qwen Model Performance
  • AI Project Ethical Considerations
  • GPU Specifications and Performance
  • Langchain and Hugging Face Integration
  • Deep Learning Model Suggestions



HuggingFace ▷ #cool-finds (12 messages🔥):

  • LLM for E-commerce Branding
  • Cell Journal Research Confirmation
  • Learning Machine Learning
  • Cross-Posting Concerns

HuggingFace ▷ #i-made-this (28 messages🔥):

  • LightRAG Article
  • Sulie Foundation Model
  • ZeroGPU Debugging
  • Benchmarking VLA Models
  • PromptDX Usability



HuggingFace ▷ #reading-group (3 messages):

  • Reading time adjustment
  • User timezone concerns

HuggingFace ▷ #computer-vision (2 messages):

  • Open3D-ML Development
  • O3D Historical Context
  • 3D Object Classification Techniques



HuggingFace ▷ #NLP (4 messages):

  • Legal Document Retrieval
  • Enhancing Retrieval with Graphs
  • Fine-tuning Tokenizers for Vietnamese Models

HuggingFace ▷ #diffusion-discussions (8 messages🔥):

  • SDXL Lightning Model
  • Realtime Image Generation Workflows
  • Training Diffusion Models

Link mentioned: Reddit - Dive into anything: no description found


Perplexity AI ▷ #announcements (1 message):

  • Advertising Experiment
  • User Experience Assurance

Perplexity AI ▷ #general (282 messages🔥🔥):

  • Perplexity AI Subscription Model
  • Perplexity Ads Implementation
  • Model Selection Issues
  • User Experience Concerns
  • Fractal Machine Learning



Perplexity AI ▷ #sharing (7 messages):

  • 4-7-8 Breathing Technique
  • TCP/IP Guide for IT Aspirants
  • History of the Paralympics
  • Differences in AI Models
  • Nostalgia for Childhood Games

Perplexity AI ▷ #pplx-api (4 messages):

  • search_domain_filter
  • Vercel AI SDK with Perplexity
  • Reddit citations issues

Eleuther ▷ #general (54 messages🔥):

  • Career Paths in ML Optimization
  • Performance Improvement Strategies
  • AI Conference Insights
  • Job Application Trends at Big Tech
  • Internship Experiences in AI

Link mentioned: AI Conference Deadlines: no description found


Eleuther ▷ #research (98 messages🔥🔥):

  • Saddle Points in Gradient Descent
  • Batch Normalization and Alternatives
  • Vision Language Action Models
  • Binding Problem in Neural Networks
  • Complex Valued Latents vs. Real Valuations



Eleuther ▷ #scaling-laws (1 message):

  • Greedy Line Search
  • Gradient Descent Stepsizes
  • Periodicity in Optimization

Link mentioned: Tweet from Ben Grimmer (@prof_grimmer): I've proven the strangest result of my career.. The classic idea that gradient descent's rate is best with constant stepsizes 1/L is wrong. The idea that we need stepsizes in (0,2/L) for conve...


Eleuther ▷ #lm-thunderdome (20 messages🔥):

  • Checkpoint Issues with EvalHarness
  • Finetuned Model Performance on Sentiment Analysis
  • Accurate Averaging for CoT Accuracy
  • Multi-Class Text Classification Task Design
  • Metrics for New Task with Two Metrics

Eleuther ▷ #gpt-neox-dev (24 messages🔥):

  • Single GPU Bugs
  • DagsHub Integration
  • YAML File Extensions
  • Model Training and Maintenance
  • Error Handling in Configurations

Link mentioned: GitHub - markNZed/gpt-neox at pipe_parallel_size_1: An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries - GitHub - markNZed/gpt-neox at pipe_parallel_size_1


OpenRouter (Alex Atallah) ▷ #announcements (1 message):

  • UnslopNemo 12B
  • SorcererLM
  • Inferor 12B
  • Mistral Parameter Updates
  • UI Improvements



OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

  • GitHub Open Source Posting Rules

OpenRouter (Alex Atallah) ▷ #general (186 messages🔥🔥):

  • Model Performance Issues
  • Tool Calling Functionality
  • Image Generation APIs
  • Qwen Model Updates
  • Mistral Large Output Quality



OpenRouter (Alex Atallah) ▷ #beta-feedback (7 messages):

  • Custom Provider Keys Access Requests

aider (Paul Gauthier) ▷ #announcements (1 message):

  • Aider v0.63.0 Release
  • Qwen 2.5 Coder Support
  • Web Command Functionality
  • Improved Language Prompting
  • Bug Fixes and Performance Enhancements

aider (Paul Gauthier) ▷ #general (96 messages🔥🔥):

  • Vectorizing and Reranking Read-Only Files
  • Aider Extensions for VSCode and Neovim
  • Issues with Sonnet Performance
  • OpenRouter Provider Configuration
  • Upcoming AI Conferences



aider (Paul Gauthier) ▷ #questions-and-tips (61 messages🔥🔥):

  • Aider integration suggestions
  • Using Aider in Architect mode vs other modes
  • Using git diff with Aider
  • Setting up Aider in Termux
  • Using Rust Analyzer in VSCode



aider (Paul Gauthier) ▷ #links (4 messages):

  • SupermavenAI joins Cursor
  • Organizing codebase for AI
  • Challenges faced with AI coding tools



LM Studio ▷ #general (62 messages🔥🔥):

  • Quantization Differences
  • LM Studio Connection to TTS
  • Performance Issues with Qwen 2.5
  • LaTeX Rendering in LM Studio
  • Python Script for Llama.cpp Integration



LM Studio ▷ #hardware-discussion (40 messages🔥):

  • GPU Combinations
  • Local LLM Usage on Macs vs PCs
  • Partial Offload Challenges
  • Cloud vs Local Model Usage
  • Upcoming Hardware Competition

Stability.ai (Stable Diffusion) ▷ #general-chat (87 messages🔥🔥):

  • Training with Dreambooth
  • Using Animatediff for Video Generation
  • Downloading Checkpoint Files
  • Python Version for Stable Diffusion
  • Accessing Discord Servers



Interconnects (Nathan Lambert) ▷ #news (48 messages🔥):

  • Nous 3 Model Performance
  • Francois Chollet Leaves Google
  • AI Agent Tool Operator Launch
  • JanusFlow Model Introduction
  • Community Discussions on Gwern



Interconnects (Nathan Lambert) ▷ #other-papers (6 messages):

  • Jailbreak Rapid Response
  • Prompt Injection at Anthropic
  • Internal Models of Anthropic



Interconnects (Nathan Lambert) ▷ #ml-questions (9 messages🔥):

  • Vision Language Models (VLMs)
  • Finbarr Blog Discussion
  • Computational Cost of VLM Inference
  • Recent VLM Papers



Interconnects (Nathan Lambert) ▷ #memes (4 messages):

  • MKBHD controversy
  • ChatGPT vs Claude humor



Interconnects (Nathan Lambert) ▷ #rlhf (1 message):

swyxio: we are looking for someone on this yes


Interconnects (Nathan Lambert) ▷ #reads (11 messages🔥):

  • Dylan Patel's Inference Math Lecture
  • Jonathan Frankle's AI Insights
  • Hiring Practices at Databricks



Interconnects (Nathan Lambert) ▷ #posts (2 messages):

  • SnailBot News

Notebook LM Discord ▷ #use-cases (19 messages🔥):

  • KATT Integration for Podcasting
  • Research Consultancy Client Data Management
  • Podcast Creation Limits
  • Sharing NotebookLM Outside Organizations
  • Magic Book Experiment with Podcast

Link mentioned: A discussion on How to Improve Health Evidence-Backed Nutrition Tips: #HealthTips #NutritionForWellness #EvidenceBasedDiet nutrition tips, improve health, evidence-based nutrition, healthy lifestyle, diet tips, healthy eating, m...


Notebook LM Discord ▷ #general (56 messages🔥🔥):

  • NotebookLM usage tips
  • Podcast generation issues
  • Data security concerns



Latent Space ▷ #ai-general-chat (64 messages🔥🔥):

  • Supermaven joins Cursor
  • Windsurf Editor launch
  • Perplexity ads introduction
  • Mira Lab updates
  • RAG future predictions



Latent Space ▷ #ai-announcements (2 messages):

  • Building AI for the Enterprise
  • Windsurf Editor Launch
  • Cascade Flow Experience
  • General Access Tools
  • Community Feedback



GPU MODE ▷ #general (2 messages):

  • XOR Tensor Cores
  • Beamforming Algorithms
  • Lambda as a Stable Option

GPU MODE ▷ #triton (13 messages🔥):

  • Triton Kernel Functionality
  • Libstdc++ Issues in Triton with Conda
  • GEMM Kernel Design in Triton
  • Warp Memory Alignment Error
  • Lowering ttir to Hexagon Dialect



GPU MODE ▷ #torch (7 messages):

  • torch.compile memory usage
  • dynamic shapes impact
  • profiling GPU memory allocations
  • direct access to GPU

GPU MODE ▷ #cool-links (1 message):

0xredj: https://latent.toys


GPU MODE ▷ #jobs (1 message):

  • SEMRON job opening
  • Quantization methods
  • Inference framework development
  • Collaborative ML and hardware work
  • Open-source contributions

Link mentioned: ML Quantization Engineer | Jobs at SEMRON: We are SEMRON, a startup from Dresden, and we develop innovative microchips for AI applications.


GPU MODE ▷ #beginner (2 messages):

  • Lecture Access
  • Discord Channel Information

GPU MODE ▷ #off-topic (9 messages🔥):

  • AI-generated images
  • Food generation
  • Identity impersonation
  • Food cycle
  • Bot verification

GPU MODE ▷ #rocm (3 messages):

  • MI300X performance
  • FP16 peak throughput
  • Triton on AMD GPUs

Link mentioned: Triton on AMD GPUs: Lei Zhang and Lixun Zhang talk to Triton support for AMD. This talk shows off some very clever optimization techniques around chiplets and also instruction s...


GPU MODE ▷ #liger-kernel (2 messages):

  • Liger-Kernel v0.4.1 Release
  • Gemma 2 Support
  • CrossEntropy Patching Fix

Link mentioned: Release v0.4.1: Gemma 2 Support, CrossEntropy Patching FIx, and GroupNorm · linkedin/Liger-Kernel: Highlights Gemma 2 Support: The long pending gemma 2 is finally supported thanks to @Tcc0403! He has implemented the nasty softcapping in fused linear cross entropy (#320) and discovered the conv...


GPU MODE ▷ #self-promotion (6 messages):

  • Flash Attention Overview
  • Video Feedback
  • CUDA and Triton Programming
  • Multi-Head Attention
  • Community Praise

GPU MODE ▷ #🍿 (4 messages):

  • Discord Leaderboard UX
  • Bot Development
  • Dataset Collection

Link mentioned: Ideas for what to do next · Issue #6 · gpu-mode/discord-cluster-manager: Discord based leaderboard UX How does the leaderboard get rendered @AndreSlavescu Slash commands that autofill that a script is expected and maybe more information like the kernel name or gpu type ...


GPU MODE ▷ #thunderkittens (15 messages🔥):

  • TK improvements
  • H100 DSMEM limitations
  • Cooperative groups in CUDA
  • Semaphore synchronization
  • TMA instruction usage

Link mentioned: ThunderKittens/tests/unit/warp/memory/tile/dsmem.cu at 06d654a0858840e006d428cd96aac2cd0d19ca25 · HazyResearch/ThunderKittens: Tile primitives for speedy kernels. Contribute to HazyResearch/ThunderKittens development by creating an account on GitHub.


GPU MODE ▷ #edge (1 message):

  • Quantization Techniques
  • Algorithm Optimization
  • Deployment Strategies
  • Memory Efficiency
  • Hybrid Processing

tinygrad (George Hotz) ▷ #general (29 messages🔥):

  • Tinygrad distributed approach
  • Multi-node FSDP
  • Data handling in Cloud
  • Device-to-device communication
  • Performance concerns with sharding

Link mentioned: How multigpu training works: Tutorials on tinygrad


tinygrad (George Hotz) ▷ #learn-tinygrad (32 messages🔥):

  • Unsigned Tensor min() bug
  • Asynchronous mode for Tinygrad inference
  • Using BitwiseNot for integer operations
  • Pull Request updates
  • Realizing Tensors



OpenAI ▷ #ai-discussions (28 messages🔥):

  • AI language models and date awareness
  • Blaze AI for content creation
  • AI songwriting tools
  • ChatGPT's UI control system
  • Copy-paste issues on mobile

OpenAI ▷ #gpt-4-discussions (11 messages🔥):

  • Prompt Engineering
  • ChatGPT o1-preview Feedback
  • Model Limitations
  • Using VPNs
  • Tinkering with Requests

OpenAI ▷ #prompt-engineering (5 messages):

  • Scratchpad Techniques
  • Structured Output
  • Prompt Management Challenges

OpenAI ▷ #api-discussions (5 messages):

  • Scratchpad Technique
  • Prompt Management Issues

Modular (Mojo 🔥) ▷ #general (13 messages🔥):

  • exllamav2
  • MAX Involvement
  • Error with Batched MatMul

Link mentioned: GitHub - turboderp/exllamav2: A fast inference library for running LLMs locally on modern consumer-class GPUs: A fast inference library for running LLMs locally on modern consumer-class GPUs - turboderp/exllamav2


Modular (Mojo 🔥) ▷ #mojo (33 messages🔥):

  • Mojo JIT Compiler
  • MAX Platform Overview
  • UnsafePointer in Mojo
  • Mojo's Performance Capabilities
  • Mana Project References



LlamaIndex ▷ #blog (2 messages):

  • GenAI pipelines
  • RAG workflows
  • Dynamic section retrieval

LlamaIndex ▷ #general (35 messages🔥):

  • Vocera Launch on Product Hunt
  • RAG vs Reporting Debate
  • Chatbots in Corporate Settings
  • Blockchain Engineering Expertise



LlamaIndex ▷ #ai-discussion (1 message):

  • Vocera Launch
  • AI Voice Agents

Link mentioned: Vocera - Launch voice agents faster with simulation & monitoring | Product Hunt: Vocera helps AI developers build production-ready voice agents 10X faster. It generates adversarial scenarios, simulates realistic calls and gives actionable insights to your agents. It also monitors ...


Cohere ▷ #questions (6 messages):

  • Payment Management Functionality
  • Card Decline Issues
  • Cohere API Internal Server Error
  • Rerank API Best Practices

Link mentioned: Rerank Best Practices — Cohere: Tips for optimal endpoint performance, including constraints on the number of documents, tokens per document, and tokens per query.


Cohere ▷ #api-discussions (19 messages🔥):

  • API Errors
  • Recent Fix Deployment
  • Reranking Issues
  • Endpoint Troubles
  • Model Usage

Cohere ▷ #projects (1 message):

  • Benchmarking Vision Language Action models
  • Collaboration on Robotics Research
  • VLM & VLA Performance Evaluation

Link mentioned: Tweet from harsh (@HarshSikka): Excited to share our new paper "Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks" We evaluate how well VLM & VLA models can control robots across 20 different real-wor...


Cohere ▷ #cohere-toolkit (3 messages):

  • ICS Support for Events
  • File Content Viewing Feature

OpenAccess AI Collective (axolotl) ▷ #general (6 messages):

  • Docker Images
  • Version Tagging
  • Axolotl Release Updates

Link mentioned: make sure to tag images in docker for tagged releases by winglian · Pull Request #2051 · axolotl-ai-cloud/axolotl: Description Motivation and Context How has this been tested? Screenshots (if appropriate) Types of changes Social Handles (Optional)


OpenAccess AI Collective (axolotl) ▷ #other-llms (5 messages):

  • Qwen2.5 Coder Sizes
  • Performance Comparison of Qwen2.5

Link mentioned: Qwen2.5 Coder 32B vs GPT4o vs Claude 3.5 Sonnet (new): Let's see which model is the best


OpenAccess AI Collective (axolotl) ▷ #general-help (11 messages🔥):

  • qwen-2.5-7B-Instruct configuration
  • train_on_inputs clarification
  • SFTrainer and TRL comparison



OpenAccess AI Collective (axolotl) ▷ #announcements (3 messages):

  • Release of version 0.5.0
  • Feedback and Issues Reporting

Link mentioned: Release v0.5.0 · axolotl-ai-cloud/axolotl: What's Changed fix(log): improve warning to clarify that lora_modules_to_save expect a list by @NanoCode012 in #1197 Add: colab example by @JohanWork in #1196 Feat/chatml add system message by @m...


DSPy ▷ #show-and-tell (1 message):

  • Forge Reasoning API
  • Nous Chat Evolution

DSPy ▷ #general (8 messages🔥):

  • DSPY Comparative Analysis
  • Zero Shot vs Few Shot Prompting
  • LLM Call Tracking
  • GitHub Notebooks



OpenInterpreter ▷ #general (5 messages):

  • Developer branch updates
  • UI improvements
  • Streamed responses handling
  • Open Interpreter
  • Text editor development

OpenInterpreter ▷ #ai-content (3 messages):

  • OpenCoder
  • AI Bubble Predictions
  • GPU Investments
  • Dot-Com Bubble Parallels



LAION ▷ #general (4 messages):

  • MongoDB dismissal
  • Social services and employment
  • EPOCH 58 COCK model updates

LAION ▷ #research (3 messages):

  • Vision Language Action Models
  • Watermark Anything
  • Robotic Learning Tasks
  • AI Generators Performance



LLM Agents (Berkeley MOOC) ▷ #mooc-questions (4 messages):

  • TapeAgents Queries
  • AI Agents for Enterprise Workflows
  • Communication via Tape

Alignment Lab AI ▷ #general (2 messages):

  • Latent Toys website

Link mentioned: latent.toys: no description found


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (2 messages):

  • Palmyra X 004 model
  • Writer handler integration
  • Pull Request Review Process

Link mentioned: [BFCL] Add support for Writer models and Palmyra X 004 by samjulien · Pull Request #755 · ShishirPatil/gorilla: This PR adds support for Writer models and our latest Palmyra X 004 to BFCL. Thank you!


AI21 Labs (Jamba) ▷ #general-chat (2 messages):

  • Legacy models deprecation
  • Open source solutions




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}