Frozen AI News archive

Nothing much happened today

**HuggingFace** released a browser-based timestamped Whisper using transformers.js. A Twitter bot by **truth_terminal** became the first "semiautonomous" bot to secure VC funding. **Microsoft** and **Apple** abruptly left the **OpenAI** board amid regulatory scrutiny. **Meta** is finalizing a major upgrade to Reddit comments addressing hallucination issues. The **Yi model** gained popularity on GitHub with 7.4K stars and 454 forks, with potential integration with **Axolotl** for pregeneration and preprocessing. **AMD** technologies enable household/small business AI appliances. **Meta** released **Chameleon-7b** and **Chameleon-30b** models on HuggingFace supporting unified text and image tokenization. **Salesforce**'s **xLAM-1b** model outperforms **GPT-3.5** in function calling despite its smaller size. **Anole** pioneered open-source multimodal text-image-video generation up to 720p 144fps. **Phi-3 Mini** expanded from 3.8B to 4.7B parameters with function calling, competing with **Mistral-7b v3**. *"System 2 distillation"* in humans relates to automaticity and procedural memory.

Canonical issue URL

AI News for 7/9/2024-7/10/2024. We checked 7 subreddits, 384 Twitters and 29 Discords (463 channels, and 2339 messages) for you. Estimated reading time saved (at 200wpm): 250 minutes. You can now tag @smol_ai for AINews discussions!

Yesterday was busy busy, today wasn't. A smattering of tiny morsels, more entertaining than anything:

Meta: we are in the final stages of a major upgrade to reddit comments, following the hallucination conversation from yesterday.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

Yi AI Model Updates and Integrations

Cognitive Computing AI's Tweets and Discussions

AI and Human Cognition

Miscellaneous


AI Reddit Recap

Across r/LocalLlama, r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity. Comment crawling works now but has lots to improve!

AI Model Releases and Developments

AI Applications and Use Cases

AI Ethics and Governance


AI Discord Recap

A summary of Summaries of Summaries

Claude 3 Sonnet

1. New Language Model Releases

2. AI Model Benchmarking and Evaluation

3. Synthetic Data Generation and Feedback Loops

Claude 3.5 Sonnet

1. Anole: First Open-Source Auto-Regressive LMM

2. xAI's Ambitious H100 Cluster Expansion

3. AMD's Strategic AI Acquisition of Silo AI

4. GitHub Copilot Copyright Lawsuit Update

Claude 3 Opus

1. Ghost 8B Beta Launch

2. Llama 3 Training Discussions

3. Model Saving Stumbles

4. Model Showdown: Gemini vs DeepSeek

5. CodeGeeX4 Cracks the Code

GPT4T (gpt-4-turbo-2024-04-09)

1. Multilingual LLMs

2. Model Fine-Tuning and Optimization

3. AI Hardware and Infrastructure

4. AI Legal and Ethical Issues

5. AI Community Initiatives

GPT4O (gpt-4o-2024-05-13)

$PLSDELETTHIS{openaiSummaryO}


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


HuggingFace Discord


CUDA MODE Discord


OpenAI Discord


LM Studio Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


Perplexity AI Discord


Nous Research AI Discord


LlamaIndex Discord


Stability.ai (Stable Diffusion) Discord


OpenRouter (Alex Atallah) Discord


Interconnects (Nathan Lambert) Discord


LAION Discord


LangChain AI Discord


Cohere Discord


OpenInterpreter Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


OpenAccess AI Collective (axolotl) Discord


AI Stack Devs (Yoko Li) Discord


LLM Finetuning (Hamel + Dan) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Perf Enthusiasts AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (478 messages🔥🔥🔥):

  • Gemini-1.0-pro Finetuning
  • Kaggle Notebook for Gemma-2-9b
  • Qwen2 Finetuning and Effectiveness
  • Synthetic Data Generation
  • Koboldcpp for Local LLMs

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (18 messages🔥):

  • Llama 3 model usage in Unsloth
  • Swedish language model
  • Training advice for Llama 3
  • Inference speed issues
  • Mac GPU performance for model inference

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (35 messages🔥):

  • GPTs Agents
  • Learning Rate Schedulers
  • Xformers Version Compatibility Issues
  • GGUF Model Saving and Loading
  • Custom Token Embeddings Training

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

  • Ghost 8B Beta
  • Language Models
  • Multilingual Support
  • Knowledge Capabilities
  • Cost-Effectiveness

Links mentioned:


Unsloth AI (Daniel Han) ▷ #community-collaboration (2 messages):

  • New token embeddings
  • Vocab expansion challenges

Unsloth AI (Daniel Han) ▷ #research (3 messages):

  • FSDP2 example
  • Norm tweaking for LLMs quantization

Links mentioned:


HuggingFace ▷ #announcements (1 messages):

  • Google TPUs
  • Datasets Filtering
  • Gemini Nano in Browser
  • Rust-based Inference
  • Depth Estimation Models

Links mentioned:


HuggingFace ▷ #general (297 messages🔥🔥):

  • Life advice on Data Science vs. Machine Learning
  • Gemma Model Usage Issues
  • Debugging and Coding Tips
  • Transformers and LLM Usage
  • Neural Network Training Issues

Links mentioned:


HuggingFace ▷ #today-im-learning (4 messages):

  • freeCodeCamp Machine Learning course
  • Finding help from communities
  • Triplet collapse in embedding models

HuggingFace ▷ #cool-finds (10 messages🔥):

  • Generative AI's impact on storytelling
  • KMWorld AI 100
  • Fine-tuning LLMs with QLoRA
  • Candle running on iOS
  • Wav2Lip lip-synching issues

Links mentioned:


HuggingFace ▷ #i-made-this (15 messages🔥):

  • qdurllm demo
  • Branchy-phi-2 showcase
  • Knowledge Graphs workshop
  • Early Exit in LLM
  • MCQ generation App

Links mentioned:


HuggingFace ▷ #computer-vision (5 messages):

  • User appreciation
  • Image segmentation issues

CUDA MODE ▷ #general (12 messages🔥):

  • Shared memory in GPUs
  • Hackathon team formation

Links mentioned:


CUDA MODE ▷ #cool-links (1 messages):

  • AMD acquisition
  • AI start-ups in Europe
  • Silo AI
  • LLMs development
  • Nvidia competition

Link mentioned: AMD to buy Finnish start-up Silo AI for $665mn in drive to compete with Nvidia : All-cash acquisition by California-based chipmaker is the largest of its kind in Europe in a decade


CUDA MODE ▷ #jobs (1 messages):

  • Job Opportunities
  • Remote Work
  • AI Research
  • PyEmber Framework
  • Hugging Face DRL Leaderboard

Link mentioned: Waleed Salah Eldin Resume.pdf: no description found


CUDA MODE ▷ #beginner (37 messages🔥):

  • Nvidia's tensor offloading
  • CUDA and GPU learning on MacBook
  • HIP as AMD's CUDA equivalent
  • Google Colab for CUDA learning
  • Future GPU purchase considerations

Links mentioned:


CUDA MODE ▷ #off-topic (1 messages):

apaz: https://x.com/typedfemale/status/1810025768715686188


CUDA MODE ▷ #llmdotc (237 messages🔥🔥):

  • MuAdam Implementation Issues
  • Embedding Weight Initialization
  • Performance of StableAdam
  • Handling Loss Spikes
  • MuP Learning Rate Impact

Links mentioned:


OpenAI ▷ #ai-discussions (199 messages🔥🔥):

  • AI locking mechanism
  • Training configurations for AI projects
  • High-performance computing and GPUs
  • Neural network feature extraction
  • Decentralized computing

OpenAI ▷ #gpt-4-discussions (16 messages🔥):

  • User Frustration with ChatGPT Responses
  • Limitations of Search Results
  • Context Window Specifications
  • Differentiation Between Plan and Platform Dependency
  • Topic Relevance to Channels

OpenAI ▷ #prompt-engineering (1 messages):

  • thought process for custom GPT
  • accurate and truthful responses

OpenAI ▷ #api-discussions (1 messages):

  • Custom GPT Thought Process
  • Testing and Feedback for Custom GPT

LM Studio ▷ #💬-general (100 messages🔥🔥):

  • LM Studio configuration issues
  • Running LLMs on mobile devices
  • Multi-GPU support in LM Studio
  • Custom model import in DiffusionBee
  • LM Studio text embedding limitations

Link mentioned: Text Embeddings | LM Studio: Text embeddings are a way to represent text as a vector of numbers.


LM Studio ▷ #🤖-models-discussion-chat (44 messages🔥):

  • Comparing deepseek chat and coder models
  • Flash attention settings
  • Optimal AI models for personal accountability
  • GLM4 support in llama.cpp
  • CodeGeeX4 coding model

Links mentioned:


LM Studio ▷ #🧠-feedback (4 messages):

  • LM Studio bugs
  • Hugging Face accessibility

Link mentioned: lmstudio-community (LM Studio Community): no description found


LM Studio ▷ #🎛-hardware-discussion (40 messages🔥):

  • 3090 vs 4090 performance
  • Electricity costs in Australia
  • Building multi-GPU setups
  • AMD acquires SiloAI
  • Intel Arc 770 for AI

Links mentioned:


LM Studio ▷ #amd-rocm-tech-preview (2 messages):

  • LM Studio update issues
  • Installing multiple versions of LM Studio

Latent Space ▷ #ai-general-chat (55 messages🔥🔥):

  • GPTs Agents
  • OpenAI's sidebars
  • Chroma chunking strategies
  • Turbopuffer launch
  • xAI H100s cluster

Links mentioned:


Latent Space ▷ #llm-paper-club-west (93 messages🔥🔥):

  • ColBERT paper
  • AI agent implementations
  • ImageBind
  • SBERT training
  • Multi-agent systems

Links mentioned:


Modular (Mojo 🔥) ▷ #general (9 messages🔥):

  • Writing AI with Mojo
  • Qualcomm's SNPE and Mojo
  • Modverse Weekly Issue
  • Chris Lattner on ThePrimeagen

Links mentioned:


Modular (Mojo 🔥) ▷ #💬︱twitter (1 messages):

ModularBot: From Modular: https://twitter.com/Modular/status/1810782477079957831


Modular (Mojo 🔥) ▷ #✍︱blog (3 messages):

  • PyTorch in Enterprise
  • AI Development Challenges
  • Generative AI Adoption

Links mentioned:


Modular (Mojo 🔥) ▷ #tech-news (3 messages):

  • Mr. Lattner event
  • The Primeagen Twitch

Link mentioned: ThePrimeagen - Twitch: 🚨🚨 HIGH SPEED GAME PROGRAMMING🚨🚨 !today


Modular (Mojo 🔥) ▷ #🔥mojo (96 messages🔥🔥):

  • Zero-copy deserialization in Mojo
  • Reference and value passing in Mojo
  • Building the Mojo compiler
  • Syntax of __setitem__ in Mojo
  • Memory management and ownership in Mojo

Links mentioned:


Modular (Mojo 🔥) ▷ #📰︱newsletter (1 messages):

Zapier: Modverse Weekly - Issue 39 https://www.modular.com/modverse/modverse-weekly-issue-39


Modular (Mojo 🔥) ▷ #nightly (4 messages):

  • GitHub Issue #3208
  • Setitem and getitem issues
  • Mojo nightly release
  • Pattern matching requirements

Link mentioned: [BUG] Opening a unix fifo in "write" mode raises an exception · Issue #3208 · modularml/mojo: Bug description I'm not sure why this is failing, mentioned it on Discord and was asked to open an issue: $ mojo run src/main.mojo Unhandled exception caught during execution: unable to remove exi...


Modular (Mojo 🔥) ▷ #mojo-marathons (18 messages🔥):

  • Graviton 4 Instances
  • Benchmark Variance
  • Symmetrical vs Asymmetrical Benchmarking

Links mentioned:


Eleuther ▷ #general (52 messages🔥):

  • entity disambiguation papers
  • LLM-based synthetic data generation tools
  • empathy LLMs papers
  • EleutherAI community map update
  • Diffusion Models with Exponential Integrator paper

Links mentioned:


Eleuther ▷ #research (21 messages🔥):

  • RegMix data mixture approach
  • VLM failures on simple visual tasks
  • Composability of LM interventions
  • Self-improvement using synthetic data

Links mentioned:


Eleuther ▷ #scaling-laws (5 messages):

  • brain size and intelligence
  • cortical neuron count in mammals
  • neuron density in birds and lizards
  • bigger animals, bigger brains
  • genetics and IQ

Eleuther ▷ #interpretability-general (2 messages):

  • EleutherAI at ICML
  • ICML social thread
  • ICML announcement

Eleuther ▷ #lm-thunderdome (2 messages):

  • vllm updates
  • GPU memory utilization with older GPUs

Perplexity AI ▷ #announcements (1 messages):

  • Perplexity Enterprise Pro partnership
  • AWS marketplace collaboration

Perplexity AI ▷ #general (57 messages🔥🔥):

  • Gemini 1.5 vs Claude 3
  • Perplexity AI Image Generation
  • Context Window Limits
  • Pharmacy Cash Prices
  • Plans for Claude 3.5 Opus

Perplexity AI ▷ #sharing (6 messages):

  • AI Health Coaches
  • Robotic Factories
  • DNA Ointments
  • Digital Libraries
  • Sealand

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (6 messages):

  • PPLX Library Setup Issues
  • API Rate Limits and Citation Feature
  • API Balance Top Up Issue
  • Understanding API Pricing

Nous Research AI ▷ #off-topic (2 messages):

  • error.pdf Tenor GIF
  • Discussion reaction

Link mentioned: Gary Marcus Yann Lecun GIF - Gary Marcus Yann LeCun Lecun - Discover & Share GIFs: Click to view the GIF


Nous Research AI ▷ #interesting-links (7 messages):

  • Anole LMM
  • Autoverse
  • Image Resolution in Chameleon
  • Open Source Image Model

Links mentioned:


Nous Research AI ▷ #general (33 messages🔥):

  • Sonnet's base64 image generation
  • Gemini 1.5 jailbreak for car breaking instructions
  • Anole model fine-tune controversy
  • Bitnet model updates
  • AI Regulation and GOP stance

Links mentioned:


Nous Research AI ▷ #ask-about-llms (8 messages🔥):

  • Trainer Instructions
  • Marker Library for PDF Conversion
  • RAG Solutions
  • Parsing PDFs
  • QWen2 Discussion

Link mentioned: GitHub - VikParuchuri/marker: Convert PDF to markdown quickly with high accuracy: Convert PDF to markdown quickly with high accuracy - VikParuchuri/marker


Nous Research AI ▷ #rag-dataset (16 messages🔥):

  • Generic RAG schema
  • Reranking Relevance
  • Token Efficiency

LlamaIndex ▷ #blog (4 messages):

  • AGI House Hackathon
  • LlamaCloud for Data ETL/Management
  • $1M+ ARR with LlamaIndex
  • Launch of Llama-Agents

Link mentioned: AGI House: no description found


LlamaIndex ▷ #general (55 messages🔥🔥):

  • astream_chat implementation errors
  • LlamaParse usage for PDF data extraction
  • Query engine template issues in RAG
  • Ollama vs Llama-3/Mistral performance
  • Handling formatting in LLMs

Links mentioned:


Stability.ai (Stable Diffusion) ▷ #general-chat (58 messages🔥🔥):

  • Stable Diffusion on Mac
  • Adetailer and VAE Encoding
  • Basic Setup for Stable Diffusion
  • Performance on Different GPUs
  • High-Resolution Fix Features

Links mentioned:


OpenRouter (Alex Atallah) ▷ #general (50 messages🔥):

  • LLM applications for language translation
  • LangChain issues affecting OpenRouter
  • LLM eval frameworks
  • Rate limit for Gemini model
  • Noromaid model removal

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (11 messages🔥):

  • 405b Update July 23rd
  • HarmonicMath's Theorem Proving Breakthroughs
  • Legal Compliance with AI Development

Link mentioned: Tweet from Harmonic (@HarmonicMath): We are excited to share three major updates today on our path to mathematical superintelligence 🦾 1. A new state-of-the-art of 90% on the MiniF2F benchmark. This beats our previously announced 83% f...


Interconnects (Nathan Lambert) ▷ #ml-questions (10 messages🔥):

  • Control Vector
  • Steering Vector
  • Concept Vectors
  • Feature Clamping
  • MuSR Benchmark

Interconnects (Nathan Lambert) ▷ #random (4 messages):

  • Newsletter Distribution Challenges
  • App-Based Reading Preferences

Interconnects (Nathan Lambert) ▷ #memes (3 messages):

  • Sam Altman Koenigsegg
  • Who from Whoville

Link mentioned: Tweet from ₕₐₘₚₜₒₙ — e/acc (@Hamptonism): Sam Altman, CEO of OpenAi in his Koenigsegg Regera.


Interconnects (Nathan Lambert) ▷ #rlhf (5 messages):

  • Paper discussion on y_l vs y_w
  • DPO and policy implications
  • Overfitting in DPO
  • AI2 Slides Presentation

Link mentioned: DPO and overfitting: The Overfitting Test DPO and its derivatives, all fail to overfit a small training data But changing the loss to CrossEntropy of Chosen does overfit Example of a naive task where DPO fails: Create a s...


Interconnects (Nathan Lambert) ▷ #posts (1 messages):

SnailBot News: <@&1216534966205284433>


LAION ▷ #general (14 messages🔥):

  • Court ruling on AI copyright
  • Microsoft and Apple leaving OpenAI's board
  • OPEA 0.7 Community Event
  • Issues with Anole across multiple GPUs
  • Graph-based captioning paper

Links mentioned:


LAION ▷ #research (15 messages🔥):

  • Complex-valued architectures for vision
  • 2D DFT in token mixing
  • Grad issues in deeper networks
  • Model scaling issues
  • Handling complex values in models

LangChain AI ▷ #general (13 messages🔥):

  • LangChain's ConversationSummaryMemory
  • Building Agents and subagents in LangGraph
  • Chroma's retrieval issues
  • Integrating algorithms within LangChain chatbot
  • Adding costs to LangSmith using traceable decorator

Link mentioned: International Chatting group 💕: WhatsApp Group Invite


LangChain AI ▷ #share-your-work (2 messages):

  • Using AI in coding
  • Automating tasks with AI
  • Building applications with AI
  • AI for learning and teaching
  • Knowledge management with AI

Link mentioned: How I Use AI : The jobs for which I have been using AI as a solopreneur.


LangChain AI ▷ #tutorials (3 messages):

  • Knowledge Graphs Workshop
  • Video Game Sales Case Study
  • LangChain Use Cases

Link mentioned: The Future of AI: Leveraging Knowledge Graphs for Advanced RAG: Get ready to dive into the world of natural language querying with Langchain and Neo4j! Learn how to interact with graph databases using cypher query languag...


Cohere ▷ #general (7 messages):

  • Introductions from different countries

OpenInterpreter ▷ #general (1 messages):

  • Llama3 code generation issues
  • Alternative LLM suggestions

OpenInterpreter ▷ #O1 (3 messages):

  • LLM service flag issue
  • Documentation update
  • Profile usage workaround

OpenInterpreter ▷ #ai-content (1 messages):

  • Open Interpreter
  • Mozilla Discord Event

tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

  • Tinygrad Error Handling
  • Gradient Settings in Tinygrad
  • NV Accelerator Clarification

Link mentioned: nvdla: NVDLA Open Source Project. nvdla has 17 repositories available. Follow their code on GitHub.


MLOps @Chipro ▷ #events (2 messages):

  • KAN authors response on arXiv paper
  • Judging team participation inquiry

Link mentioned: alphaXiv: no description found


MLOps @Chipro ▷ #general-ml (1 messages):

  • Image Segmentation Issues
  • Hermes 2
  • Mistral struggles
  • Model Merging
  • Open Empathic

OpenAccess AI Collective (axolotl) ▷ #general (2 messages):

  • Multi-token prediction
  • HF implementation

OpenAccess AI Collective (axolotl) ▷ #general-help (1 messages):

  • DPO Fine-Tune Issue
  • Multi-GPU Data Processing Error
  • dataset_prepared_path Workaround
  • RunPod FFT Crash

AI Stack Devs (Yoko Li) ▷ #app-showcase (2 messages):

  • Continued Idea Development
  • Positive Feedback

LLM Finetuning (Hamel + Dan) ▷ #predibase (1 messages):

  • Credits expired
  • Extension request






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}