Frozen AI News archive

AIPhone 16: the Visual Intelligence Phone

**Apple** announced the new **iPhone 16** lineup featuring **Visual Intelligence**, a new AI capability integrated with Camera Control, Apple Maps, and Siri, emphasizing privacy and default service use over third-party AI like OpenAI. **Apple Photos** now includes advanced video understanding with timestamp recognition. Meanwhile, **Reflection-70B** claims to be a top open-source model but benchmarks show it performs close to **Llama 3 70B** and slightly worse than **Qwen 2 72B**. **Yann LeCun** highlighted ongoing challenges with LLM planning abilities, noting models like **Llama-3.1-405b** and **Claude** show some skill, while **GPT-4** and **Gemini** lag behind. **Weights & Biases** is sponsoring an event to advance LLM evaluation techniques with prizes and API access.

Canonical issue URL

AI News for 9/6/2024-9/9/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (215 channels, and 7493 messages) for you. Estimated reading time saved (at 200wpm): 774 minutes. You can now tag @smol_ai for AINews discussions!

At the special Apple Event today, the new iPhone 16 lineup was announced, together with 5 minutes spent covering some updates on Apple Intelligence (we'll assume you are up to speed on our WWDC and Beta release coverage).

image.png

The newest update is what they now call Visual Intelligence, rolling out with the new dedicated Camera Control button for iPhone 16:

image.png

As discussed on the Winds of AI Winter pod and now confirmed, Apple is commoditizing OpenAI and putting its own services first:

image.png

Presumably one will eventually be able to configure what the Ask and Search buttons call in the new UI, but every Visual Intelligence request will run through Apple Maps and Siri first and those services second. Apple wins here by running first, being default, and being private/free, which is surprisingly a more defensible position than being "best".

Apple Photos now also have very good video understanding, down to the timestamps in a video:

image.png

Craig Federighi called this a part of Apple Intelligence in his segment, but some of these features are already in the iOS 18.0 beta (Apple Intelligence only shipped in iOS 18.1).

You can read the Hacker News commentary for other highlights and cynical takes but that's the big must-know thing from today.

How many years until Apple Visual Intelligence is just... always on?

image.png


A Note on Reflection 70B: our coverage last week (and tweet op-ed) covered known criticisms on Friday, but more emerged over the weekend to challenge their claims. We expect more developments over the course of this week, therefore it is premature to make it another title story, but interested readers should scroll to the /r/localLlama section below for a full accounting.

Perhaps we should work on more ungameable LLM evals? Good thing this month's inference is supported by our friends at W&B...


Sponsored by Weights & Biases: If you’re a builder in the Bay Area Sep 21/22, Weights & Biases invites you to hack with them on pushing the state of LLM-evaluators forward. Build better LLM Judges at the W&B Judgement Day hack - $5k in prizes, API access and food provided.

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Developments and Benchmarks

AI Tools and Applications

AI Research and Developments

AI Ethics and Societal Impact

Hardware and Infrastructure


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Reflection 70B Controversy: Potential API Fraud and Community Backlash

Theme 2. Community Lessons from Reflection 70B Incident: Trust and Verification in AI

Theme 3. Memes and Humor Surrounding Reflection 70B Controversy

Theme 4. Advancements in Open-Source AI Models and Tools

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Developments and Releases

AI Research and Applications

AI Development Tools and Visualization

AI Ethics and Societal Impact

AI Industry and Market Trends

Memes and Humor


AI Discord Recap

A summary of Summaries of Summaries GPT4O (gpt-4o-2024-05-13)

1. AI Model Performance

2. AI Tools and Integrations

3. Open Source AI Developments

4. Benchmarking and Evaluation

5. AI Community Events


PART 1: High level Discord summaries

HuggingFace Discord


aider (Paul Gauthier) Discord


OpenRouter (Alex Atallah) Discord


Stability.ai (Stable Diffusion) Discord


LM Studio Discord


Perplexity AI Discord


Cohere Discord


Nous Research AI Discord


CUDA MODE Discord


OpenAI Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


Latent Space Discord


OpenInterpreter Discord


LlamaIndex Discord


Torchtune Discord


LangChain AI Discord


OpenAccess AI Collective (axolotl) Discord


LAION Discord


DSPy Discord


tinygrad (George Hotz) Discord


Gorilla LLM (Berkeley Function Calling) Discord


LLM Finetuning (Hamel + Dan) Discord


Alignment Lab AI Discord


MLOps @Chipro Discord


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #general (930 messages🔥🔥🔥):

  • Hugging Face Inference API Issues
  • Model Fine-Tuning Experiences
  • AI Art and Prompting Challenges
  • Q&A on LLM Features and Usage

Links mentioned:


HuggingFace ▷ #today-im-learning (9 messages🔥):

  • Latch-up effect in CMOS microcircuits
  • Deploying uncensored models to SageMaker
  • Daily learning progress forum

HuggingFace ▷ #cool-finds (11 messages🔥):

  • Medical AI Research Updates
  • AlphaProteo Protein Prediction Model
  • Medical LLMs Applications
  • ML Training Visualization Tools
  • Exploring Medical Literature

Links mentioned:


HuggingFace ▷ #i-made-this (51 messages🔥):

  • PowershAI Features
  • GraphRAG Utilization
  • Om LLM Architecture
  • FLUX.1 [dev] Model Release
  • OCR Correction Techniques

Links mentioned:


HuggingFace ▷ #reading-group (6 messages):

  • Universal Approximation Theorem
  • Uncensored Models
  • Model Definitions
  • Leshno's Theorem
  • HuggingFace Models

Links mentioned:


HuggingFace ▷ #computer-vision (8 messages🔥):

  • Community Computer Vision Course
  • Stanford CS231n Course
  • Imgcap CLI Tool
  • Face Recognition Datasets
  • Data Training Methods with CSV Files

Links mentioned:


HuggingFace ▷ #NLP (3 messages):

  • HF Trainer confusion matrix
  • RAG-based retrieval evaluation

HuggingFace ▷ #diffusion-discussions (2 messages):

  • Transformer2DModel
  • DiT

aider (Paul Gauthier) ▷ #general (687 messages🔥🔥🔥):

  • DeepSeek and Aider Performance
  • AI Development Concerns
  • Aider Workflow Strategies
  • Using a Config File for Aider
  • Conventions and Prompt Engineering

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (193 messages🔥🔥):

  • Aider Chat Functionality
  • Model Performance Comparisons
  • Git Integration Features
  • Language Output Behavior
  • Using Aider with Conventions

Links mentioned:


aider (Paul Gauthier) ▷ #links (14 messages🔥):

  • Reflection 70B vs Llama3 70B
  • V0 updates and applications
  • Zed's GitHub discussions
  • YouTube AI coding videos

Links mentioned:


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

  • Reflection API
  • Reflection-Tuning Technique
  • Self-Correcting AI Models

Links mentioned:


OpenRouter (Alex Atallah) ▷ #app-showcase (10 messages🔥):

  • ISO20022
  • Bitcoin and CBDCs
  • cli_buddy GitHub project
  • Open Source Multi-lingual Model
  • OpenRouter Usage

Link mentioned: GitHub - rezmeplxrf/cli_buddy: Contribute to rezmeplxrf/cli_buddy development by creating an account on GitHub.


OpenRouter (Alex Atallah) ▷ #general (611 messages🔥🔥🔥):

  • DeepSeek Coder
  • Reflection Model
  • OpenRouter API Issues
  • Gemini Models
  • Multi-Modal Models

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (11 messages🔥):

  • Vertex AI Key Compatibility
  • JSON Formatting Issues
  • Google AI Studio Usage
  • Base64 Encoding Workaround

Link mentioned: Add Vertex AI support by u-minor · Pull Request #45 · saoudrizwan/claude-dev: This PR adds support for Vertex AI in Google Cloud. At this time, the Application Default Credentials (ADC) must be set in the gcloud command to use Vertex AI. Authentication supports one of the fo...


Stability.ai (Stable Diffusion) ▷ #general-chat (592 messages🔥🔥🔥):

  • AI model training methods
  • GPU recommendations for image generation
  • Stable Diffusion models comparison
  • Influencer culture and content creation
  • Using detail enhancing LoRAs

Links mentioned:


LM Studio ▷ #general (402 messages🔥🔥):

  • LM Studio Updates
  • Model Performance and Settings
  • Training Language Models
  • User Experience with LM Studio
  • Server Interaction and API Requests

Links mentioned:


LM Studio ▷ #hardware-discussion (83 messages🔥🔥):

  • LM Studio and VOSK
  • Intel A770 Performance
  • NVIDIA Caution with VRAM
  • Reflection-Llama-3.1 Issues
  • Apple's Upcoming Hardware

Links mentioned:


Perplexity AI ▷ #general (334 messages🔥🔥):

  • Perplexity Subscription Issues
  • Promo Code Leak Controversy
  • Model Usage Limits
  • Web Scraping by LLMs
  • Technical Issues with Perplexity

Links mentioned:


Perplexity AI ▷ #sharing (49 messages🔥):

  • One Piece Documentation
  • AI Services
  • Carbon Capture Technologies
  • Kung Pao Chicken Recipe
  • AI Tutors Engagement

Link mentioned: YouTube: no description found


Perplexity AI ▷ #pplx-api (13 messages🔥):

  • API response length
  • API access issues
  • Payment method errors
  • Model deprecation
  • Search domain filter

Link mentioned: no title found: no description found


Cohere ▷ #discussions (334 messages🔥🔥):

  • Cohere tech
  • Haircuts and styles
  • Role of bots in moderation
  • AI scams and crypto
  • Multimodal models and projects

Links mentioned:


Cohere ▷ #questions (25 messages🔥):

  • Recruiting Team Contact
  • Use of Cohere Products
  • MrDragonFox's Presence
  • Embed vs Embed Jobs

Link mentioned: Cookbooks — Cohere: no description found


Cohere ▷ #api-discussions (20 messages🔥):

  • Configuring Output Lengths
  • Search Query Costs
  • Using Calendar Agent
  • Invalid Raw Prompt Error
  • Chat Turns in API

Link mentioned: Calendar Agent with Native Multi Step Tool — Cohere: This page describes how to use cohere Chat API with list_calendar_events and create_calendar_event tools to book appointments.


Cohere ▷ #projects (13 messages🔥):

  • LLM Web App Launch
  • Streamlit Hosting Plans
  • Langchain Integration
  • Admin Access Concern

Links mentioned:


Nous Research AI ▷ #general (199 messages🔥🔥):

  • Reflection 70B Performance
  • Upcoming AI Models
  • Nous Forge Presentation
  • Benchmark Evaluations
  • AI Model Mislabeling

Links mentioned:


Nous Research AI ▷ #ask-about-llms (7 messages):

  • DeepSeek v2.5 Performance
  • LLM for Book and Movie Queries
  • FaceNet for One-Shot Recognition
  • Hermes Nemo Release Date
  • Anything LLM Interest

Nous Research AI ▷ #research-papers (2 messages):

  • Medical LLMs
  • Continual In-Context Learning
  • Frameworks for Medical AI
  • LLM Digital Twins

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): Last Week in Medical AI: Top Research Papers/Models 🏅(September 1 - September 7, 2024) Medical LLM & Other Models : - CancerLLM: Large Language Model in Cancer Domain - MedUnA: Vision-Languag...


Nous Research AI ▷ #interesting-links (19 messages🔥):

  • PlanSearch introduces diverse LLM outputs
  • RedTeam Arena launches with gamification
  • Reflection 70b model capabilities
  • Insights on AI research fraud
  • Itext2kg as a knowledge graph tool

Links mentioned:


Nous Research AI ▷ #research-papers (2 messages):

  • Medical LLM advancements
  • Continual In-Context Learning
  • Transformer architecture
  • Robotic Endoscopic Surgery
  • Decentralized Health Intelligence

Link mentioned: Tweet from Open Life Science AI (@OpenlifesciAI): Last Week in Medical AI: Top Research Papers/Models 🏅(September 1 - September 7, 2024) Medical LLM & Other Models : - CancerLLM: Large Language Model in Cancer Domain - MedUnA: Vision-Languag...


Nous Research AI ▷ #reasoning-tasks (2 messages):

  • AGI through RL
  • Transformers and SSI
  • Importance of Scaling
  • Breakthroughs Needed in AI

CUDA MODE ▷ #general (16 messages🔥):

  • Together AI's MLP Kernels
  • ROCm/AMD vs. NVIDIA
  • RTX 5XXX Architecture Generation
  • Reflection Drama
  • PyTorch on ROCm

Links mentioned:


CUDA MODE ▷ #triton (49 messages🔥):

  • Triton Internals Article
  • FP16 vs BFP16 Performance
  • Kernel Optimization Strategies
  • Quantization Techniques

Link mentioned: BitBLAS/benchmark at main · microsoft/BitBLAS: BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment. - microsoft/BitBLAS


CUDA MODE ▷ #torch (6 messages):

  • Dynamo Call Analysis
  • getitem Performance
  • PyTorch Container Module
  • TorchDynamo Cache Lookup

Link mentioned: pytorch/torch/nn/modules/container.py at 31c4e0d37d8efc37a0697159e5b9121ec34d5141 · pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration - pytorch/pytorch


CUDA MODE ▷ #algorithms (2 messages):

  • Self Promotion in Messages

CUDA MODE ▷ #cool-links (18 messages🔥):

  • Course Lab Notebooks
  • Zen, CUDA, and Tensor Cores
  • VLLM Office Hours
  • AdEMAMix Optimizer
  • Herbie Tool for Numerical Analysis

Links mentioned:


CUDA MODE ▷ #beginner (27 messages🔥):

  • Tensor Core Efficiency
  • WMMA Usage
  • CUDA Kernel Optimization
  • Occupancy in Tensor Cores
  • CUDA Development Templates

Links mentioned:


CUDA MODE ▷ #pmpp-book (2 messages):

  • PMPP Book for Parallel Computing
  • CUDA Resource Stream on GitHub

Link mentioned: GitHub - cuda-mode/resource-stream: CUDA related news and material links: CUDA related news and material links. Contribute to cuda-mode/resource-stream development by creating an account on GitHub.


CUDA MODE ▷ #torchao (2 messages):

  • Build Fixes
  • GitHub Pull Requests

Link mentioned: Unbreak build after #621 by andrewor14 · Pull Request #826 · pytorch/ao: no description found


CUDA MODE ▷ #off-topic (14 messages🔥):

  • Marathon Experience
  • Injury Recovery
  • CUDA Related Content
  • Spoiler Over Images
  • Hiking Accident

CUDA MODE ▷ #irl-meetup (6 messages):

  • Toronto GPU Programming Meetups
  • Triton Learning
  • Cutlass Interest

CUDA MODE ▷ #triton-puzzles (10 messages🔥):

  • Triton-Puzzles Error Handling
  • Installing Triton-Viz
  • 403 Error on Localhost

Links mentioned:


CUDA MODE ▷ #hqq-mobius (2 messages):

  • HFGenerator
  • Batch Size Support

CUDA MODE ▷ #llmdotc (2 messages):

  • H100 Scaling
  • NCCL Multi-GPU Training

Link mentioned: NCCL only multi-gpu multi-node training without MPI by chinthysl · Pull Request #426 · karpathy/llm.c: Scheduling jobs using Slurm seems much easier in a multi-node training setup compared to setting up MPI for the cluster. This draft contains the changes to use mpirun for single-node training and S...


CUDA MODE ▷ #rocm (1 messages):

  • AMD's UDNA Architecture
  • Deprioritization of High-End Gaming GPUs
  • Transition from GCN to RDNA and CDNA

Link mentioned: AMD announces unified UDNA GPU architecture — bringing RDNA and CDNA together to take on Nvidia's CUDA ecosystem: Two become one.


CUDA MODE ▷ #arm (1 messages):

  • ExecuTorch
  • PyTorch

CUDA MODE ▷ #liger-kernel (19 messages🔥):

  • Liger's Swiglu Kernels vs Together AI Benchmarks
  • Optimizations in cuBLAS and PyTorch Implementations
  • Handling of ignore_index in Cross Entropy
  • Conv2D Performance Issues
  • Benchmarking with Phi3 on A100

Links mentioned:


CUDA MODE ▷ #thunder (4 messages):

  • Thunder channel introduction
  • Triton Matmul example
  • Fusing operations
  • Liger kernel application

Links mentioned:


OpenAI ▷ #ai-discussions (112 messages🔥🔥):

  • Reflection Llama-3.1 updates
  • OpenAI model announcements
  • AI hardware requirements
  • Learning OpenAI API
  • Performance of local models

Links mentioned:


OpenAI ▷ #gpt-4-discussions (7 messages):

  • GPT handling books
  • Voice access rollout

OpenAI ▷ #prompt-engineering (30 messages🔥):

  • AI Reasoning Breakdown
  • Prompt Engineering Insights
  • Stock Market Prompt Use Cases
  • Different Response Styles
  • Prompt Library Channel Location

OpenAI ▷ #api-discussions (30 messages🔥):

  • AI reasoning breakdown
  • Response variation in AI
  • API discussion and prompts
  • Stock history analysis with AI
  • Judging interestingness with AI

Modular (Mojo 🔥) ▷ #general (80 messages🔥🔥):

  • Integrating C and Mojo
  • LLVM Developer Meeting Insights
  • Subprocess Implementation in Mojo
  • Mojo Community Meeting Transition
  • Hash Functions Presentation

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (96 messages🔥🔥):

  • DType as Dict key
  • Multiple-precision integer arithmetic
  • Mojo hardware access drivers
  • Variant type usage
  • Creating bindings for GStreamer

Links mentioned:


Eleuther ▷ #general (124 messages🔥🔥):

  • DeepMind's Transition
  • Quora Data Scraping
  • Continual In-Context Learning
  • Adaptive Transformers
  • AI Hackathons

Links mentioned:


Eleuther ▷ #research (20 messages🔥):

  • Cosine Similarity of Gradients
  • Laplace Approximation in Bayesian Deep Learning
  • Weight Decay and Orthogonal Regularization
  • Prior in Bayesian Approaches
  • Training Dynamics and Phase Changes

Links mentioned:


Eleuther ▷ #scaling-laws (13 messages🔥):

  • Power Law Curves in ML
  • Self-Organized Criticality
  • Scaling Laws in Statistical Estimation
  • Sandpile Avalanche Model
  • Critical Systems and Fluctuations

Link mentioned: Per Bak: How Nature Works: The Science of Self-Organised Criticality: no description found


Eleuther ▷ #interpretability-general (12 messages🔥):

  • Layer Responsibilities in Models
  • Graph Cluster Detection Probability
  • Residual Stream Differences
  • SAE Latent Activation Variations
  • Communication Network Protection

Links mentioned:


Eleuther ▷ #lm-thunderdome (5 messages):

  • Generate Until Tasks Bug
  • TurkishMMLU Release
  • Community Feedback on Changes

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (144 messages🔥🔥):

  • Reflection API issues
  • Incompetence in AI model releases
  • Automated AI research
  • Evaluation of LLMs
  • Hugging Face community response

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (3 messages):

  • GPT Next
  • KDDI Summit Presentation

Link mentioned: OpenAI clarifies: No, "GPT Next" isn't a new model.: Confusion from a presentation got OpenAI fans in a tizzy.


Interconnects (Nathan Lambert) ▷ #random (12 messages🔥):

  • OpenAI team dynamics
  • Google's recent activity
  • System prompts focus

Interconnects (Nathan Lambert) ▷ #posts (2 messages):

  • Internal bureaucracy at Google
  • Challenges of scaling within large organizations

Latent Space ▷ #ai-general-chat (47 messages🔥):

  • AI Codex for Cursor
  • Reflection API
  • Apple Intelligence Updates
  • Gemini Enum Mode
  • Photorealistic LoRA Model

Links mentioned:


Latent Space ▷ #ai-in-action-club (76 messages🔥🔥):

  • Open Source AI Code Editors
  • Collaboration Tools
  • Error Handling in Code
  • Fine Tuning with Loras
  • Zed VS Cursor

Links mentioned:


OpenInterpreter ▷ #general (38 messages🔥):

  • OpenInterpreter Performance
  • AI Skills on OpenInterpreter
  • 01 iOS App Features
  • Using OpenInterpreter with LLMs
  • Connecting with Venture Capitalists

Links mentioned:


OpenInterpreter ▷ #O1 (54 messages🔥):

  • Torch installation issues
  • 01 Light discontinuation
  • Refund process for 01
  • 01 app launch details
  • Using OpenInterpreter

Links mentioned:


OpenInterpreter ▷ #ai-content (5 messages):

  • Scriptomatic with open source models
  • Instructor Python library

Link mentioned: instructor: structured outputs for llm


LlamaIndex ▷ #blog (9 messages🔥):

  • Agentic System Deployment
  • Running Reflection 70B
  • Advanced RAG Pipelines
  • Automating Financial Analysis
  • Dynamic ETL for RAG

LlamaIndex ▷ #general (51 messages🔥):

  • Cohere Reranker
  • LlamaIndex Node Postprocessors
  • Llama Parse Service Status
  • LlamaIndex Structured Outputs
  • Using Llama 3 with LlamaIndex

Links mentioned:


Torchtune ▷ #general (25 messages🔥):

  • Gemma model configuration
  • Support for gemma 2
  • PR for torchtune adjustments
  • Tokenizer eos problem

Links mentioned:


Torchtune ▷ #dev (32 messages🔥):

  • Compiling Generation Methods
  • Cache Handling During Generation
  • Handling Non-Contiguous Inputs
  • Tensor.is_inference() Method Proposal
  • Proposed Implementation of Chunked Linear + CE

Links mentioned:


LangChain AI ▷ #general (41 messages🔥):

  • Decoding .astream_events()
  • Gradio Upload Limitations
  • LangChain Azure Integration
  • Data Set Creation Strategies
  • Audio Transcription with Claude

LangChain AI ▷ #share-your-work (9 messages🔥):

  • VAKX platform
  • Selenium and GPT-4 vision integration
  • AI Reddit Manager tool
  • Mocking LLM embedder
  • RAG chatbot using OpenAI and LangChain

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (33 messages🔥):

  • Overfitting in Models
  • Benchmark Limitations
  • Scam in AI Tool
  • RAG APIs

OpenAccess AI Collective (axolotl) ▷ #general-help (2 messages):

  • H100 loading support
  • 8-bit model loading

LAION ▷ #general (21 messages🔥):

  • Factory Network x Tech: Berlin AI Hackathon
  • Finegrain Object Cutter
  • Concrete ML and Homomorphic Encryption
  • Open Source AI Event by GitHub

Links mentioned:


LAION ▷ #research (9 messages🔥):

  • Multimodality in LLMs
  • Reflection-70B Performance Claims
  • AI Scams and Fraud
  • Tool Augmented Generation

Links mentioned:


LAION ▷ #paper-discussion (1 messages):

erkinalp: https://arxiv.org/abs/2408.06292


DSPy ▷ #show-and-tell (2 messages):

  • LanceDB Integration
  • Pull Request for dspy
  • GitHub Review Process

Link mentioned: Lancedb Integration by PrashantDixit0 · Pull Request #1444 · stanfordnlp/dspy: This PR adds LanceDB as a retriever to handle large datasets.


DSPy ▷ #general (26 messages🔥):

  • Deprecation of GPT-3.5
  • MIPROv2 Error
  • Finetuning LLMs
  • CookLangFormatter Issues
  • Retrieval Models in DSPy

Links mentioned:


tinygrad (George Hotz) ▷ #general (6 messages):

  • WebGPU PR #6304
  • WGPU buffer limit increase
  • Dependency issues with Rubicon ObjC
  • Time zone change announcement

Link mentioned: bring back webgpu [run_process_replay] by geohot · Pull Request #6304 · tinygrad/tinygrad: This works on Asahi Linux!


tinygrad (George Hotz) ▷ #learn-tinygrad (17 messages🔥):

  • Multi-GPU Tensor Issues
  • PTX Compilation Time for Tinygrad
  • GGUF PRs Status
  • Const with dtype uchar
  • Model Performance with Sharding

Link mentioned: tinygrad/examples/mlperf/training_submission_v4.1/tinycorp/benchmarks/bert/implementations/tinybox_green/run_and_time.sh at 22e33795785f6c72449480e380ffdc213b5c7bbc · tinygrad/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️ - tinygrad/tinygrad


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (10 messages🔥):

  • xLAM System Prompt Differences
  • Function Calling Documentation for LLaMA
  • Merge Conflicts in GitHub Pull Requests
  • Model Evaluation with VLLM
  • Hammer-7b Handler Pull Request

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #general (2 messages):

  • 4090 GPU capabilities
  • Hybrid search with Milvus
  • Embedding models
  • Reranking metadata

Link mentioned: pymilvus/examples/hello_hybrid_sparse_dense.py at master · milvus-io/pymilvus: Python SDK for Milvus. Contribute to milvus-io/pymilvus development by creating an account on GitHub.


Alignment Lab AI ▷ #general (1 messages):

  • RAG based retrieval
  • Evaluation metrics for RAG
  • Comparative analysis of RAG vs other LLMs

MLOps @Chipro ▷ #events (1 messages):

  • Open Source AI
  • GitHub Panel Event
  • Panelists

Link mentioned: GitHub Presents: Open Source AI - Access, Democratization, and Responsibility · Luma: AI is rapidly transforming industries from software development, content creation, agentic workflows and beyond. Central to this transformation is open source…




{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}