Frozen AI News archive

Gemini Live

**Google** launched **Gemini Live** on Android for **Gemini Advanced** subscribers during the Pixel 9 event, featuring integrations with Google Workspace apps and other Google services. The rollout began on 8/12/2024, with iOS support planned. **Cosine** released **Genie**, an AI software engineering system achieving a **57%** improvement on SWE-Bench. **TII** introduced **Falcon Mamba**, a 7B attention-free open-access model scalable to long sequences. Benchmarking showed that longer context lengths do not always improve Retrieval-Augmented Generation. **Supabase** launched an AI-powered Postgres service dubbed the "ChatGPT of databases," fully open source. **Perplexity AI** partnered with Polymarket to integrate real-time probability predictions into search results. A tutorial demonstrated a multimodal recipe recommender using **Qdrant**, **LlamaIndex**, and **Gemini**. An OpenAI engineer shared success tips emphasizing debugging and hard work. The connection between matrices and graphs in linear algebra was highlighted for insights into nonnegative matrices and strongly connected components. **Keras 3.5.0** was released with Hugging Face Hub integration for model saving and loading.


AI News for 8/12/2024-8/13/2024. We checked 7 subreddits, 384 Twitters and 29 Discords (253 channels, and 2423 messages) for you. Estimated reading time saved (at 200wpm): 244 minutes. You can now tag @smol_ai for AINews discussions!

As promised at Google I/O, Gemini Live launched on Android today for Gemini Advanced subscribers, as part of the #MadeByGoogle Pixel 9 launch event. With sympathies to the poor presenter who had 2 demo failures onstage.

The embargoed media reviews of Gemini Live have been cautiously positive. It will have "extensions" that are integrations with your Google Workspace (Gmail, Docs, Drive), YouTube, Google Maps, and other Google properties.

The important thing is that Google started the rollout today (though as of 5pm PT we still cannot locate anyone with a live recording of it), whereas ChatGPT's Advanced Voice Mode still has no firm release date. Gemini Live will also come to iOS subscribers at a future point.

The company also demoed Gemini Live with Pixel Buds Pro 2 to people in the audience and to the WSJ. For those who care about the Pixel 9, there are also notable image AI integrations with the Add Me photo feature and the Magic Editor.

https://www.youtube.com/watch?v=KoN_bcDmhR4


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Model Developments and Benchmarks

AI Tools and Applications

AI Engineering Insights

AI Ethics and Regulation

AI Community and Events

Memes and Humor

This summary captures the main themes and discussions from the provided tweets, focusing on recent developments in AI models, tools, applications, and the broader implications for AI engineering and the tech industry.


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Advanced Quantization and Model Optimization Techniques

Theme 2. Open-source Contributions to LLM Development

All AI Reddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Releases and Capabilities

AI-Generated Media

Autonomous Vehicles

AI and Society


AI Discord Recap

A summary of Summaries of Summaries by GPT4O (gpt-4o-2024-05-13)

1. Model Performance and Benchmarking

2. GPU and Hardware Discussions

3. Fine-tuning and Optimization Techniques

4. UI/UX Issues in AI Platforms

5. Open-Source AI Frameworks and Community Efforts


PART 1: High level Discord summaries

Unsloth AI (Daniel Han) Discord


CUDA MODE Discord


LM Studio Discord


OpenAI Discord


Perplexity AI Discord


Stability.ai (Stable Diffusion) Discord


OpenRouter (Alex Atallah) Discord


Modular (Mojo 🔥) Discord


Cohere Discord


Torchtune Discord


OpenAccess AI Collective (axolotl) Discord


LAION Discord


tinygrad (George Hotz) Discord


MLOps @Chipro Discord


LangChain AI Discord


OpenInterpreter Discord


Alignment Lab AI Discord


LLM Finetuning (Hamel + Dan) Discord


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Unsloth AI (Daniel Han) ▷ #general (167 messages🔥🔥):

  • Unsloth Pro
  • GPU choices
  • LLM Leaderboard results
  • Dolphin Model
  • Model fine-tuning



Unsloth AI (Daniel Han) ▷ #off-topic (12 messages🔥):

  • Camping
  • Australia

Link mentioned: Cosine Genie - SOTA AI Engineer Announcement: Genie is the best AI software engineer in the world by far - scoring 30% on the industry standard benchmark SWE-Bench we have beaten the previous SOTA scores...


Unsloth AI (Daniel Han) ▷ #help (83 messages🔥🔥):

  • Unsloth model loading/saving
  • Llama 3.1 fine-tuning with Hindi
  • Model merging and HF hub
  • Unsloth with VLLM
  • Dataset creation



Unsloth AI (Daniel Han) ▷ #showcase (8 messages🔥):

  • Lexi model
  • LLM Leaderboard 2
  • Ahma-3B Instruct
  • Finnish-NLP/Ahma-3B
  • Finnish language model



Unsloth AI (Daniel Han) ▷ #research (5 messages):

  • 1.5-Pints
  • Tree Attention
  • Mistral
  • Llama 2
  • OpenELM

Link mentioned: 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data: This paper presents a compute-efficient approach to pre-training a Language Model-the "1.5-Pints"-in only 9 days, while outperforming state-of-the-art models as an instruction-following assist...


CUDA MODE ▷ #general (48 messages🔥):

  • TorchAO
  • CUDA developer hiring
  • CPU matmul optimization
  • CPU matmul performance
  • FP16/BF16 weights



CUDA MODE ▷ #torch (4 messages):

  • PyTorch Full FP16
  • PyTorch Optimizer
  • torch.compile
  • Fairseq Fine-tuning



CUDA MODE ▷ #cool-links (7 messages):

  • Rust GPU
  • Zig
  • Fal Research Grants
  • Open Source Support



CUDA MODE ▷ #jobs (11 messages🔥):

  • CUDA Developers
  • CUDA Freshers
  • CUDA Hiring
  • CUDA Engineer
  • Triton

CUDA MODE ▷ #beginner (7 messages):

  • Multithreading and GPU Use
  • Network Requests and GPUs
  • Magnum IO Architecture

Link mentioned: Accelerating IO in the Modern Data Center: Magnum IO Architecture | NVIDIA Technical Blog: This is the first post in the Accelerating IO series, which describes the architecture, components, storage, and benefits of Magnum IO, the IO subsystem of the modern data center.


CUDA MODE ▷ #off-topic (1 message):

iron_bound: https://www.youtube.com/watch?v=aNAtbYSxzuA


CUDA MODE ▷ #llmdotc (126 messages🔥🔥):

  • cuDNN stability
  • HuggingFace Llama 3 AutoTokenizer issues
  • Curand GPU weight initialization
  • copy_and_cast_kernel
  • cudaMallocAsync/cudaFreeAsync



LM Studio ▷ #general (150 messages🔥🔥):

  • Vision Adapters
  • Model Merging
  • Mistral Large
  • GPT-4o Mini
  • LLM Studio Headless



LM Studio ▷ #hardware-discussion (15 messages🔥):

  • Portable LLM inference
  • Apple Mac
  • GPU modding
  • Copper modding
  • Flashing NVIDIA BIOS

OpenAI ▷ #ai-discussions (151 messages🔥🔥):

  • Gemini Live
  • Google Fi
  • Strawberries
  • Project Astra
  • LLMs



OpenAI ▷ #gpt-4-discussions (5 messages):

  • Prompt Library
  • System Prompt in LangChain

OpenAI ▷ #prompt-engineering (3 messages):

  • ChatGPT website access

OpenAI ▷ #api-discussions (3 messages):

  • ChatGPT accessing websites
  • ChatGPT's hallucination and web crawling

Perplexity AI ▷ #general (106 messages🔥🔥):

  • Perplexity bug reports
  • Perplexity Pro Models
  • Perplexity's UI/UX
  • Perplexity's website stability
  • Perplexity's future



Perplexity AI ▷ #sharing (9 messages🔥):

  • Coursera
  • Programming Courses
  • AI/ML
  • Cloud Computing
  • Data Science



Perplexity AI ▷ #pplx-api (6 messages):

  • Perplexity Search Parameters
  • Search Location Options
  • Image Generation from Narrative

Stability.ai (Stable Diffusion) ▷ #announcements (1 message):

  • SXSW Panel
  • OpenAI Models
  • AI Risks and Opportunities
  • Government Regulation
  • AI Impact

Link mentioned: PanelPicker | SXSW Conference & Festivals: PanelPicker® is the official SXSW user-generated session proposal platform. Enter ideas and vote to help shape Conference programming for SXSW and SXSW EDU.


Stability.ai (Stable Diffusion) ▷ #general-chat (111 messages🔥🔥):

  • Google Colab Runtime
  • Stable Diffusion Installation
  • Stable Diffusion Model Merging
  • CUDA Installation
  • Flux Realism



OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

  • Gemini Flash 1.5
  • GPT-4o Extended
  • OpenRouter Pricing



OpenRouter (Alex Atallah) ▷ #general (80 messages🔥🔥):

  • Gemini Flash Price Updates
  • GCP Cost Table
  • Token:Character Ratio
  • Euryale 70B Downtime
  • Infermatic Downtime



Modular (Mojo 🔥) ▷ #general (30 messages🔥):

  • Mojo Licensing Concerns
  • Mojo Open-Sourcing
  • Mojo Development
  • Mojo Learning Resources
  • Mojo Compiler



Modular (Mojo 🔥) ▷ #mojo (19 messages🔥):

  • Java by Microsoft
  • C# relevance
  • Stable Diffusion Memory Issue
  • WSL2 limitations
  • Mojo Optimization

Cohere ▷ #discussions (11 messages🔥):

  • Cohere For AI
  • Pricing changes
  • Cohere's Research Lab
  • Hackathons
  • Computer Vision

Link mentioned: Cohere For AI (C4AI): Cohere For AI is a non-profit research lab that seeks to solve complex machine learning problems. We support fundamental research that explores the unknown, and are focused on creating more points of ...


Cohere ▷ #questions (26 messages🔥):

  • JSONL Upload Issue
  • Azure JSON Formatting
  • Rerank Overview
  • Cohere API Usage
  • Python Kernel Restart

Link mentioned: Rerank - Cohere API References: This endpoint takes in a query and a list of texts and produces an ordered array with each text assigned a relevance score.


Cohere ▷ #api-discussions (7 messages):

  • JSON Snippet Embeddings
  • Intermediate Text

Torchtune ▷ #dev (44 messages🔥):

  • TransformerDecoderLayer Refactor
  • RLHF with DPO/PPO
  • Torchtune & WandB
  • Torchtune Performance
  • PyTorch Conference



OpenAccess AI Collective (axolotl) ▷ #general (19 messages🔥):

  • Perplexity Pro
  • Llama 3
  • Grad Clipping
  • OpenAI Benchmark

Link mentioned: GitHub - cognitivecomputations/grokadamw: Contribute to cognitivecomputations/grokadamw development by creating an account on GitHub.


OpenAccess AI Collective (axolotl) ▷ #general-help (4 messages):

  • AutoGPTQ
  • Axolotl

OpenAccess AI Collective (axolotl) ▷ #deployment-help (1 message):

  • LLM Inference
  • VLLM
  • SkyPilot
  • Fireworks
  • Lora Adapters

LAION ▷ #general (16 messages🔥):

  • Grok 2.0
  • Flux.1 Model
  • Grok Image Generation
  • Open Source Image Annotation
  • Elon and Models

Link mentioned: Tweet from Nima Owji (@nima_owji): BREAKING: Here's an early look at Grok 2.0 features and abilities! It's better at coding, writing, and generating news! It'll also generate images using the FLUX.1 model!


LAION ▷ #research (4 messages):

  • Position Encoding
  • 2D Pooling

tinygrad (George Hotz) ▷ #general (1 message):

flammit_: no worries - just left hopefully helpful hints on your nvidia FP8 PR


tinygrad (George Hotz) ▷ #learn-tinygrad (8 messages🔥):

  • Tensor Filtering
  • Transcendental Folding Optimization
  • CUDA TIMEOUT ERROR

MLOps @Chipro ▷ #events (3 messages):

  • Poe Previews Hackathon
  • Agihouse Hackathon
  • Poe Platform Announcement
  • In-Chat Generative UI Experiences
  • Discord Channel



MLOps @Chipro ▷ #general-ml (4 messages):

  • Virtual Try On
  • Image Feature Extraction
  • Model Size

LangChain AI ▷ #general (5 messages):

  • Llama 3.1 8b structured output
  • RAG on technical documents with images
  • Next.js and FastAPI interaction
  • AWS pip install issues

LangChain AI ▷ #share-your-work (1 message):

  • Profundo
  • Profundo use cases
  • Profundo AI
  • Profundo product hunt
  • Profundo benefits

Link mentioned: Profundo | Research Redefined: Profundo is a research platform that allows you to conduct research in a way that is more efficient and effective than ever before.


OpenInterpreter ▷ #general (1 message):

  • AI Agents in Enterprises
  • Monitoring and Governance of AI Agents

OpenInterpreter ▷ #O1 (2 messages):

  • Screenless personal tutor for kids

OpenInterpreter ▷ #ai-content (3 messages):

  • Open Interpreter in Obsidian
  • Convert Anything Tool



Alignment Lab AI ▷ #general (1 message):

  • SlimOrca without deduplication
  • Fine-tuning (FT) with deduplication

LLM Finetuning (Hamel + Dan) ▷ #general (1 message):

  • Agentic System for Jupyter Notebook Automation



{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}