Frozen AI News archive

GitHub Copilot Strikes Back

**GitHub's tenth annual Universe conference** introduced the **Multi-model Copilot** featuring **Anthropic's Claude 3.5 Sonnet**, **Google's Gemini 1.5 Pro**, and **OpenAI's o1-preview** models in a new picker UI, allowing developers to choose from multiple companies' models. The event also showcased **GitHub Spark**, an AI-native tool for building natural language applications with deployment-free hosting and integrated model prompting. Additionally, GitHub updated its Copilot Workspace with new agents and security Autofix features. **Weights & Biases** launched Weave with multimodal observability supporting audio, text, and images, integrating the OpenAI Realtime API. Twitter recaps highlighted **tinygrad's** codebase optimization and discussions on GenAI adoption and **Gemini Flash-8B's** cost efficiency at **$0.0375 per million tokens**.

Canonical issue URL

GitHub may be all you need for AI-Native coding.

AI News for 10/28/2024-10/29/2024. We checked 7 subreddits, 433 Twitters and 32 Discords (231 channels, and 2681 messages) for you. Estimated reading time saved (at 200wpm): 279 minutes. You can now tag @smol_ai for AINews discussions!

GitHub's tenth annual Universe conference was today:

image

And it brought a raft of notable announcements (full blogpost here): mostly being GitHub's takes on popular code AI tools.

  1. Multi-model Copilot: adding Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5 Pro, and OpenAI’s o1-preview in a new model picker UI. Copilot's base model has moved from Codex, GPT3.5, GPT4, 4o, and 4o-mini, but for the first time developers get to choose models from other companies including Google. This was big enough to reach mainstream media today and one can't help but tie this story with reports of a "fraying" Microsoft-OpenAI partnership.

image

Cassidy Williams also demoed the new multi-file editing capability of Copilot and a custom instructions file - analogous to the Composer and .cursorrules features of Cursor.

  1. GitHub Spark: "the AI-native tool to build applications entirely in natural language. Sparks are fully functional micro apps that can integrate AI features and external data sources without requiring any management of cloud resources." basically their v0 and bolt.new and Claude Artifacts competitor, complete with deployment-free hosting, themable design system, persistent data storage, and integrated model prompting.

"Utilizing a creativity feedback loop, users start with an initial prompt, see live previews of their app as it’s built, easily see options for each of their requests, and automatically save versions of each iteration so they can compare versions as they go."

image

The presenters also discussed the latest GitHub Models (now off waitlist), and last year's big launch, Copilot Workspace and Code Reviews (now with 2 more agents: Brainstorm and Build/Repair, to the existing 3 Spec/Plan/Implement agents, with a new VSCode extension) and security Autofix updates.


[This issue brought to you by Weights & Biases]!: Your LLMs aren't just text-only anymore - so why should your observability be?

Weave from Weights & Biases now supports audio tracing alongside text, images and other modalities. With just 3 lines of code, track every input, output and metadata across your multimodal AI stack.

Try it yourself in our interactive Colab notebook!

swyx commentary: this notebook looks short, but IMO the gold is the 19 cells hidden under "Advanced Usage: Realtime Audio API with Weave"! You wouldn't expect a normal LLM Ops product to have updated to support the OpenAI Realtime API so soon, but it looks like the WandB team have been cooking.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3.5 Sonnet, best of 4 runs.

AI Development and Industry Trends

AI Applications and Tools

AI Research and Model Updates

AI Ethics and Societal Impact


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Optimizing LLM Inference on Consumer Hardware

Theme 2. Advancements in Open-Source LLMs for Creative and Uncensored Use Cases

Theme 3. Innovations in LLM Tooling and Infrastructure

Theme 4. Challenges in AI Document Understanding and Real-World Applications

Other AI Subreddit Recap

r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity

AI Model Releases and Capabilities

AI Applications and Demonstrations

AI Policy and Infrastructure

AI Impact and Societal Discussion

Memes and Humor


AI Discord Recap

A summary of Summaries of Summaries by O1-preview

Theme 1: AI Model Releases Shake Up the Scene

Theme 2: AI Tooling Gets a Turbo Boost

Theme 3: AI Privacy and Security Take Center Stage

Theme 4: AI Jobs Abound on New Platforms

Theme 5: AI Community Buzzes with Events and Insights


PART 1: High level Discord summaries

HuggingFace Discord


Unsloth AI (Daniel Han) Discord


Stability.ai (Stable Diffusion) Discord


Nous Research AI Discord


Perplexity AI Discord


Notebook LM Discord Discord


GPU MODE Discord


LM Studio Discord


aider (Paul Gauthier) Discord


OpenAI Discord


Cohere Discord


OpenRouter (Alex Atallah) Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


Interconnects (Nathan Lambert) Discord


OpenInterpreter Discord


Torchtune Discord


DSPy Discord


LlamaIndex Discord


LLM Agents (Berkeley MOOC) Discord


LAION Discord


tinygrad (George Hotz) Discord


LangChain AI Discord


Gorilla LLM (Berkeley Function Calling) Discord


LLM Finetuning (Hamel + Dan) Discord


OpenAccess AI Collective (axolotl) Discord


The Alignment Lab AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Mozilla AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The DiscoResearch Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

HuggingFace ▷ #announcements (1 messages):

  • Clem intro
  • Live Workshop Promotion

HuggingFace ▷ #general (870 messages🔥🔥🔥):

  • TensorFlow frustrations
  • LLM training approaches
  • Learning paths in AI/ML
  • Using Hugging Face API
  • Stopping model overfitting

Links mentioned:


HuggingFace ▷ #today-im-learning (4 messages):

  • Highlighting cool projects
  • New members in the server
  • Community engagement

Link mentioned: Pokemon Pikachu GIF - Pokemon Pikachu Clap - Discover & Share GIFs: Click to view the GIF


HuggingFace ▷ #cool-finds (53 messages🔥):

  • ML and quantum computing
  • Understanding advanced research papers
  • Using AI to enhance paper reading
  • Homomorphic encryption in privacy
  • The impact of attention mechanisms in ML

Links mentioned:


HuggingFace ▷ #i-made-this (209 messages🔥🔥):

  • Hemp Nanosheets
  • Autonomous Science Workflows
  • Custom WordPiece Tokenizer
  • Model Usage Analytics
  • Research Collaboration

Links mentioned:


HuggingFace ▷ #reading-group (1 messages):

  • Discord Event Details

HuggingFace ▷ #computer-vision (8 messages🔥):

  • Swin Transformer v2
  • DINO model for vision
  • Attention masks in vision transformers
  • Fine-tuning molmo VLM

HuggingFace ▷ #NLP (8 messages🔥):

  • LangChain SQL Agent
  • Hugging Face NLP Course Resources
  • LLM Fine-Tuning
  • Research Papers on Modern Models

Link mentioned: langchain/cookbook/LLaMA2_sql_chat.ipynb at master · langchain-ai/langchain: 🦜🔗 Build context-aware reasoning applications. Contribute to langchain-ai/langchain development by creating an account on GitHub.


Unsloth AI (Daniel Han) ▷ #general (170 messages🔥🔥):

  • Unsloth Training Tools
  • FP8 Fine-Tuning
  • Gradio UI for Model Training
  • Cloud GPU Connections
  • Job Opportunities in AI

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (57 messages🔥🔥):

  • Frustrations with School
  • Life After Education
  • Value of Experience
  • Plans Beyond Masters
  • Perceptions of PhDs

Unsloth AI (Daniel Han) ▷ #help (65 messages🔥🔥):

  • Converting HF to GGUF
  • Continued Pretraining
  • Fine-tuning Llama 3.1
  • Unsloth Installation Issues
  • Memory for Neural Networks

Links mentioned:


Unsloth AI (Daniel Han) ▷ #community-collaboration (1 messages):

mrdragonfox: and im sure you have the capital required to make that happen yes ? lol


Unsloth AI (Daniel Han) ▷ #research (1 messages):

  • PyTorch Quantization
  • Optimizer CPU Offload

Link mentioned: ao/torchao/prototype/low_bit_optim at main · pytorch/ao: PyTorch native quantization and sparsity for training and inference - pytorch/ao


Stability.ai (Stable Diffusion) ▷ #announcements (1 messages):

  • Stable Diffusion 3.5 Medium
  • Model performance and hardware compatibility
  • Community feedback implementation
  • Open release and licensing
  • Improvements from Stable Diffusion 3 Medium

Link mentioned: Introducing Stable Diffusion 3.5 — Stability AI: Today we are introducing Stable Diffusion 3.5. This open release includes multiple model variants, including Stable Diffusion 3.5 Large and Stable Diffusion 3.5 Large Turbo, and as of October 29th, St...


Stability.ai (Stable Diffusion) ▷ #general-chat (253 messages🔥🔥):

  • Stable Diffusion 3.5 Medium
  • GPU Performance Comparison
  • Sana Autoencoder
  • ComfyUI vs A1111

Links mentioned:


Nous Research AI ▷ #general (103 messages🔥🔥):

  • AI Newsletter Recommendations
  • Roleplaying AI Character Development
  • Layer Skip Research Release
  • GitHub Copilot Enhancements
  • OAI and Microsoft Relationship Dynamics

Links mentioned:


Nous Research AI ▷ #ask-about-llms (4 messages):

  • Hermes 3 for roleplaying bots
  • Fine-tuning considerations
  • Character.ai mimicry

Link mentioned: piotr25691/llama-3-cat-8b-instruct-v1-gguf · Hugging Face: no description found


Nous Research AI ▷ #research-papers (1 messages):

trre: https://arxiv.org/abs/2410.14157


Nous Research AI ▷ #research-papers (1 messages):

trre: https://arxiv.org/abs/2410.14157


Perplexity AI ▷ #announcements (1 messages):

  • Curators Program
  • Discover Feed Contributions
  • Content Creation
  • Perplexity Engagement

Perplexity AI ▷ #general (99 messages🔥🔥):

  • Grok Model Updates
  • Perplexity Pro Features
  • Coding Issues
  • Integration with GitHub
  • User Experience Feedback

Links mentioned:


Perplexity AI ▷ #sharing (8 messages🔥):

  • Apple Smartwatch Settlement
  • NASA Economic Contributions
  • Neural Impact Studies
  • Red Panda Image Model
  • Advancements in Photonic Computing

Notebook LM Discord ▷ #use-cases (29 messages🔥):

  • BYD's Electric Vehicle Expansion
  • NotebookLM for Staff Resources
  • Podcast Generation Challenges
  • Therapeutic Use of NotebookLM
  • Real-Time Avatar Integration

Links mentioned:


Notebook LM Discord ▷ #general (60 messages🔥🔥):

  • NotebookLM Upload Issues
  • Spanish Podcast Generation
  • Open Source Alternatives to NotebookLM
  • Audio Overview Features
  • Limitations of Notebook Uploads

Links mentioned:


GPU MODE ▷ #general (6 messages):

  • Unsloth Kernels
  • Numerical Precision in Evaluations
  • CUDA/Triton Projects

Link mentioned: GitHub - unslothai/unsloth: Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory: Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory - unslothai/unsloth


GPU MODE ▷ #triton (5 messages):

  • Triton kernel performance
  • BF16 operations in Triton
  • Triton nightly builds issues
  • Triton AST to PTX
  • Merging improvements in Triton

Link mentioned: Wheels · Workflow runs · triton-lang/triton: Development repository for the Triton language and compiler - Wheels · Workflow runs · triton-lang/triton


GPU MODE ▷ #torch (4 messages):

  • H100 speed-up tricks
  • FSDP2 API deprecation
  • TorchAO optimizers

Link mentioned: Issues · pytorch/pytorch): Tensors and Dynamic neural networks in Python with strong GPU acceleration - Issues · pytorch/pytorch


GPU MODE ▷ #cool-links (1 messages):

  • Colossus Supercomputer
  • NVIDIA Spectrum-X
  • xAI Grok Models
  • AI Networking Performance

Link mentioned: NVIDIA Ethernet Networking Accelerates World’s Largest AI Supercomputer, Built by xAI: NVIDIA today announced that xAI’s Colossus supercomputer cluster comprising 100,000 NVIDIA Hopper GPUs in Memphis, Tennessee, achieved this massive scale by using the NVIDIA Spectrum-X™ Ethernet netwo...


GPU MODE ▷ #jobs (6 messages):

  • Cracked Engineers platform
  • Tech jobs newsletter
  • AI startups
  • User feedback
  • Gigachad image

Links mentioned:


GPU MODE ▷ #beginner (14 messages🔥):

  • CUDA Learning Resources
  • Math Prerequisites for PMPP
  • Choosing Hardware for CUDA Development

GPU MODE ▷ #torchao (1 messages):

  • PR Feedback Request
  • Inference Throughput
  • Performance Improvements

Link mentioned: LLM.int8() Refactoring: Part 1 by matthewdouglas · Pull Request #1401 · bitsandbytes-foundation/bitsandbytes: This PR is the initial phase of a set of changes aimed at improving the LLM.int8() implementation. Still in draft at the moment, but since there's a lot here I'm ready to have eyes on ...


GPU MODE ▷ #triton-puzzles (1 messages):

  • Triton Installation
  • Dependencies for Triton Visualization

GPU MODE ▷ #hqq-mobius (37 messages🔥):

  • GEMV optimization without tl.dot
  • Performance comparisons on different GPUs
  • Machete kernel for H100
  • Instruction ordering in Triton
  • Custom operations in Triton kernels

Links mentioned:


GPU MODE ▷ #llmdotc (1 messages):

  • Cracked Engineers Job Platform
  • Weekly Tech Jobs Newsletter
  • AI-Assisted Job Posting

Link mentioned: Tweet from Aleksa Gordić 🍿🤖 (@gordic_aleksa): [🚀] Super excited to share this: I've built a job platform for technical roles called "Cracked Engineers". :) If you want to land a job with some of the world's best AI/tech startups ...


GPU MODE ▷ #rocm (6 messages):

  • Composable Kernel
  • MFMA bank conflicts
  • MI250x performance

GPU MODE ▷ #liger-kernel (1 messages):

0x000ff4: https://github.com/linkedin/Liger-Kernel/pull/321


GPU MODE ▷ #thunderkittens (1 messages):

  • ThunderKittens 0.000000002 Release
  • New Features in TK Kernels
  • Latest Paper on AI Kernels
  • Upcoming Blog Posts
  • Call for Contributions

Links mentioned:


LM Studio ▷ #general (17 messages🔥):

  • Token Processing Speed
  • Recommendations for LLM
  • Context Management Issues
  • CPU Thread Count Confusion
  • Large Context Error Handling

LM Studio ▷ #hardware-discussion (63 messages🔥🔥):

  • Mac Memory Concerns
  • NGINX Proxy Issues
  • PCIE Bandwidth and Inference
  • Fractal Torrent Case
  • Multi-GPU setups

Links mentioned:


aider (Paul Gauthier) ▷ #general (31 messages🔥):

  • Aider Performance Issues
  • Web Scraping and Automation Tools
  • Git Repository Management
  • GitHub Copilot vs Aider
  • Using Aider with Claude

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (46 messages🔥):

  • FireCrawl for scraping
  • Reddit data extraction
  • Debugging strategies
  • Prompt engineering
  • Tool recommendations

Links mentioned:


aider (Paul Gauthier) ▷ #links (1 messages):

  • New Bash and Editor Tools
  • Aider Code Assistants

Link mentioned: GitHub - disler/anthropic-computer-use-bash-and-files: Contribute to disler/anthropic-computer-use-bash-and-files development by creating an account on GitHub.


OpenAI ▷ #ai-discussions (54 messages🔥):

  • AI Research Grants
  • Evolution of Algorithms
  • Anthropomorphism of AI
  • Ethical Considerations in AI
  • AGI Development Stages

Link mentioned: Tweet from Sam Altman (@sama): @kyliebytes fake news out of control


OpenAI ▷ #gpt-4-discussions (5 messages):

  • GPT Typo Issues
  • Voice Reading Problems on iOS
  • Chat Count Reduction
  • GPT Non-Responsive Behavior
  • Frustration with Model Responses

OpenAI ▷ #prompt-engineering (1 messages):

darthgustav.: Stochasticity.


OpenAI ▷ #api-discussions (1 messages):

darthgustav.: Stochasticity.


Cohere ▷ #discussions (35 messages🔥):

  • Algorithmic Trading Insights
  • Bias in News Articles
  • Garbled Output in AI Models
  • Parameter Adjustments for AI Responses
  • EDGAR and Market Insights

Cohere ▷ #questions (14 messages🔥):

  • Serverless Model Usage
  • Max Tokens Importance
  • Response Length Insights
  • Cohere Model Platforms
  • Classification Use Cases

Links mentioned:


Cohere ▷ #api-discussions (7 messages):

  • Reporting Hallucinated Data
  • Cohere Rerank API timeout issues

Cohere ▷ #projects (1 messages):

  • Synthetic Data Generation
  • Medical Note Automation

Link mentioned: pathology report: A patient undergoes a biopsy or surgical procedure, and a tissue sample is collected. The sample is sent to a pathology lab where a pathologist examines it under a microscope. The pathologist writes a...


Cohere ▷ #cohere-toolkit (2 messages):

  • Cohere Installation Issues
  • Tokenizers Compatibility

OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

  • Inflection
  • Billing Issues

Link mentioned: Inflection 3 Productivity - API, Providers, Stats: Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. Run Inflection 3 Productivity with API


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

  • Flexible chat app for macOS
  • Alpha testers recruitment

Link mentioned: imgur.com: Discover the magic of the internet at Imgur, a community powered entertainment destination. Lift your spirits with funny jokes, trending memes, entertaining gifs, inspiring stories, viral videos, and ...


OpenRouter (Alex Atallah) ▷ #general (46 messages🔥):

  • OpenRouter API issues
  • API Key Security
  • Service Outages
  • Activity Logging
  • Usage Tracking Tools

Links mentioned:


OpenRouter (Alex Atallah) ▷ #beta-feedback (8 messages🔥):

  • Access to Integrations
  • Beta Access Requests

Latent Space ▷ #ai-general-chat (48 messages🔥):

  • Moondream funding
  • AI-powered search engines
  • GitHub Copilot updates
  • Vector databases
  • OpenAI chat features

Links mentioned:


Modular (Mojo 🔥) ▷ #general (5 messages):

  • Modular products discussions
  • General software discussions
  • Advancement in community levels

Modular (Mojo 🔥) ▷ #mojo (43 messages🔥):

  • Mojo memory-safe references proposal
  • FlatBuffers vs ProtoBuf comparison
  • Swapping references implementation
  • Optimizations and performance concerns
  • Alias and noalias in Mojo

Links mentioned:


Eleuther ▷ #general (6 messages):

  • Hugging Face talk
  • Open Lean datasets
  • GPT-NeoX on Colab
  • Local completion with self-signed certificates
  • NeurIPS registration

Link mentioned: GPT-NeoX-Colab/notebooks/shakespeare_training.ipynb at main · markNZed/GPT-NeoX-Colab: Example Colab notebooks for GPT-NeoX. Contribute to markNZed/GPT-NeoX-Colab development by creating an account on GitHub.


Eleuther ▷ #research (17 messages🔥):

  • Hellaswag training results
  • Parameter sharing in Transformers
  • FP8 training efficiency
  • Modular dualization in optimization
  • Type check-focused optimization theory

Links mentioned:


Eleuther ▷ #interpretability-general (1 messages):

  • Sparse Autoencoder Guides
  • Mechanistic Interpretability Series

Link mentioned: AI-Plans: no description found


Eleuther ▷ #lm-thunderdome (2 messages):

  • Custom Certificates
  • Workaround for Certificates

Link mentioned: Issues · EleutherAI/lm-evaluation-harness): A framework for few-shot evaluation of language models. - Issues · EleutherAI/lm-evaluation-harness


Interconnects (Nathan Lambert) ▷ #news (13 messages🔥):

  • OpenAI CFO Insights
  • SearchGPT Promotion
  • ROCKET-1 Development
  • Anthropic Hiring Surge
  • Claude in GitHub Copilot

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (2 messages):

  • Human Annotation Pricing
  • Domain Sourcing Difficulty

Interconnects (Nathan Lambert) ▷ #ml-drama (7 messages):

  • NeurIPS Registration Changes
  • Concerns about Lottery System
  • Grad Students Impacted

Link mentioned: Tweet from NeurIPS Conference (@NeurIPSConf): Due to a high demand for registrations, NeurIPS will be moving towards a randomized lottery system, effective immediately. Authors of accepted conference and workshop papers are still guaranteed regis...


Interconnects (Nathan Lambert) ▷ #memes (2 messages):

  • Masayoshi Son ASI breakdown
  • Bubble concerns

Link mentioned: Tweet from sunny madra (@sundeep): Masayoshi Son just broke down how it will take $9T and 200m chips to achieve ASI 👀


OpenInterpreter ▷ #general (16 messages🔥):

  • Open Interpreter Model Capabilities
  • Local Model Limitations
  • Using the Computer API
  • Hackathon Winners Setup
  • Open Interpreter Community Support

Links mentioned:


OpenInterpreter ▷ #ai-content (5 messages):

  • OpenAI Advanced Voice
  • Apple AI Server Hacking Bounty
  • Muvi V2M
  • ChatGPT Chat History Search

Links mentioned:


Torchtune ▷ #dev (19 messages🔥):

  • Quantization of Base Models
  • FSDP CPU Offloading
  • Quantized KV-Caches
  • Non-Trainable Model Quantization

Link mentioned: Support NF4 quantization of linear layers without LoRA applied · Issue #1093 · pytorch/torchtune: As pointed out by @janeyx99, our quantize_base argument will only quantize the base model weights of linear layers with LoRA applied to them (see e.g. here in our Llama3 self-attention builder). Bu...


DSPy ▷ #papers (2 messages):

  • AI Privacy
  • Privacy-Conscious Delegation
  • Local vs Proprietary LLMs

Link mentioned: PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles: Users can divulge sensitive information to proprietary LLM providers, raising significant privacy concerns. While open-source models, hosted locally on the user's machine, alleviate some concerns,...


DSPy ▷ #general (14 messages🔥):

  • DSpy usage
  • MIPROv2 optimizer
  • DSPy programming language
  • Bug fix on MIPROv2
  • ELI5 explanation of DSPy

Links mentioned:


LlamaIndex ▷ #blog (4 messages):

  • Retrieval Augmented Generation (RAG)
  • Cohere Multi-Modal Embeddings
  • Azure AI App Templates
  • MLflow Integration
  • NVIDIA AI Insights

Links mentioned:


LlamaIndex ▷ #general (8 messages🔥):

  • Blockchain Engineering Contributions
  • Chroma Vector Store Retrieval Behavior
  • Web Scraping with LLM
  • Date-related Vector Search Queries

Link mentioned: This is how I scrape 99% websites via LLM: How to do web scraping with LLM in 2024Use AgentQL to scrape website for free: https://www.agentql.com/?utm_source=YouTube&utm_medium=Creator&utm_id=AIJason_...


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

  • LLM Agents Hackathon Registration
  • Team Formation Open
  • Prize Pool Announcement
  • Tracks Overview
  • Social Media Promotion

Link mentioned: Tweet from Dawn Song (@dawnsongtweets): 🚀Really excited that 1K+ innovators have already signed up for our LLM Agents MOOC Hackathon within just a few days! 🎉Build, collaborate, and compete for $200K+ in prizes/resources/credits across 5...


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

  • 8th lecture announcement
  • Neural and Symbolic Decision Making
  • Yuandong Tian's presentation

LLM Agents (Berkeley MOOC) ▷ #mooc-questions (6 messages):

  • Subtitles in live streams
  • React-based agent development

LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (4 messages):

  • Study Group Interest
  • Lecture Summaries
  • Virtual Meetings

Link mentioned: LLM Agents Peer Study Group (Virtual): Use this to express your interest in joining a peer study group virtually. We might use Discord events or Zoom. We will go through the lectures starting from the first and also discuss the additional...


LAION ▷ #general (3 messages):

  • Class conditioned latent diffusion model
  • Pink pixel patches
  • DDIM p_sample clipping
  • IJEPA and autoregressive architecture

LAION ▷ #research (5 messages):

  • Model Parameters
  • Token vs Parameters Confusion

tinygrad (George Hotz) ▷ #general (7 messages):

  • Negative Line Day Effects
  • CI Test Improvements
  • Uops Readability Challenges
  • Documentation Maintenance Concerns
  • Premature Optimization in Code

LangChain AI ▷ #general (4 messages):

  • RAGAS
  • CSV files in open source models
  • LangChain Python Compatibility
  • LangChain-JS Course

Link mentioned: GitHub - explodinggradients/ragas: Supercharge Your LLM Application Evaluations 🚀: Supercharge Your LLM Application Evaluations 🚀. Contribute to explodinggradients/ragas development by creating an account on GitHub.


LangChain AI ▷ #tutorials (2 messages):

  • Web scraping with LLM
  • LangChain-JS course

Link mentioned: This is how I scrape 99% websites via LLM: How to do web scraping with LLM in 2024Use AgentQL to scrape website for free: https://www.agentql.com/?utm_source=YouTube&utm_medium=Creator&utm_id=AIJason_...


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (5 messages):

  • Leaderboard Terminology
  • Multi-Step vs Multi-Turn Evaluation

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #general (2 messages):

  • Cracked Engineers Job Platform
  • Tech Job Newsletter
  • AI Startups Hiring

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (1 messages):

  • SymNoise methodology
  • LLaMA-2-7B performance
  • Fine-tuning techniques

Link mentioned: SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise: In this paper, we introduce a novel fine-tuning technique for language models, which involves incorporating symmetric noise into the embedding process. This method aims to enhance the model's func...






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}