Frozen AI News archive

Is this... OpenQ*?

**DeepSeekCoder V2** promises GPT4T-beating performance at a fraction of the cost. **Anthropic** released new research on reward tampering. **Runway** launched their Sora response and Gen-3 Alpha video generation model. A series of papers explore "test-time" search techniques improving mathematical reasoning with models like **LLaMa-3 8B**. **Apple** announced Apple Intelligence with smarter Siri and image/document understanding, partnered with **OpenAI** to integrate ChatGPT into iOS 18, and released 20 new CoreML models with LoRA fine-tuning for specialization. **NVIDIA** released **Nemotron-4 340B**, an open model matching GPT-4 performance. **DeepSeek-Coder-V2** excels in coding and math with 338 programming languages and 128K context length. **Stability AI** released Stable Diffusion 3 Medium weights. **Luma Labs** launched Dream Machine for 5-second video generation from text and images.

Canonical issue URL

AI News for 6/14/2024-6/17/2024. We checked 7 subreddits, 384 Twitters and 30 Discords (414 channels, and 5506 messages) for you. Estimated reading time saved (at 200wpm): 669 minutes. You can now tag @smol_ai for AINews discussions!

A bunch of incremental releases over this weekend; DeepSeekCoder V2 promises GPT4T-beating performance (validated by aider) at $0.14/$0.28 per million tokens (vs GPT4T's $10/$30), Anthropic dropped some Reward Tampering research, and Runway finally dropped their Sora response.

However probably the longer lasting, meatier thing to dive into is the discussion around "test-time" search:

image.png

spawning a list of related papers:

We'll be honest that we haven't read any of these papers yet, but we did cover OpenAI's thoughts on verifier-generator process supervision on the ICLR podcast, and have lined the remaining papers up for the Latent Space Discord Paper Club.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

all recaps done by Claude 3 Opus, best of 4 runs. We are working on clustering and flow engineering with Haiku.

Apple's AI Developments and Partnerships

Open Source LLMs Matching GPT-4 Performance

New Video Generation Models

Robotics and Embodied AI Developments

Miscellaneous AI Research and Applications


AI Reddit Recap

Across r/LocalLlama, r/machinelearning, r/openai, r/stablediffusion, r/ArtificialInteligence, /r/LLMDevs, /r/Singularity. Comment crawling works now but has lots to improve!

AI Models and Techniques

Stable Diffusion Models and Techniques

Llama and Local LLM Models

AI Ethics and Regulation

AI and the Future


AI Discord Recap

A summary of Summaries of Summaries

1. AI Model Performance and Scaling

2. Integration and Implementation Across Platforms

3. Ethical AI and Governance

4. New AI Developments and Benchmarking

5. Collaborative AI Projects and User Engagement


PART 1: High level Discord summaries

Stability.ai (Stable Diffusion) Discord


Unsloth AI (Daniel Han) Discord


CUDA MODE Discord


LM Studio Discord


HuggingFace Discord

AI Alternatives for GPT-4 on Low-End Hardware: Users debated on practical AI models for less powerful servers with suggestions like "llama3 (70B-7B), mixtral 8x7B, or command r+" for self-hosted AI similar to GPT-4.

RWKV-TS Challenges RNN Dominance: An arXiv paper introduces RWKV-TS, proposing it as a more efficient alternative to RNNs in time series forecasting, by effectively capturing long-term dependencies and scaling computationally.

Model Selection Matters in Business Use: In the choice of AI for business applications, it's crucial to consider use cases, tools, and deployment constraints, even with a limitation like the 7B model size. For tailored advice, members suggested focusing on specifics.

Innovations and Integrations Abound: From Difoosion, a user-friendly web interface for Stable Diffusion, to Ask Steve, a Chrome extension designed to streamline web tasks using LLMs, community members are actively integrating AI into practical tools and workflows.

Issues and Suggestions in Model Handling and Fine-Tuning:


OpenAI Discord


LAION Discord


OpenAccess AI Collective (axolotl) Discord


Perplexity AI Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


LLM Finetuning (Hamel + Dan) Discord


Interconnects (Nathan Lambert) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


OpenRouter (Alex Atallah) Discord


LangChain AI Discord


Latent Space Discord

OtterTune Exits Stage Left: OtterTuneAI has shut down following a failed acquisition deal, marking the end of their automatic database tuning services.

Apple and OpenAI Make Moves: Apple released optimized on-device models on Hugging Face, such as DETR Resnet50 Core ML, while OpenAI faced criticism from Edward Snowden for adding former NSA Director Paul M. Nakasone to its board.

DeepMind Stays in Its Lane: In recent community discussions, it was clarified that DeepMind has not been contributing to specific AI projects, debunking earlier speculation.

Runway and Anthropic Innovate: Runway announced their new video generation model, Gen-3 Alpha, on Twitter, while Anthropic publicized important research on AI models hacking their reward systems in a blog post.

Future of AI in Collaboration and Learning: Prime Intellect is set to open source sophisticated models DiLoco and DiPaco, Bittensor is making use of The Horde for decentralized training, and a YouTube video shared among users breaks down optimizers critical for model training.


Cohere Discord


OpenInterpreter Discord


Torchtune Discord


DiscoResearch Discord


Datasette - LLM (@SimonW) Discord

Heralding Data Engineering Job Security: ChatGPT's burgeoning role in the tech landscape drew humor-inflected commentary that it represents an infinite job generator for data engineers.

Thoughtbot Clears the Fog on LLMs: The guild appreciated a guide by Thoughtbot for its lucidity in dissecting the world of Large Language Models, specifically for their delineation of Base, Instruct, and Chat models which can aid beginners.

New Kid on the Search Block: Turso's latest release integrates native vector search with SQLite, which aims at enhancing the AI product development experience by replacing the need for independent extensions like sqlite-vss.


AI Stack Devs (Yoko Li) Discord


Mozilla AI Discord


The LLM Perf Enthusiasts AI Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The YAIG (a16z Infra) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Stability.ai (Stable Diffusion) ▷ #general-chat (723 messages🔥🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (517 messages🔥🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #random (17 messages🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (304 messages🔥🔥):

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Link mentioned: Tweet from Diwank Singh (@diwanksingh): http://x.com/i/article/1802116084507848704


Unsloth AI (Daniel Han) ▷ #community-collaboration (1 messages):

starsupernova: Oh very interesting!


CUDA MODE ▷ #general (49 messages🔥):

Links mentioned:


CUDA MODE ▷ #triton (2 messages):


CUDA MODE ▷ #torch (10 messages🔥):

Link mentioned: pytorch/torch/_inductor/utils.py at f0d68120f4e99ee6c05f1235d9b42a4524af39d5 · pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration - pytorch/pytorch


CUDA MODE ▷ #algorithms (2 messages):


CUDA MODE ▷ #beginner (5 messages):


CUDA MODE ▷ #jax (1 messages):

Link mentioned: GitHub - yixiaoer/tpux: A set of Python scripts that makes your experience on TPU better: A set of Python scripts that makes your experience on TPU better - yixiaoer/tpux


CUDA MODE ▷ #torchao (11 messages🔥):


CUDA MODE ▷ #off-topic (10 messages🔥):

Links mentioned:


CUDA MODE ▷ #irl-meetup (1 messages):


CUDA MODE ▷ #llmdotc (473 messages🔥🔥🔥):

Links mentioned:


CUDA MODE ▷ #oneapi (2 messages):


CUDA MODE ▷ #bitnet (49 messages🔥):

Links mentioned:


LM Studio ▷ #💬-general (204 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🤖-models-discussion-chat (137 messages🔥🔥):

Links mentioned:


LM Studio ▷ #🧠-feedback (13 messages🔥):

Link mentioned: LM Studio Beta Releases: no description found


LM Studio ▷ #📝-prompts-discussion-chat (8 messages🔥):


LM Studio ▷ #⚙-configs-discussion (3 messages):


LM Studio ▷ #🎛-hardware-discussion (34 messages🔥):

Link mentioned: LM-Studio-0.2.23-Setup.exe - Mirrored.to - Mirrorcreator - Upload files to multiple hosts: no description found


LM Studio ▷ #🧪-beta-releases-chat (22 messages🔥):


LM Studio ▷ #autogen (1 messages):


LM Studio ▷ #open-interpreter (13 messages🔥):

Link mentioned: ChatGPT "Code Interpreter" But 100% Open-Source (Open Interpreter Tutorial): This is my second video about Open Interpreter, with many new features and much more stability, the new Open Interpreter is amazing. Update: Mixtral 7x8b was...


LM Studio ▷ #model-announcements (1 messages):

Link mentioned: lmstudio-community/DeepSeek-Coder-V2-Lite-Instruct-GGUF · Hugging Face: no description found


LM Studio ▷ #🛠-dev-chat (27 messages🔥):

Links mentioned:


HuggingFace ▷ #general (372 messages🔥🔥):

Links mentioned:


HuggingFace ▷ #today-im-learning (5 messages):


HuggingFace ▷ #cool-finds (10 messages🔥):

Links mentioned:


HuggingFace ▷ #i-made-this (18 messages🔥):

Links mentioned:


HuggingFace ▷ #reading-group (16 messages🔥):

Links mentioned:


HuggingFace ▷ #computer-vision (4 messages):


HuggingFace ▷ #NLP (5 messages):

Links mentioned:


HuggingFace ▷ #diffusion-discussions (5 messages):


OpenAI ▷ #ai-discussions (184 messages🔥🔥):

Links mentioned:


OpenAI ▷ #gpt-4-discussions (49 messages🔥):


OpenAI ▷ #prompt-engineering (28 messages🔥):


OpenAI ▷ #api-discussions (28 messages🔥):


LAION ▷ #general (250 messages🔥🔥):

Links mentioned:


LAION ▷ #research (34 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general (161 messages🔥🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-dev (4 messages):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #general-help (9 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #datasets (4 messages):


OpenAccess AI Collective (axolotl) ▷ #community-showcase (1 messages):

Link mentioned: Alex Strick van Linschoten - Finetuning my first LLM(s) for structured data extraction with axolotl: I finetuned my first LLM(s) for the task of extracting structured data from ISAF press releases. Initial tests suggest that it worked pretty well out of the box.


OpenAccess AI Collective (axolotl) ▷ #axolotl-help-bot (11 messages🔥):

Links mentioned:


OpenAccess AI Collective (axolotl) ▷ #axolotl-phorm-bot (11 messages🔥):

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Link mentioned: SoftBank Corp. Launches Strategic Partnership with Leading AI Startup Perplexity | About Us | SoftBank: SoftBank Corp.‘s corporate page provides information about “SoftBank Corp. Launches Strategic Partnership with Leading AI Startup Perplexity”.


Perplexity AI ▷ #general (187 messages🔥🔥):

Link mentioned: Reddit - Dive into anything: no description found


Perplexity AI ▷ #sharing (10 messages🔥):


Perplexity AI ▷ #pplx-api (3 messages):


Nous Research AI ▷ #off-topic (3 messages):

Links mentioned:


Nous Research AI ▷ #interesting-links (5 messages):

Links mentioned:


Nous Research AI ▷ #general (124 messages🔥🔥):

Links mentioned:


Nous Research AI ▷ #ask-about-llms (22 messages🔥):

Links mentioned:


Nous Research AI ▷ #world-sim (29 messages🔥):


Modular (Mojo 🔥) ▷ #general (40 messages🔥):

Links mentioned:


Modular (Mojo 🔥) ▷ #💬︱twitter (2 messages):


Modular (Mojo 🔥) ▷ #✍︱blog (1 messages):

Link mentioned: Modular: What’s New in Mojo 24.4? Improved collections, new traits, os module features and core language enhancements: We are building a next-generation AI developer platform for the world. Check out our latest post: What’s New in Mojo 24.4? Improved collections, new traits, os module features and core language enhanc...


Modular (Mojo 🔥) ▷ #ai (2 messages):

Link mentioned: Francois Chollet - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution: Here is my conversation with Francois Chollet and Mike Knoop on the $1 million ARC-AGI Prize they're launching today.I did a bunch of socratic grilling throu...


Modular (Mojo 🔥) ▷ #🔥mojo (107 messages🔥🔥):

Links mentioned:


Modular (Mojo 🔥) ▷ #🏎engine (3 messages):

Link mentioned: GitHub - openxla/xla: A machine learning compiler for GPUs, CPUs, and ML accelerators: A machine learning compiler for GPUs, CPUs, and ML accelerators - openxla/xla


Modular (Mojo 🔥) ▷ #nightly (9 messages🔥):


Eleuther ▷ #announcements (1 messages):

Link mentioned: Experiments in Weak-to-Strong Generalization: Writing up results from a recent project


Eleuther ▷ #general (51 messages🔥):

Links mentioned:


Eleuther ▷ #research (61 messages🔥🔥):

Links mentioned:


Eleuther ▷ #scaling-laws (18 messages🔥):

Links mentioned:


Eleuther ▷ #interpretability-general (11 messages🔥):

Links mentioned:


Eleuther ▷ #lm-thunderdome (4 messages):


Eleuther ▷ #multimodal-general (3 messages):

Link mentioned: Issues · deepglint/RWKV-CLIP: The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner" - Issues · deepglint/RWKV-CLIP


LLM Finetuning (Hamel + Dan) ▷ #general (35 messages🔥):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #🟩-modal (14 messages🔥):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #learning-resources (5 messages):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #hugging-face (2 messages):


LLM Finetuning (Hamel + Dan) ▷ #replicate (2 messages):


LLM Finetuning (Hamel + Dan) ▷ #langsmith (5 messages):


LLM Finetuning (Hamel + Dan) ▷ #berryman_prompt_workshop (2 messages):


LLM Finetuning (Hamel + Dan) ▷ #workshop-3 (4 messages):


LLM Finetuning (Hamel + Dan) ▷ #clavie_beyond_ragbasics (8 messages🔥):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #jason_improving_rag (1 messages):


LLM Finetuning (Hamel + Dan) ▷ #jeremy_python_llms (3 messages):

Link mentioned: Join the fast.ai Discord Server!: Check out the fast.ai community on Discord - hang out with 10920 other members and enjoy free voice and text chat.


LLM Finetuning (Hamel + Dan) ▷ #saroufimxu_slaying_ooms (3 messages):


LLM Finetuning (Hamel + Dan) ▷ #axolotl (27 messages🔥):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #wing-axolotl (1 messages):

Links mentioned:


LLM Finetuning (Hamel + Dan) ▷ #charles-modal (1 messages):


LLM Finetuning (Hamel + Dan) ▷ #simon_cli_llms (5 messages):

Link mentioned: no title found: no description found


LLM Finetuning (Hamel + Dan) ▷ #allaire_inspect_ai (3 messages):


LLM Finetuning (Hamel + Dan) ▷ #credits-questions (3 messages):


LLM Finetuning (Hamel + Dan) ▷ #fireworks (6 messages):


LLM Finetuning (Hamel + Dan) ▷ #braintrust (3 messages):


LLM Finetuning (Hamel + Dan) ▷ #west-coast-usa (1 messages):

.peterj: Anyone from Seattle area?


LLM Finetuning (Hamel + Dan) ▷ #east-coast-usa (1 messages):

ssilby: <@415846459016216576> I'm in! Let's set up a DMV meetup :3


LLM Finetuning (Hamel + Dan) ▷ #predibase (7 messages):

Link mentioned: isafpr_finetune/data at main · strickvl/isafpr_finetune: Finetuning an LLM for structured data extraction from press releases - strickvl/isafpr_finetune


LLM Finetuning (Hamel + Dan) ▷ #openpipe (3 messages):

Link mentioned: isafpr_finetune/data at main · strickvl/isafpr_finetune: Finetuning an LLM for structured data extraction from press releases - strickvl/isafpr_finetune


LLM Finetuning (Hamel + Dan) ▷ #openai (1 messages):

kramakurious: <@1010989949572612166> is this something you can help with?


Interconnects (Nathan Lambert) ▷ #news (69 messages🔥🔥):

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-drama (4 messages):

Link mentioned: Tweet from Jacques (@JacquesThibs): "Sam Altman recently told some shareholders that OAI is considering changing its governance structure to a for-profit business that OAI's nonprofit board doesn't control. [...] could open ...


Interconnects (Nathan Lambert) ▷ #random (63 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #blog (9 messages🔥):

Links mentioned:


LlamaIndex ▷ #general (95 messages🔥🔥):

Links mentioned:


LlamaIndex ▷ #ai-discussion (6 messages):


tinygrad (George Hotz) ▷ #general (39 messages🔥):

Links mentioned:


tinygrad (George Hotz) ▷ #learn-tinygrad (69 messages🔥🔥):

Link mentioned: Creation - tinygrad docs: no description found


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Link mentioned: GPNotes: no description found


OpenRouter (Alex Atallah) ▷ #general (68 messages🔥🔥):

Links mentioned:


OpenRouter (Alex Atallah) ▷ #일반 (1 messages):

is.maywell: <:a6adc388ea504e89751ecbbd50919d3a:1240669253699637339>


LangChain AI ▷ #general (48 messages🔥):

Links mentioned:

For demo video, see:

https://www.youtube.com/watch?v=9mMbQpofiJY

The app makes the life of academics easier by automating some tedious jobs like retrieving files from arxiv, making summaries and performing context based translation. Future goal is to make a paper survey out of a single paper.

If you feel like in need for some punishment. Check my git repo https://github.com/artnoage/Langgraph_Manuscript_Workflows": 72 likes, 1 comments - vaioslaschos on June 16, 2024: "This is the promotional video for the app that I create for Generative AI Agents Developer Contest by NVIDIA and LangChain....". Blowing Kisses Gratitude GIF - Blowing kisses Kisses Kiss - Discover & Share GIFs: Click to view the GIFHow to properly provide the input schema to the model · langchain-ai/langchain · Discussion #22899: Checked other resources I added a very descriptive title to this question. I searched the LangChain documentation with the integrated search. I used the GitHub search to find a similar question and...


LangChain AI ▷ #share-your-work (14 messages🔥):

Links mentioned:


LangChain AI ▷ #tutorials (1 messages):

emarco: https://www.youtube.com/watch?v=0gJLFTlGFVU


Latent Space ▷ #ai-general-chat (21 messages🔥):

Links mentioned:


Latent Space ▷ #ai-in-action-club (20 messages🔥):

Links mentioned:


Cohere ▷ #general (20 messages🔥):

Links mentioned:


Cohere ▷ #project-sharing (11 messages🔥):

Links mentioned:


Cohere ▷ #announcements (1 messages):

Link mentioned: Join the Cohere Community Discord Server!: Cohere community server. Come chat about Cohere API, LLMs, Generative AI, and everything in between. | 17098 members


OpenInterpreter ▷ #general (14 messages🔥):

Links mentioned:


OpenInterpreter ▷ #O1 (4 messages):


OpenInterpreter ▷ #ai-content (6 messages):

Links mentioned:


Torchtune ▷ #general (7 messages):

Link mentioned: FullyShardedDataParallel — PyTorch 2.3 documentation: no description found


DiscoResearch ▷ #discolm_german (5 messages):


Datasette - LLM (@SimonW) ▷ #ai (3 messages):

Link mentioned: Understanding open source LLMs: Do you think you can run any Large Language Model (LLM) on your machine?


Datasette - LLM (@SimonW) ▷ #llm (1 messages):

Link mentioned: Turso brings Native Vector Search to SQLite: Vector Similarity Search is now available!


AI Stack Devs (Yoko Li) ▷ #ai-town-discuss (1 messages):

gomiez: anyone know of the hospital ai town project name?


Mozilla AI ▷ #llamafile (1 messages):

cryovolcano.: can we use llamafile with tinyllama as a search engine in firefox ?





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}