Topic: "synthetic-data"

not much happened today

not much happened today

Qwen-Image: SOTA text rendering + 4o-imagegen-level Editing Open Weights MMDiT

not much happened today

OpenAI adopts MCP

not much happened today

DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens

not much happened today

Tencent's Hunyuan-Large claims to beat DeepSeek-V2 and Llama3-405B with LESS Data

The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more

not much happened today

State of AI 2024

Contextual Document Embeddings: `cde-small-v1`

Not much technical happened today

Reflection 70B, by Matt from IT Department

not much happened today

Apple Intelligence Beta + Segment Anything Model 2

AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold

Llama 3.1: The Synthetic Data Model

Llama 3.1 Leaks: big bumps to 8B, minor bumps to 70B, and SOTA OSS 405B model

SciCode: HumanEval gets a STEM PhD upgrade

Microsoft AgentInstruct + Orca 3

We Solved Hallucinations

GraphRAG: The Marriage of Knowledge Graphs and RAG

Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary

Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata

Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing

Meta Llama 3 (8B, 70B)

not much happened today

Welcome /r/LocalLlama!

Claude 3 just destroyed GPT 4 (see for yourself)

... and welcome AI Twitter!

1/9/2024: Nous Research lands $5m for Open Source AI

1/8/2024: The Four Wars of the AI Stack

1/4/2024: Jeff Bezos backs Perplexity's $520m Series B.

12/13/2023: SOLAR10.7B upstages Mistral7B?