Topic: "evaluation"

AI Engineer World's Fair: Second Run, Twice The Fun

Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing

Evals-based AI Engineering

Inflection-2.5 at 94% of GPT4, and Pi at 6m MAU