All tags

Model: "llama-2-7b"

    LLaDA: Large Language Diffusion Models
    Test-Time Training, MobileLLM, Lilian Weng on Hallucination (Plus: Turbopuffer)
    Mixtral 8x22B Instruct sparks efficiency memes
    not much happened today
    12/27/2023: NYT vs OpenAI