Person: "_philschmid"

    not much happened today
    Gemini's AlphaEvolve agent uses Gemini 2.0 to find new Math and cuts Gemini cost 1% — without RL
    not much happened today
    Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model
    not much happened today
    gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API
    not much happened today
    not much happened today
    Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
    DeepSeek's Open Source Stack
    not much happened today
    The Ultra-Scale Playbook: Training LLMs on GPU Clusters
    How To Scale Your Model, by DeepMind
    not much happened today
    not much happened this weekend
    ChatGPT Advanced Voice Mode
    Pixtral 12B: Mistral beats Llama to Multimodality
    Gemini launches context caching... or does it?
    Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
    Not much happened today
    Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing
    Mixtral 8x22B Instruct sparks efficiency memes