Launching the Adaptive Learning Speculator System ATLAS
Together AI launches ATLAS, an adaptive learning speculator system for enhancing language models.
Together AI launches ATLAS, an adaptive learning speculator system for enhancing language models.
DSGym is a new framework for evaluating and training data science agents, offering standardized solutions.
FlashAttention-3 significantly accelerates attention in AI models, achieving 1.2 PFLOPS with FP8 and improving GPU performance.
Isaac 0.1 is a new model for visual perception and OCR by Perceptron AI.
Nano Banana Pro impresses with its capabilities in image and text creation.
IBM has launched Granite 4.0 — new open-source language models for business.
Replicate announced a remote MCP server for applications, simplifying access to APIs.
Microsoft has released Harrier-OSS-v1, a new family of multilingual models achieving SOTA results.
Salesforce AI introduces VoiceAgentRAG, an architecture that significantly boosts voice query processing speed.
Hugging Face has launched TRL v1.0, setting a new standard in LLM post-training.
Google has launched Veo 3.1 Lite, a model for low-cost video generation.
Liquid AI has released LFM2.5-350M, a model with 350M parameters trained on 28T tokens, showcasing high efficiency and intelligence density.
Learn how to evaluate invoice data using LLM as a judge to enhance AI accuracy.
Discover how a PhD student created the attention mechanism in neural networks by solving practical translation problems.
Gemma Scope 2 has been announced - a new toolkit for analyzing language models that will help researchers understand their behavior.