Evaluate AI Agent Performance with Amazon Bedrock AgentCore
Discover how Amazon Bedrock AgentCore helps evaluate AI agents and improve their performance.
Discover how Amazon Bedrock AgentCore helps evaluate AI agents and improve their performance.
Learn how to build deep agents for enterprise search using NVIDIA AI-Q and LangChain.
AI is transforming research by increasing the number of papers but decreasing their quality.
Together AI launches ATLAS, an adaptive learning speculator system for enhancing language models.
DSGym is a new framework for evaluating and training data science agents, offering standardized solutions.
FlashAttention-3 significantly accelerates attention in AI models, achieving 1.2 PFLOPS with FP8 and improving GPU performance.
Nano Banana Pro impresses with its capabilities in image and text creation.
Isaac 0.1 is a new model for visual perception and OCR by Perceptron AI.
Replicate announced a remote MCP server for applications, simplifying access to APIs.
IBM has launched Granite 4.0 — new open-source language models for business.
Salesforce AI introduces VoiceAgentRAG, an architecture that significantly boosts voice query processing speed.
Microsoft has released Harrier-OSS-v1, a new family of multilingual models achieving SOTA results.
Liquid AI has released LFM2.5-350M, a model with 350M parameters trained on 28T tokens, showcasing high efficiency and intelligence density.
Google has launched Veo 3.1 Lite, a model for low-cost video generation.
Hugging Face has launched TRL v1.0, setting a new standard in LLM post-training.
Discover how a PhD student created the attention mechanism in neural networks by solving practical translation problems.
Learn how to evaluate invoice data using LLM as a judge to enhance AI accuracy.
Gemma Scope 2 has been announced - a new toolkit for analyzing language models that will help researchers understand their behavior.