Explore Together AI Innovations at NVIDIA GTC 2026

This year, Together AI is participating in NVIDIA GTC with several major announcements and discussions shaping the AI ecosystem. These include new voice AI capabilities and technical sessions with our research and engineering leaders. If you’re attending GTC, we’d love to connect.

At GTC 2026, several announcements highlight a core theme: AI systems are becoming more open, agentic, and production-ready. Together AI, the AI Native Cloud, is designed to support this shift, helping developers train, shape, and deploy large-scale AI systems with the performance and cost-efficiency required for real-world applications.

NVIDIA has launched NVIDIA Dynamo 1.0, open-source software for generative and agentic inference at scale. We are excited to work with NVIDIA on Dynamo 1.0 and already use it as part of our inference stack to improve performance in production use cases. Together AI is committed to open innovation and looks forward to exploring further use cases for Dynamo 1.0.

Together AI and NVIDIA are collaborating on NVIDIA NemoClaw, an open-source stack that simplifies running always-on OpenClaw assistants with a single command. As part of the NVIDIA Agent Toolkit, it installs the NVIDIA OpenShell runtime, a secure environment for running autonomous agents and open-source models like NVIDIA Nemotron. The Together AI model library, with over 150 optimized models, can now be accessed via NemoClaw.

NVIDIA Nemotron 3 Super is a hybrid mixture-of-experts model designed for high-performance reasoning and multi-agent workflows. It combines a Mamba-Transformer architecture with a 1M-token context window to support long-horizon reasoning and complex agent interactions. The model is optimized to run multiple collaborating agents efficiently, even on a single GPU, making it well-suited for AI-native workflows like software development agents, financial analysis, and cybersecurity automation.

As part of our recent voice solutions launch, the NVIDIA Parakeet TDT 0.6b V3 automatic speech recognition (ASR) model is now available in the Together AI Model Library, providing developers access to high-performance, low-latency transcription optimized for real-time voice applications. By combining Parakeet’s ASR accuracy with Together’s high-performance inference infrastructure, AI natives can build production-ready voice agents that deliver fast, reliable, and scalable transcription.

The Together AI team, along with customers like Cursor and Decagon, will share insights across multiple GTC sessions covering topics from production inference to open AI research. We invite you to visit us at booth #1213 for live demos of Together AI infrastructure and models, and to meet the researchers and engineers building the future of open AI models.
