Cloud Computing & AI

Google Cloud Rebrands Vertex AI to Gemini Enterprise Agent Platform, Debuts Nvidia Vera Rubin 'AI Factories'

AI-Felix

The Era of the Agentic Enterprise


Marking a fundamental shift in its AI strategy, Google Cloud has officially rebranded its flagship Vertex AI platform as the Gemini Enterprise Agent Platform. The announcement, made at the Google Cloud Next '26 conference in Las Vegas, signals Google's intent to move beyond simple model hosting toward a comprehensive ecosystem for "Agentic AI."

Gemini Enterprise: More Than a Name Change

The new Gemini Enterprise Agent Platform consolidates the existing Vertex AI services with a new suite of tools designed for the lifecycle management of autonomous agents.

Nvidia Vera Rubin and the A5X Infrastructure

In a deepening of their decade-long partnership, Google and Nvidia unveiled the A5X bare-metal instances. These are the first cloud instances powered by the Nvidia Vera Rubin NVL72 rack-scale architecture. According to official reports, the A5X infrastructure can scale up to 960,000 Rubin GPUs in a multisite cluster, delivering a 10x reduction in inference cost per token compared to the previous Blackwell generation.
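To put those figures in perspective, here is a rough back-of-the-envelope sketch. It assumes each NVL72 rack holds 72 GPUs (inferred from the product name, not stated in the announcement) and uses a purely hypothetical Blackwell baseline price; neither number comes from Google or Nvidia.

```python
import math

# Figures quoted in the article
MAX_GPUS = 960_000        # maximum Rubin GPUs in a multisite A5X cluster
COST_REDUCTION = 10       # claimed reduction in inference cost per token vs. Blackwell

# Assumption: an NVL72 rack holds 72 GPUs (inferred from the name only)
GPUS_PER_RACK = 72


def racks_needed(total_gpus: int, gpus_per_rack: int = GPUS_PER_RACK) -> int:
    """Number of whole racks required to host total_gpus."""
    return math.ceil(total_gpus / gpus_per_rack)


def rubin_cost_per_token(blackwell_cost: float, reduction: float = COST_REDUCTION) -> float:
    """Cost per token after applying the claimed reduction factor."""
    return blackwell_cost / reduction


if __name__ == "__main__":
    print(racks_needed(MAX_GPUS))  # 13334 racks under the 72-GPU assumption

    # Hypothetical baseline: $2.00 per million tokens on Blackwell
    baseline = 2.00
    print(rubin_cost_per_token(baseline))  # cost per million tokens if the 10x claim holds
```

Under these assumptions, a full-size cluster would span more than 13,000 racks, and any Blackwell-era inference bill would shrink to one tenth if the claimed reduction held across workloads.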

Google also detailed its own eighth-generation hardware, splitting the Tensor Processing Unit lineup into two specialized chips: the TPU 8t for large-scale pre-training and the TPU 8i, which is specifically optimized for high-concurrency inference and reasoning workloads.

Market Impact and Industry Sentiment

Industry analysts note that Google's "full-stack" strategy, which spans the silicon (TPUs/Axion), the models (Gemini), and the platform itself, allows for margins that rivals like AWS and Microsoft Azure may struggle to match. Thomas Kurian, CEO of Google Cloud, stated that this integration allows for more aggressive investment in infrastructure and lower costs for the end enterprise user.

