The Anthropic Claude models on Vertex AI are offered as fully managed, serverless APIs. To use a Claude model on Vertex AI, send a request directly to the Vertex AI API endpoint. Because the Anthropic Claude models use a managed API, there's no need to provision or manage infrastructure.

You can stream your Claude responses to reduce perceived latency for end users. A streamed response uses server-sent events (SSE) to deliver the response incrementally.

You pay for Claude models as you use them (pay as you go), or you pay a fixed fee when using provisioned throughput. For pay-as-you-go pricing, see Anthropic Claude models on the Vertex AI pricing page.

Available Claude models

The following models are available from Anthropic to use in Vertex AI. To access a Claude model, go to its Model Garden model card.

Anthropic's Claude models support Vertex AI request-response logging. Enable 30-day request-response logging of your prompt and completion activity to track any model misuse by your users. For more information, see Log requests and responses.

Claude Opus 4.1

Claude Opus 4.1 is Anthropic's most intelligent model and an industry leader for coding and agent capabilities, especially agentic search. It excels for customers needing frontier intelligence.

Claude Opus 4

Claude Opus 4 is a state-of-the-art model for coding and agent capabilities, especially agentic search. It excels for customers needing frontier intelligence.

Claude Sonnet 4

Claude Sonnet 4 balances impressive performance for coding with the right speed and cost for high-volume use cases.

Claude 3.7 Sonnet

Claude 3.7 Sonnet is Anthropic's most intelligent model to date and the first Claude model to offer extended thinking, the ability to solve complex problems with careful, step-by-step reasoning. Claude 3.7 Sonnet is a single model where you can balance speed and quality by choosing between standard thinking for near-instant responses or extended thinking for advanced reasoning. For more information about extended thinking, see Anthropic's documentation.

Claude 3.7 Sonnet is optimized for the following use cases:

Claude 3.5 Sonnet v2

Claude 3.5 Sonnet v2 is a state-of-the-art model for real-world software engineering tasks and agentic capabilities. Claude 3.5 Sonnet v2 delivers these advancements at the same price and speed as Claude 3.5 Sonnet. The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can manipulate a computer desktop environment. For more information, see the Anthropic documentation.

Claude 3.5 Sonnet is optimized for the following use cases:

Claude 3.5 Haiku

Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is optimal for use cases where speed and affordability matter. It improves on its predecessor across every skill set.

Claude 3.5 Haiku is optimized for the following use cases:

Claude 3 Haiku

Claude 3 Haiku is Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences that mimic human interactions.

- Live customer interactions and translations.
- Content moderation to catch suspicious behavior or customer requests.
- Cost-saving tasks, such as inventory management and knowledge extraction from unstructured data.
- Vision tasks, such as processing images to return text output and analyzing charts, graphs, technical diagrams, reports, and other visual content.

Claude 3.5 Sonnet

Claude 3.5 Sonnet outperforms Claude 3 Opus on a wide range of Anthropic's evaluations, with the speed and cost of Anthropic's mid-tier Claude 3 Sonnet.

Claude 3.5 Sonnet is optimized for the following use cases:

- Coding, such as writing, editing, and running code with sophisticated reasoning and troubleshooting capabilities.
- Customer support, such as handling complex queries by understanding user context and orchestrating multi-step workflows.
- Data science and analysis, by navigating unstructured data and leveraging multiple tools to generate insights.
- Visual processing, such as interpreting charts and graphs that require visual understanding.
- Writing content with a more natural, human-like tone.
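As noted in the overview, a streamed response arrives as server-sent events (SSE). The following is a minimal sketch of how a client might reassemble the text from such a stream. It is not an official client. The helper names (`iter_sse_data`, `iter_text_deltas`) and the sample data are hypothetical, and the event shape is an assumption: the sketch presumes each event's `data:` payload is JSON in the style of Anthropic's Messages streaming format, with text arriving in `content_block_delta` events that carry a `text_delta`. Verify the actual payloads against the Anthropic and Vertex AI documentation before relying on it.

```python
import json
from typing import Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each server-sent event.

    An event is a run of lines ended by a blank line. Its payload is the
    newline-joined content of its `data:` fields.
    """
    data_parts: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event.
            if data_parts:
                yield "\n".join(data_parts)
                data_parts = []
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # SSE strips one optional space after the colon.
            data_parts.append(value[1:] if value.startswith(" ") else value)
        # Other fields (event:, id:, retry:) and ":" comments are ignored.
    if data_parts:
        # Lenient choice: flush a final event that lacks a trailing blank line.
        yield "\n".join(data_parts)


def iter_text_deltas(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from a stream of SSE lines.

    Assumption: payloads are JSON objects, and text arrives in events of
    type "content_block_delta" whose "delta" has type "text_delta".
    """
    for payload in iter_sse_data(lines):
        event = json.loads(payload)
        if event.get("type") == "content_block_delta":
            delta = event.get("delta", {})
            if delta.get("type") == "text_delta":
                yield delta.get("text", "")


# Hypothetical sample stream, hand-written for illustration only.
SAMPLE_STREAM = [
    "event: message_start",
    'data: {"type": "message_start"}',
    "",
    "event: content_block_delta",
    'data: {"type": "content_block_delta", "delta": {"type": "text_delta", "text": "Hel"}}',
    "",
    "event: ping",
    'data: {"type": "ping"}',
    "",
    "event: content_block_delta",
    'data: {"type": "content_block_delta", "delta": {"type": "text_delta", "text": "lo"}}',
    "",
    "event: message_stop",
    'data: {"type": "message_stop"}',
    "",
]

if __name__ == "__main__":
    print("".join(iter_text_deltas(SAMPLE_STREAM)))  # prints: Hello
```

In practice, the lines would come from whatever HTTP client you use to call the endpoint, read line by line as the response arrives. Because the parser is kept separate from the network call, it can be tested without credentials or a live model.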
What's next
Anthropic's Claude models
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies.
Last updated 2025-08-18 UTC.