This document provides an overview of the available Anthropic Claude models on Vertex AI, covering the following topics: The Anthropic Claude models on Vertex AI are available as fully managed, serverless APIs. To use a Claude model on Vertex AI, you send a request directly to the Vertex AI API endpoint. Because the Anthropic Claude models use a managed API, you don't need to provision or manage infrastructure. You can stream responses from Claude models to reduce perceived latency for end users. A streamed response uses server-sent events (SSE) to incrementally stream the response. You can pay for Claude models as you use them (pay-as-you-go), or you can pay a fixed fee when using provisioned throughput. For pay-as-you-go pricing, see Anthropic Claude models on the Vertex AI pricing page. The following models are available from Anthropic to use in Vertex AI. To access a Claude model, go to its Model Garden model card. Anthropic's Claude models support Vertex AI request-response logging. You can enable 30-day request-response logging of your prompt and completion activity to monitor usage and troubleshoot issues. For more information, see Log requests and responses. Claude Opus 4.1 is Anthropic's most powerful model, excelling at coding and agent capabilities, especially agentic search. It is well-suited for tasks that require advanced intelligence, such as the following: Go to the Claude Opus 4.1 model card Claude Opus 4 is a powerful model for coding and agent capabilities, especially agentic search. It is well-suited for tasks that require advanced intelligence, such as the following: Go to the Claude Opus 4 model card Claude Sonnet 4 balances performance for coding with speed and cost, making it suitable for high-volume use cases such as the following: Go to the Claude Sonnet 4 model card Claude 3.7 Sonnet is a highly capable model and the first Claude model to offer extended thinking—the ability to solve complex problems with step-by-step reasoning. With this model, you can balance speed and quality by choosing between standard thinking for near-instant responses or extended thinking for advanced reasoning. For more information about extended thinking, see Anthropic's documentation. Claude 3.7 Sonnet is optimized for the following use cases: Go to the Claude 3.7 Sonnet model card Claude 3.5 Sonnet v2 is a powerful model for real-world software engineering tasks and agentic capabilities. It delivers these advancements at the same price and speed as Claude 3.5 Sonnet. The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can manipulate a computer desktop environment. For more information, see the Anthropic documentation. This model is optimized for the following use cases: Go to the Claude 3.5 Sonnet v2 model card Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is well-suited for use cases where speed and affordability are important. It improves on its predecessor across a range of skills. This model is optimized for the following use cases: Go to the Claude 3.5 Haiku model card Anthropic's Claude 3 Haiku is Anthropic's fastest vision and text model for near-instant responses to basic queries. It is designed for AI experiences that mimic human interactions. It's optimized for the following use cases: Go to the Claude 3 Haiku model card Anthropic's Claude 3.5 Sonnet outperforms Claude 3 Opus on a wide range of Anthropic's evaluations, with the speed and cost of Anthropic's mid-tier Claude 3 Sonnet model. This model is optimized for the following use cases: Go to the Claude 3.5 Sonnet model card Learn how to use Anthropic's models.
Available Claude models
Model Description Use Case Claude Opus 4.1 Anthropic's most powerful model, which excels at coding and agent capabilities. Complex, multi-step tasks; agentic search and analysis; expert-level coding. Claude Opus 4 A powerful model for coding and agent capabilities. Advanced coding; long-horizon tasks; AI agents; agentic search. Claude Sonnet 4 Balances performance for coding with speed and cost for high-volume use cases. Everyday development tasks; AI assistants; efficient research; large-scale content generation. Claude 3.7 Sonnet First Claude model to offer extended thinking for complex problem-solving. Agentic coding; customer-facing agents; computer use; visual data extraction. Claude 3.5 Sonnet v2 A powerful model for real-world software engineering and agentic capabilities. Agentic tasks and tool use; complex coding; document Q&A; visual data extraction. Claude 3.5 Haiku Anthropic's fastest and most cost-effective model, with improvements across a range of skills. Code completions; interactive chatbots; data extraction; real-time content moderation. Claude 3 Haiku Anthropic's fastest vision and text model for near-instant responses. Live customer interactions; content moderation; cost-saving tasks; vision tasks. Claude 3.5 Sonnet Outperforms Claude 3 Opus on many evaluations with the speed and cost of Claude 3 Sonnet. Coding; complex customer support queries; data science; visual processing. Claude Opus 4.1
Claude Opus 4
Claude Sonnet 4
Claude 3.7 Sonnet
Claude 3.5 Sonnet v2
Claude 3.5 Haiku
Claude 3 Haiku
Claude 3.5 Sonnet
What's next
Anthropic 的 Claude 模型
除非另有註明,否則本頁面中的內容是採用創用 CC 姓名標示 4.0 授權,程式碼範例則為阿帕契 2.0 授權。詳情請參閱《Google Developers 網站政策》。Java 是 Oracle 和/或其關聯企業的註冊商標。
上次更新時間:2025-08-21 (世界標準時間)。