Vertex

[PROVIDER]
id: google-vertex
npm: @ai-sdk/google-vertex
env: GOOGLE_VERTEX_PROJECT, GOOGLE_VERTEX_LOCATION, GOOGLE_APPLICATION_CREDENTIALS
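
The `env:` line above names the variables this provider reads. As a minimal sketch, a startup check for them might look like the following. The helper name `missingVertexEnv` is hypothetical, and treating all three variables as required is an assumption: some setups authenticate through application default credentials without setting `GOOGLE_APPLICATION_CREDENTIALS`.

```typescript
// Variable names copied from the provider block's `env:` line.
const VERTEX_ENV_VARS = [
  "GOOGLE_VERTEX_PROJECT",
  "GOOGLE_VERTEX_LOCATION",
  "GOOGLE_APPLICATION_CREDENTIALS",
] as const;

// Hypothetical helper: returns the listed variables that are unset or blank.
// Assumption: all three are treated as required here.
function missingVertexEnv(env: Record<string, string | undefined>): string[] {
  return VERTEX_ENV_VARS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Placeholder values for illustration only.
const missing = missingVertexEnv({
  GOOGLE_VERTEX_PROJECT: "my-project",
  GOOGLE_VERTEX_LOCATION: "us-east5",
});
// missing is ["GOOGLE_APPLICATION_CREDENTIALS"]
```

In a Node.js program the same function could be called with `process.env` to report what is still unset before the first request is made.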

Models

Claude Haiku 3.5

claude-3-5-haiku@20241022
in $0.80/M
out $4.00/M
cache read $0.08/M
cache write $1.00/M
ctx: 200,000 max out: 8,192 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Haiku 4.5

claude-haiku-4-5@20251001
in $1.00/M
out $5.00/M
cache read $0.10/M
cache write $1.25/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4

claude-opus-4@20250514
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Opus 4.1

claude-opus-4-1@20250805
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Opus 4.5

claude-opus-4-5@20251101
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.6

claude-opus-4-6@default
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.7

claude-opus-4-7@default
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.8

claude-opus-4-8@default
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Sonnet 4

claude-sonnet-4@20250514
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Sonnet 4.5

claude-sonnet-4-5@20250929
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Sonnet 4.6

claude-sonnet-4-6@default
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

DeepSeek V3.1

deepseek-ai/deepseek-v3.1-maas
in $0.60/M
out $1.70/M
ctx: 163,840 max out: 32,768 in: text, pdf out: text
reasoning tools vision structured temp open weights

DeepSeek V3.2

deepseek-ai/deepseek-v3.2-maas
in $0.56/M
out $1.68/M
cache read $0.06/M
ctx: 163,840 max out: 65,536 in: text, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Flash

gemini-2.5-flash
in $0.30/M
out $2.50/M
cache read $0.07/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Flash TTS

gemini-2.5-flash-tts
in $0.50/M
out $10.00/M
ctx: 32,768 max out: 16,384 in: text out: audio
reasoning tools vision structured temp open weights

Gemini 2.5 Flash-Lite

gemini-2.5-flash-lite
in $0.10/M
out $0.40/M
cache read $0.01/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Pro

gemini-2.5-pro
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Pro TTS

gemini-2.5-pro-tts
in $1.00/M
out $20.00/M
ctx: 32,768 max out: 16,384 in: text out: audio
reasoning tools vision structured temp open weights

Gemini 3 Flash Preview

gemini-3-flash-preview
in $0.50/M
out $3.00/M
cache read $0.05/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite
in $0.25/M
out $1.50/M
cache read $0.03/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Lite Preview

gemini-3.1-flash-lite-preview
in $0.25/M
out $1.50/M
cache read $0.03/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights deprecated

Gemini 3.1 Pro Preview

gemini-3.1-pro-preview
in $2.00/M
out $12.00/M
cache read $0.20/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Pro Preview Custom Tools

gemini-3.1-pro-preview-customtools
in $2.00/M
out $12.00/M
cache read $0.20/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.5 Flash

gemini-3.5-flash
in $1.50/M
out $9.00/M
cache read $0.15/M
ctx: 1,048,576 max out: 65,536 in: text, image, video, audio, pdf out: text
reasoning tools vision structured temp open weights

Gemini Embedding 001

gemini-embedding-001
in $0.15/M
out $0.00/M
ctx: 2,048 max out: 1 in: text out: text
reasoning tools vision structured temp open weights

Gemini Flash Latest

gemini-flash-latest
in $0.30/M
out $2.50/M
cache read $0.07/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini Flash-Lite Latest

gemini-flash-lite-latest
in $0.10/M
out $0.40/M
cache read $0.03/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

GLM-4.7

zai-org/glm-4.7-maas
in $0.60/M
out $2.20/M
ctx: 200,000 max out: 128,000 in: text, pdf out: text
reasoning tools vision structured temp open weights

GLM-5

zai-org/glm-5-maas
in $1.00/M
out $3.20/M
cache read $0.10/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GPT OSS 120B

openai/gpt-oss-120b-maas
in $0.09/M
out $0.36/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

GPT OSS 20B

openai/gpt-oss-20b-maas
in $0.07/M
out $0.25/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2 Thinking

moonshotai/kimi-k2-thinking-maas
in $0.60/M
out $2.50/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.3 70B Instruct

meta/llama-3.3-70b-instruct-maas
in $0.72/M
out $0.72/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Llama 4 Maverick 17B 128E Instruct

meta/llama-4-maverick-17b-128e-instruct-maas
in $0.35/M
out $1.15/M
ctx: 524,288 max out: 8,192 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3 235B A22B Instruct

qwen/qwen3-235b-a22b-instruct-2507-maas
in $0.22/M
out $0.88/M
ctx: 262,144 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights
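
The `$/M` figures in the entries above are prices per one million tokens. A rough sketch of turning them into a per-request estimate follows. The type and function names are hypothetical. The sketch assumes the four token categories are billed separately and summed linearly, and it ignores any tiered or regional pricing not shown in this listing.

```typescript
// Prices in USD per one million tokens, matching the "$/M" fields.
interface Pricing {
  input: number;
  output: number;
  cacheRead?: number;  // absent when the entry lists no cache read price
  cacheWrite?: number; // absent when the entry lists no cache write price
}

interface Usage {
  inputTokens: number; // assumed to exclude cached tokens
  outputTokens: number;
  cacheReadTokens?: number;
  cacheWriteTokens?: number;
}

// Hypothetical helper: linear cost estimate in USD.
function estimateCostUsd(p: Pricing, u: Usage): number {
  const perMillion =
    u.inputTokens * p.input +
    u.outputTokens * p.output +
    (u.cacheReadTokens ?? 0) * (p.cacheRead ?? 0) +
    (u.cacheWriteTokens ?? 0) * (p.cacheWrite ?? 0);
  return perMillion / 1_000_000;
}

// Claude Haiku 4.5 prices as listed above.
const haiku45: Pricing = { input: 1.0, output: 5.0, cacheRead: 0.1, cacheWrite: 1.25 };

const cost = estimateCostUsd(haiku45, { inputTokens: 10_000, outputTokens: 2_000 });
// 10,000 x $1.00/M + 2,000 x $5.00/M = $0.01 + $0.01 = $0.02
```

Entries without cache prices simply omit the optional fields, so cached tokens contribute nothing to the estimate for those models.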