Vercel AI Gateway

[PROVIDER]
id: vercel
npm: @ai-sdk/gateway
env: AI_GATEWAY_API_KEY

Models

Claude Haiku 3

anthropic/claude-3-haiku
in $0.25/M
out $1.25/M
cache read $0.03/M
cache write $0.30/M
ctx: 200,000 max out: 4,096 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Haiku 3.5

anthropic/claude-3.5-haiku
in $0.80/M
out $4.00/M
cache read $0.08/M
cache write $1.00/M
ctx: 200,000 max out: 8,192 in: text, image, pdf out: text
reasoning tools vision structured temp open weights deprecated

Claude Haiku 4.5

anthropic/claude-haiku-4.5
in $1.00/M
out $5.00/M
cache read $0.10/M
cache write $1.25/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4

anthropic/claude-opus-4
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.1

anthropic/claude-opus-4.1
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.5

anthropic/claude-opus-4.5
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.6

anthropic/claude-opus-4.6
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.7

anthropic/claude-opus-4.7
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Opus 4.8

anthropic/claude-opus-4.8
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Sonnet 4

anthropic/claude-sonnet-4
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Sonnet 4.5

anthropic/claude-sonnet-4.5
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Claude Sonnet 4.6

anthropic/claude-sonnet-4.6
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Codestral (latest)

mistral/codestral
in $0.30/M
out $0.90/M
ctx: 256,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Codestral Embed

mistral/codestral-embed
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Cohere Rerank 3.5

cohere/rerank-v3.5
in
out
ctx: 4,096 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Cohere Rerank 4 Fast

cohere/rerank-v4-fast
in
out
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Cohere Rerank 4 Pro

cohere/rerank-v4-pro
in
out
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Command A

cohere/command-a
in $2.50/M
out $10.00/M
ctx: 256,000 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V3 0324

deepseek/deepseek-v3
in $0.27/M
out $1.12/M
cache read $0.14/M
ctx: 163,840 max out: 163,840 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus
in $0.27/M
out $1.00/M
cache read $0.14/M
ctx: 131,072 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V3.2

deepseek/deepseek-v3.2
in $0.28/M
out $0.42/M
cache read $0.03/M
ctx: 128,000 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V3.2 Thinking

deepseek/deepseek-v3.2-thinking
in $0.62/M
out $1.85/M
ctx: 128,000 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V4 Flash

deepseek/deepseek-v4-flash
in $0.14/M
out $0.28/M
cache read $0.00/M
ctx: 1,000,000 max out: 384,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V4 Pro

deepseek/deepseek-v4-pro
in $0.43/M
out $0.87/M
cache read $0.00/M
ctx: 1,000,000 max out: 384,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek-R1

deepseek/deepseek-r1
in $1.35/M
out $5.40/M
ctx: 128,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek-V3.1

deepseek/deepseek-v3.1
in $0.56/M
out $1.68/M
cache read $0.28/M
ctx: 163,840 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Devstral 2

mistral/devstral-2
in $0.40/M
out $2.00/M
ctx: 256,000 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

Devstral Small 1.1

mistral/devstral-small
in $0.10/M
out $0.30/M
ctx: 128,000 max out: 64,000 in: text out: text
reasoning tools vision structured temp open weights

Devstral Small 2

mistral/devstral-small-2
in $0.10/M
out $0.30/M
ctx: 256,000 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

Embed v4.0

cohere/embed-v4.0
in
out
ctx: 128,000 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Flux Schnell

prodia/flux-fast-schnell
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.1 Fill [pro]

bfl/flux-pro-1.0-fill
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.1 Kontext Max

bfl/flux-kontext-max
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.1 Kontext Pro

bfl/flux-kontext-pro
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.2 [flex]

bfl/flux-2-flex
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.2 [klein] 4B

bfl/flux-2-klein-4b
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.2 [klein] 9B

bfl/flux-2-klein-9b
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX.2 [max]

bfl/flux-2-max
in
out
ctx: 67,300 max out: 67,300 in: text out: image
reasoning tools vision structured temp open weights

FLUX.2 [pro]

bfl/flux-2-pro
in
out
ctx: 67,300 max out: 67,300 in: text out: image
reasoning tools vision structured temp open weights

FLUX1.1 [pro]

bfl/flux-pro-1.1
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

FLUX1.1 [pro] Ultra

bfl/flux-pro-1.1-ultra
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Fugu Ultra

sakana/fugu-ultra
in $5.00/M
out $30.00/M
cache read $0.50/M
ctx: 1,000,000 max out: 1,000,000 in: text, image out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Flash

google/gemini-2.5-flash
in $0.30/M
out $2.50/M
cache read $0.03/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite
in $0.10/M
out $0.40/M
cache read $0.01/M
ctx: 1,048,576 max out: 65,536 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 2.5 Pro

google/gemini-2.5-pro
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3 Flash

google/gemini-3-flash
in $0.50/M
out $3.00/M
cache read $0.05/M
ctx: 1,000,000 max out: 65,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3 Pro Preview

google/gemini-3-pro-preview
in $2.00/M
out $12.00/M
cache read $0.20/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Image (Nano Banana 2)

google/gemini-3.1-flash-image
in $0.50/M
out $3.00/M
cache read $0.05/M
ctx: 131,072 max out: 32,768 in: text, image out: text, image
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Image Preview (Nano Banana 2)

google/gemini-3.1-flash-image-preview
in $0.50/M
out $3.00/M
cache read $0.05/M
ctx: 131,072 max out: 32,768 in: text, image out: text, image
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite
in $0.25/M
out $1.50/M
cache read $0.03/M
ctx: 1,000,000 max out: 65,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview
in $0.25/M
out $1.50/M
cache read $0.03/M
ctx: 1,000,000 max out: 65,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.1 Pro Preview

google/gemini-3.1-pro-preview
in $2.00/M
out $12.00/M
cache read $0.20/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini 3.5 Flash

google/gemini-3.5-flash
in $1.50/M
out $9.00/M
cache read $0.15/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemini Embedding 001

google/gemini-embedding-001
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Gemini Embedding 2

google/gemini-embedding-2
in
out
ctx: 0 max out: 0 in: text out: text
reasoning tools vision structured temp open weights

Gemma 4 26B A4B IT

google/gemma-4-26b-a4b-it
in $0.15/M
out $0.60/M
cache read $0.01/M
ctx: 262,144 max out: 131,072 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Gemma 4 31B IT

google/gemma-4-31b-it
in $0.14/M
out $0.40/M
ctx: 262,144 max out: 131,072 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GLM 4.5

zai/glm-4.5
in $0.60/M
out $2.20/M
cache read $0.11/M
ctx: 128,000 max out: 96,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 4.5 Air

zai/glm-4.5-air
in $0.20/M
out $1.10/M
cache read $0.03/M
ctx: 128,000 max out: 96,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 4.5V

zai/glm-4.5v
in $0.60/M
out $1.80/M
cache read $0.11/M
ctx: 66,000 max out: 16,000 in: text, image out: text
reasoning tools vision structured temp open weights

GLM 4.6

zai/glm-4.6
in $0.60/M
out $2.20/M
cache read $0.11/M
ctx: 200,000 max out: 96,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 4.7

zai/glm-4.7
in $2.25/M
out $2.75/M
cache read $2.25/M
ctx: 131,000 max out: 40,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 4.7 Flash

zai/glm-4.7-flash
in $0.07/M
out $0.40/M
ctx: 200,000 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 4.7 FlashX

zai/glm-4.7-flashx
in $0.06/M
out $0.40/M
cache read $0.01/M
ctx: 200,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 5 Turbo

zai/glm-5-turbo
in $1.20/M
out $4.00/M
cache read $0.24/M
ctx: 202,800 max out: 131,100 in: text out: text
reasoning tools vision structured temp open weights

GLM 5.1

zai/glm-5.1
in $1.40/M
out $4.40/M
cache read $0.26/M
ctx: 202,800 max out: 64,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 5.2

zai/glm-5.2
in $1.50/M
out $4.50/M
cache read $0.30/M
ctx: 1,000,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 5.2 Fast

zai/glm-5.2-fast
in $3.00/M
out $10.25/M
cache read $0.50/M
ctx: 1,000,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

GLM 5V Turbo

zai/glm-5v-turbo
in $1.20/M
out $4.00/M
cache read $0.24/M
ctx: 200,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GLM-4.6V

zai/glm-4.6v
in $0.30/M
out $0.90/M
cache read $0.05/M
ctx: 128,000 max out: 24,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GLM-4.6V-Flash

zai/glm-4.6v-flash
in
out
ctx: 128,000 max out: 24,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GLM-5

zai/glm-5
in $1.00/M
out $3.20/M
cache read $0.20/M
ctx: 202,800 max out: 131,100 in: text out: text
reasoning tools vision structured temp open weights

GPT 4o Mini Search Preview

openai/gpt-4o-mini-search-preview
in $0.15/M
out $0.60/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

GPT 5.1 Codex Max

openai/gpt-5.1-codex-max
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.1 Thinking

openai/gpt-5.1-thinking
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text, image
reasoning tools vision structured temp open weights

GPT 5.2

openai/gpt-5.2-pro
in $21.00/M
out $168.00/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.3 Codex

openai/gpt-5.3-codex
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.4

openai/gpt-5.4
in $2.50/M
out $15.00/M
cache read $0.25/M
ctx: 1,050,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.4 Mini

openai/gpt-5.4-mini
in $0.75/M
out $4.50/M
cache read $0.07/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.4 Nano

openai/gpt-5.4-nano
in $0.20/M
out $1.25/M
cache read $0.02/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.4 Pro

openai/gpt-5.4-pro
in $30.00/M
out $180.00/M
ctx: 1,050,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.5

openai/gpt-5.5
in $5.00/M
out $30.00/M
cache read $0.50/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT 5.5 Pro

openai/gpt-5.5-pro
in $30.00/M
out $180.00/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT Image 1

openai/gpt-image-1
in $5.00/M
out $40.00/M
cache read $1.25/M
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

GPT Image 1 Mini

openai/gpt-image-1-mini
in $2.00/M
out $8.00/M
cache read $0.20/M
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

GPT Image 1.5

openai/gpt-image-1.5
in $5.00/M
out $32.00/M
cache read $1.25/M
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

GPT Image 2

openai/gpt-image-2
in $5.00/M
out $30.00/M
cache read $1.25/M
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

GPT OSS 120B

openai/gpt-oss-120b
in $0.35/M
out $0.75/M
cache read $0.25/M
ctx: 131,072 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

GPT OSS 20B

openai/gpt-oss-20b
in $0.05/M
out $0.20/M
ctx: 131,072 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

GPT-3.5 Turbo

openai/gpt-3.5-turbo
in $0.50/M
out $1.50/M
ctx: 16,385 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

GPT-3.5 Turbo Instruct

openai/gpt-3.5-turbo-instruct
in $1.50/M
out $2.00/M
ctx: 8,192 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

GPT-4 Turbo

openai/gpt-4-turbo
in $10.00/M
out $30.00/M
ctx: 128,000 max out: 4,096 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-4.1

openai/gpt-4.1
in $2.00/M
out $8.00/M
cache read $0.50/M
ctx: 1,047,576 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-4.1 mini

openai/gpt-4.1-mini
in $0.40/M
out $1.60/M
cache read $0.10/M
ctx: 1,047,576 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-4.1 nano

openai/gpt-4.1-nano
in $0.10/M
out $0.40/M
cache read $0.03/M
ctx: 1,047,576 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-4o

openai/gpt-4o
in $2.50/M
out $10.00/M
cache read $1.25/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-4o mini

openai/gpt-4o-mini
in $0.15/M
out $0.60/M
cache read $0.07/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-4o mini Transcribe

openai/gpt-4o-mini-transcribe
in $1.25/M
out $5.00/M
ctx: 0 max out: 0 in: audio out: text
reasoning tools vision structured temp open weights

GPT-4o Transcribe

openai/gpt-4o-transcribe
in $2.50/M
out $10.00/M
ctx: 0 max out: 0 in: audio out: text
reasoning tools vision structured temp open weights

GPT-5

openai/gpt-5
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-5 Chat

openai/gpt-5-chat
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text, image
reasoning tools vision structured temp open weights

GPT-5 Mini

openai/gpt-5-mini
in $0.25/M
out $2.00/M
cache read $0.03/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-5 Nano

openai/gpt-5-nano
in $0.05/M
out $0.40/M
cache read $0.01/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-5 pro

openai/gpt-5-pro
in $15.00/M
out $120.00/M
ctx: 400,000 max out: 272,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5-Codex

openai/gpt-5-codex
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

GPT-5.1 Codex mini

openai/gpt-5.1-codex-mini
in $0.25/M
out $2.00/M
cache read $0.03/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.1 Instant

openai/gpt-5.1-instant
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.1-Codex

openai/gpt-5.1-codex
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.2

openai/gpt-5.2
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.2 Chat

openai/gpt-5.2-chat
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.2-Codex

openai/gpt-5.2-codex
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

GPT-5.3 Chat

openai/gpt-5.3-chat
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 128,000 max out: 16,384 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

gpt-oss-safeguard-20b

openai/gpt-oss-safeguard-20b
in $0.07/M
out $0.30/M
cache read $0.04/M
ctx: 131,072 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

GPT-Realtime mini

openai/gpt-realtime-mini
in $0.60/M
out $2.40/M
cache read $0.06/M
ctx: 0 max out: 0 in: text, audio out: text, audio
reasoning tools vision structured temp open weights

GPT-Realtime-1.5

openai/gpt-realtime-1.5
in $4.00/M
out $16.00/M
cache read $0.40/M
ctx: 0 max out: 0 in: text, audio out: text, audio
reasoning tools vision structured temp open weights

gpt-realtime-2

openai/gpt-realtime-2
in $4.00/M
out $24.00/M
cache read $0.40/M
ctx: 0 max out: 0 in: text, audio out: text, audio
reasoning tools vision structured temp open weights

Grok 4.1 Fast Non-Reasoning

xai/grok-4.1-fast-non-reasoning
in $0.20/M
out $0.50/M
cache read $0.05/M
ctx: 1,000,000 max out: 1,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.1 Fast Reasoning

xai/grok-4.1-fast-reasoning
in $0.20/M
out $0.50/M
cache read $0.05/M
ctx: 1,000,000 max out: 1,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Beta Non-Reasoning

xai/grok-4.20-non-reasoning-beta
in $1.25/M
out $2.50/M
cache read $0.40/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Beta Reasoning

xai/grok-4.20-reasoning-beta
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Multi Agent Beta

xai/grok-4.20-multi-agent-beta
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Multi-Agent

xai/grok-4.20-multi-agent
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Non-Reasoning

xai/grok-4.20-non-reasoning
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.20 Reasoning

xai/grok-4.20-reasoning
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok 4.3

xai/grok-4.3
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 1,000,000 max out: 1,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Grok Build 0.1

xai/grok-build-0.1
in $1.00/M
out $2.00/M
cache read $0.20/M
ctx: 256,000 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

Grok Imagine

xai/grok-imagine-video
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Grok Imagine Image

xai/grok-imagine-image
in
out
ctx: 0 max out: 0 in: text out: text, image
reasoning tools vision structured temp open weights

Grok Imagine Video 1.5 Preview

xai/grok-imagine-video-1.5-preview
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Grok STT

xai/grok-stt
in
out
ctx: 0 max out: 0 in: audio out: text
reasoning tools vision structured temp open weights

Grok TTS

xai/grok-tts
in
out
ctx: 0 max out: 0 in: text out: audio
reasoning tools vision structured temp open weights

Grok Voice Think Fast 1.0

xai/grok-voice-think-fast-1.0
in
out
ctx: 0 max out: 0 in: text, audio out: text, audio
reasoning tools vision structured temp open weights

Imagen 4

google/imagen-4.0-generate-001
in
out
ctx: 480 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Imagen 4 Fast

google/imagen-4.0-fast-generate-001
in
out
ctx: 480 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Imagen 4 Ultra

google/imagen-4.0-ultra-generate-001
in
out
ctx: 480 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Interfaze Beta

interfaze/interfaze-beta
in $1.50/M
out $3.50/M
ctx: 1,000,000 max out: 32,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Kat Coder Pro V2

kwaipilot/kat-coder-pro-v2
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 256,000 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

KAT-Coder-Pro V1

kwaipilot/kat-coder-pro-v1
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 256,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2 Instruct

moonshotai/kimi-k2
in $0.57/M
out $2.30/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2 Thinking

moonshotai/kimi-k2-thinking
in $0.60/M
out $2.50/M
cache read $0.15/M
ctx: 262,114 max out: 262,114 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2.5

moonshotai/kimi-k2.5
in $0.60/M
out $3.00/M
cache read $0.10/M
ctx: 262,114 max out: 262,114 in: text, image out: text
reasoning tools vision structured temp open weights

Kimi K2.6

moonshotai/kimi-k2.6
in $0.95/M
out $4.00/M
cache read $0.16/M
ctx: 262,000 max out: 262,000 in: text, image out: text
reasoning tools vision structured temp open weights

Kimi K2.7 Code

moonshotai/kimi-k2.7-code
in $0.95/M
out $4.00/M
cache read $0.19/M
ctx: 256,000 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Kimi K2.7 Code High Speed

moonshotai/kimi-k2.7-code-highspeed
in $1.90/M
out $8.00/M
cache read $0.38/M
ctx: 262,144 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Kling v2.5 Turbo Image-to-Video

klingai/kling-v2.5-turbo-i2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v2.5 Turbo Text-to-Video

klingai/kling-v2.5-turbo-t2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v2.6 Image-to-Video

klingai/kling-v2.6-i2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v2.6 Motion Control

klingai/kling-v2.6-motion-control
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v2.6 Text-to-Video

klingai/kling-v2.6-t2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v3.0 Image-to-Video

klingai/kling-v3.0-i2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v3.0 Motion Control

klingai/kling-v3.0-motion-control
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Kling v3.0 Text-to-Video

klingai/kling-v3.0-t2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Llama 3.1 70B Instruct

meta/llama-3.1-70b
in $0.72/M
out $0.72/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.1 8B Instruct

meta/llama-3.1-8b
in $0.22/M
out $0.22/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.2 11B Vision Instruct

meta/llama-3.2-11b
in $0.16/M
out $0.16/M
ctx: 128,000 max out: 8,192 in: text, image out: text
reasoning tools vision structured temp open weights

Llama 3.2 1B Instruct

meta/llama-3.2-1b
in $0.10/M
out $0.10/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.2 3B Instruct

meta/llama-3.2-3b
in $0.15/M
out $0.15/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.2 90B Vision Instruct

meta/llama-3.2-90b
in $0.72/M
out $0.72/M
ctx: 128,000 max out: 8,192 in: text, image out: text
reasoning tools vision structured temp open weights

Llama-3.3-70B-Instruct

meta/llama-3.3-70b
in $0.00/M
out $0.00/M
ctx: 128,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Llama-4-Maverick-17B-128E-Instruct-FP8

meta/llama-4-maverick
in $0.00/M
out $0.00/M
ctx: 128,000 max out: 4,096 in: text, image out: text
reasoning tools vision structured temp open weights

Llama-4-Scout-17B-16E-Instruct-FP8

meta/llama-4-scout
in $0.00/M
out $0.00/M
ctx: 128,000 max out: 4,096 in: text, image out: text
reasoning tools vision structured temp open weights

LongCat Flash Chat

meituan/longcat-flash-chat
in
out
ctx: 128,000 max out: 100,000 in: text out: text
reasoning tools vision structured temp open weights

LongCat Flash Thinking 2601

meituan/longcat-flash-thinking-2601
in
out
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Magistral Medium (latest)

mistral/magistral-medium
in $2.00/M
out $5.00/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Magistral Small

mistral/magistral-small
in $0.50/M
out $1.50/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Mercury 2

inception/mercury-2
in $0.25/M
out $0.75/M
cache read $0.02/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Mercury Coder Small Beta

inception/mercury-coder-small
in $0.25/M
out $1.00/M
ctx: 32,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

MiMo M2.5

xiaomi/mimo-v2.5
in $0.14/M
out $0.28/M
cache read $0.00/M
ctx: 1,050,000 max out: 131,100 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

MiMo V2 Flash

xiaomi/mimo-v2-flash
in $0.10/M
out $0.30/M
cache read $0.01/M
ctx: 262,144 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

MiMo V2 Pro

xiaomi/mimo-v2-pro
in $1.00/M
out $3.00/M
cache read $0.20/M
ctx: 1,000,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

MiMo V2.5 Pro

xiaomi/mimo-v2.5-pro
in $0.43/M
out $0.87/M
cache read $0.00/M
ctx: 1,050,000 max out: 131,000 in: text, pdf out: text
reasoning tools vision structured temp open weights

MiniMax M2

minimax/minimax-m2
in $0.30/M
out $1.20/M
cache read $0.03/M
cache write $0.38/M
ctx: 205,000 max out: 205,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M2.1

minimax/minimax-m2.1
in $0.30/M
out $1.20/M
cache read $0.03/M
cache write $0.38/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M2.1 Lightning

minimax/minimax-m2.1-lightning
in $0.30/M
out $2.40/M
cache read $0.03/M
cache write $0.38/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M2.5

minimax/minimax-m2.5
in $0.30/M
out $1.20/M
cache read $0.03/M
cache write $0.38/M
ctx: 204,800 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M2.5 High Speed

minimax/minimax-m2.5-highspeed
in $0.60/M
out $2.40/M
cache read $0.03/M
cache write $0.38/M
ctx: 204,800 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

Minimax M2.7

minimax/minimax-m2.7
in $0.30/M
out $1.20/M
cache read $0.06/M
cache write $0.38/M
ctx: 204,800 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M2.7 High Speed

minimax/minimax-m2.7-highspeed
in $0.60/M
out $2.40/M
cache read $0.06/M
cache write $0.38/M
ctx: 204,800 max out: 131,100 in: text out: text
reasoning tools vision structured temp open weights

MiniMax M3

minimax/minimax-m3
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 1,000,000 max out: 1,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Ministral 14B

mistral/ministral-14b
in $0.20/M
out $0.20/M
ctx: 256,000 max out: 256,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Ministral 3B (latest)

mistral/ministral-3b
in $0.04/M
out $0.04/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Ministral 8B (latest)

mistral/ministral-8b
in $0.10/M
out $0.10/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Mistral Embed

mistral/mistral-embed
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Mistral Large 3

mistral/mistral-large-3
in $0.50/M
out $1.50/M
ctx: 256,000 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral Medium 3.1

mistral/mistral-medium
in $0.40/M
out $2.00/M
ctx: 128,000 max out: 64,000 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral Medium Latest

mistral/mistral-medium-3.5
in $1.50/M
out $7.50/M
ctx: 256,000 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral Nemo

mistral/mistral-nemo
in $0.15/M
out $0.15/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Mistral Small (latest)

mistral/mistral-small
in $0.10/M
out $0.30/M
ctx: 32,000 max out: 4,000 in: text, image out: text
reasoning tools vision structured temp open weights

Morph v3 Fast

morph/morph-v3-fast
in $0.80/M
out $1.20/M
ctx: 16,000 max out: 16,000 in: text out: text
reasoning tools vision structured temp open weights

Morph v3 Large

morph/morph-v3-large
in $0.90/M
out $1.90/M
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Nano Banana (Gemini 2.5 Flash Image)

google/gemini-2.5-flash-image
in $0.30/M
out $2.50/M
cache read $0.03/M
ctx: 32,768 max out: 65,536 in: text out: text, image
reasoning tools vision structured temp open weights

Nano Banana Pro (Gemini 3 Pro Image)

google/gemini-3-pro-image
in $2.00/M
out $12.00/M
cache read $0.20/M
ctx: 65,536 max out: 32,768 in: text out: text, image
reasoning tools vision structured temp open weights

Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b
in $0.05/M
out $0.24/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Nemotron 3 Ultra

nvidia/nemotron-3-ultra-550b-a55b
in $0.60/M
out $2.40/M
cache read $0.12/M
ctx: 1,000,000 max out: 65,000 in: text out: text
reasoning tools vision structured temp open weights

Nova 2 Lite

amazon/nova-2-lite
in $0.30/M
out $2.50/M
cache read $0.07/M
ctx: 1,000,000 max out: 1,000,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Nova Lite

amazon/nova-lite
in $0.06/M
out $0.24/M
cache read $0.01/M
ctx: 300,000 max out: 8,192 in: text, image, video out: text
reasoning tools vision structured temp open weights

Nova Micro

amazon/nova-micro
in $0.04/M
out $0.14/M
cache read $0.01/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Nova Pro

amazon/nova-pro
in $0.80/M
out $3.20/M
cache read $0.20/M
ctx: 300,000 max out: 8,192 in: text, image, video out: text
reasoning tools vision structured temp open weights

NVIDIA Nemotron 3 Super 120B A12B

nvidia/nemotron-3-super-120b-a12b
in $0.15/M
out $0.65/M
ctx: 256,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Nvidia Nemotron Nano 12B V2 VL

nvidia/nemotron-nano-12b-v2-vl
in $0.20/M
out $0.60/M
ctx: 131,072 max out: 131,072 in: text, image out: text
reasoning tools vision structured temp open weights

Nvidia Nemotron Nano 9B V2

nvidia/nemotron-nano-9b-v2
in $0.06/M
out $0.23/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

o1

openai/o1
in $15.00/M
out $60.00/M
cache read $7.50/M
ctx: 200,000 max out: 100,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

o3

openai/o3
in $2.00/M
out $8.00/M
cache read $0.50/M
ctx: 200,000 max out: 100,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

o3 Pro

openai/o3-pro
in $20.00/M
out $80.00/M
ctx: 200,000 max out: 100,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

o3-deep-research

openai/o3-deep-research
in $10.00/M
out $40.00/M
cache read $2.50/M
ctx: 200,000 max out: 100,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

o3-mini

openai/o3-mini
in $1.10/M
out $4.40/M
cache read $0.55/M
ctx: 200,000 max out: 100,000 in: text out: text
reasoning tools vision structured temp open weights

o4-mini

openai/o4-mini
in $1.10/M
out $4.40/M
cache read $0.28/M
ctx: 200,000 max out: 100,000 in: text, image out: text
reasoning tools vision structured temp open weights

Pixtral 12B

mistral/pixtral-12b
in $0.15/M
out $0.15/M
ctx: 128,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

Pixtral Large (latest)

mistral/pixtral-large
in $2.00/M
out $6.00/M
ctx: 128,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen 3 Coder 30B A3B Instruct

alibaba/qwen3-coder-30b-a3b
in $0.15/M
out $0.60/M
ctx: 262,144 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3 Max Thinking

alibaba/qwen3-max-thinking
in $1.20/M
out $6.00/M
cache read $0.24/M
ctx: 256,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3.32B

alibaba/qwen-3-32b
in $0.16/M
out $0.64/M
ctx: 128,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3.5 Flash

alibaba/qwen3.5-flash
in $0.10/M
out $0.40/M
cache read $0.00/M
cache write $0.13/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen 3.5 Plus

alibaba/qwen3.5-plus
in $0.40/M
out $2.40/M
cache read $0.04/M
cache write $0.50/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen 3.6 27B

alibaba/qwen3.6-27b
in $0.60/M
out $3.60/M
ctx: 256,000 max out: 256,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen 3.6 Max Preview

alibaba/qwen-3.6-max-preview
in $1.30/M
out $7.80/M
cache read $0.26/M
cache write $1.63/M
ctx: 240,000 max out: 64,000 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3.6 Plus

alibaba/qwen3.6-plus
in $0.50/M
out $3.00/M
cache read $0.10/M
cache write $0.63/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen 3.7 Max

alibaba/qwen3.7-max
in $1.25/M
out $3.75/M
cache read $0.25/M
cache write $1.56/M
ctx: 991,000 max out: 64,000 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3.7 Plus

alibaba/qwen3.7-plus
in $0.40/M
out $1.60/M
cache read $0.08/M
cache write $0.50/M
ctx: 1,000,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen3 235B A22B Instruct 2507

alibaba/qwen-3-235b
in $0.22/M
out $0.88/M
ctx: 262,144 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 235B A22B Thinking 2507

alibaba/qwen3-235b-a22b-thinking
in $0.40/M
out $4.00/M
ctx: 131,072 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen3 Coder 480B A35B Instruct

alibaba/qwen3-coder
in $1.50/M
out $7.50/M
cache read $0.30/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Coder Next

alibaba/qwen3-coder-next
in $0.50/M
out $1.20/M
ctx: 256,000 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Coder Plus

alibaba/qwen3-coder-plus
in $1.00/M
out $5.00/M
cache read $0.20/M
ctx: 1,000,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Embedding 0.6B

alibaba/qwen3-embedding-0.6b
in
out
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Embedding 4B

alibaba/qwen3-embedding-4b
in
out
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Embedding 8B

alibaba/qwen3-embedding-8b
in
out
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Max

alibaba/qwen3-max
in $1.20/M
out $6.00/M
cache read $0.24/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Max Preview

alibaba/qwen3-max-preview
in $1.20/M
out $6.00/M
cache read $0.24/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Next 80B A3B Instruct

alibaba/qwen3-next-80b-a3b-instruct
in $0.15/M
out $1.20/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 Next 80B A3B Thinking

alibaba/qwen3-next-80b-a3b-thinking
in $0.15/M
out $1.20/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 VL 235B A22B Instruct

alibaba/qwen3-vl-235b-a22b-instruct
in $0.40/M
out $1.60/M
ctx: 131,072 max out: 129,024 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3 VL Instruct

alibaba/qwen3-vl-instruct
in $0.40/M
out $1.60/M
ctx: 131,072 max out: 129,024 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3 VL Thinking

alibaba/qwen3-vl-thinking
in $0.40/M
out $4.00/M
ctx: 131,072 max out: 32,768 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Qwen3-14B

alibaba/qwen-3-14b
in $0.12/M
out $0.24/M
ctx: 40,960 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-30B-A3B

alibaba/qwen-3-30b
in $0.12/M
out $0.50/M
ctx: 40,960 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Recraft V2

recraft/recraft-v2
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V3

recraft/recraft-v3
in
out
ctx: 512 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4

recraft/recraft-v4
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4 Pro

recraft/recraft-v4-pro
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4.1

recraft/recraft-v4.1
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4.1 Pro

recraft/recraft-v4.1-pro
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4.1 Utility

recraft/recraft-v4.1-utility
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Recraft V4.1 Utility Pro

recraft/recraft-v4.1-utility-pro
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Seed 1.6

bytedance/seed-1.6
in $0.25/M
out $2.00/M
cache read $0.05/M
ctx: 256,000 max out: 32,000 in: text, image out: text
reasoning tools vision structured temp open weights

Seed 1.8

bytedance/seed-1.8
in $0.25/M
out $2.00/M
cache read $0.05/M
ctx: 256,000 max out: 64,000 in: text, image out: text
reasoning tools vision structured temp open weights

Seedance 2.0

bytedance/seedance-2.0
in
out
ctx: 0 max out: 0 in: text, image out: video
reasoning tools vision structured temp open weights

Seedance 2.0 Fast

bytedance/seedance-2.0-fast
in
out
ctx: 0 max out: 0 in: text, image out: video
reasoning tools vision structured temp open weights

Seedance v1.0 Pro

bytedance/seedance-v1.0-pro
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Seedance v1.0 Pro Fast

bytedance/seedance-v1.0-pro-fast
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Seedance v1.5 Pro

bytedance/seedance-v1.5-pro
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Seedream 4.0

bytedance/seedream-4.0
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Seedream 4.5

bytedance/seedream-4.5
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Seedream 5.0 Lite

bytedance/seedream-5.0-lite
in
out
ctx: 0 max out: 0 in: text out: image
reasoning tools vision structured temp open weights

Sonar

perplexity/sonar
in
out
ctx: 127,000 max out: 8,000 in: text, image out: text
reasoning tools vision structured temp open weights

Sonar Pro

perplexity/sonar-pro
in
out
ctx: 200,000 max out: 8,000 in: text, image out: text
reasoning tools vision structured temp open weights

Sonar Reasoning Pro

perplexity/sonar-reasoning-pro
in
out
ctx: 127,000 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

Step 3.7 Flash

stepfun/step-3.7-flash
in $0.20/M
out $1.15/M
cache read $0.04/M
ctx: 256,000 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

StepFun 3.5 Flash

stepfun/step-3.5-flash
in $0.09/M
out $0.30/M
cache read $0.02/M
ctx: 262,114 max out: 262,114 in: text out: text
reasoning tools vision structured temp open weights

Text Embedding 005

google/text-embedding-005
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Text Multilingual Embedding 002

google/text-multilingual-embedding-002
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

text-embedding-3-large

openai/text-embedding-3-large
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

text-embedding-3-small

openai/text-embedding-3-small
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

text-embedding-ada-002

openai/text-embedding-ada-002
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Titan Text Embeddings V2

amazon/titan-embed-text-v2
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Trinity Large Preview

arcee-ai/trinity-large-preview
in $0.25/M
out $1.00/M
ctx: 131,000 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

Trinity Large Thinking

arcee-ai/trinity-large-thinking
in $0.25/M
out $0.90/M
ctx: 262,100 max out: 80,000 in: text out: text
reasoning tools vision structured temp open weights

Trinity Mini

arcee-ai/trinity-mini
in $0.04/M
out $0.15/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

TTS-1

openai/tts-1
in
out
ctx: 0 max out: 0 in: text out: audio
reasoning tools vision structured temp open weights

TTS-1 HD

openai/tts-1-hd
in
out
ctx: 0 max out: 0 in: text out: audio
reasoning tools vision structured temp open weights

Veo 3.0

google/veo-3.0-generate-001
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Veo 3.0 Fast Generate

google/veo-3.0-fast-generate-001
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Veo 3.1

google/veo-3.1-generate-001
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Veo 3.1 Fast Generate

google/veo-3.1-fast-generate-001
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Voyage Rerank 2.5

voyage/rerank-2.5
in
out
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Voyage Rerank 2.5 Lite

voyage/rerank-2.5-lite
in
out
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

voyage-3-large

voyage/voyage-3-large
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-3.5

voyage/voyage-3.5
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-3.5-lite

voyage/voyage-3.5-lite
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-4

voyage/voyage-4
in
out
ctx: 32,000 max out: 0 in: text out: text
reasoning tools vision structured temp open weights

voyage-4-large

voyage/voyage-4-large
in
out
ctx: 32,000 max out: 0 in: text out: text
reasoning tools vision structured temp open weights

voyage-4-lite

voyage/voyage-4-lite
in
out
ctx: 32,000 max out: 0 in: text out: text
reasoning tools vision structured temp open weights

voyage-code-2

voyage/voyage-code-2
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-code-3

voyage/voyage-code-3
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-finance-2

voyage/voyage-finance-2
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

voyage-law-2

voyage/voyage-law-2
in
out
ctx: 8,192 max out: 1,536 in: text out: text
reasoning tools vision structured temp open weights

Wan v2.5 Text-to-Video Preview

alibaba/wan-v2.5-t2v-preview
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Wan v2.6 Image-to-Video

alibaba/wan-v2.6-i2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Wan v2.6 Image-to-Video Flash

alibaba/wan-v2.6-i2v-flash
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Wan v2.6 Reference-to-Video

alibaba/wan-v2.6-r2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Wan v2.6 Reference-to-Video Flash

alibaba/wan-v2.6-r2v-flash
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Wan v2.6 Text-to-Video

alibaba/wan-v2.6-t2v
in
out
ctx: 0 max out: 0 in: text out: video
reasoning tools vision structured temp open weights

Whisper

openai/whisper-1
in
out
ctx: 0 max out: 0 in: audio out: text
reasoning tools vision structured temp open weights