Cloudflare Workers AI

[PROVIDER]
id: cloudflare-workers-ai
npm: @ai-sdk/openai-compatible
env: CLOUDFLARE_ACCOUNT_ID, CLOUDFLARE_API_KEY
api: https://api.cloudflare.com/client/v4/accounts/${CLOUDFLARE_ACCOUNT_ID}/ai/v1

Models

Deepseek R1 Distill Qwen 32B

@cf/deepseek-ai/deepseek-r1-distill-qwen-32b
in $0.50/M
out $4.88/M
ctx: 80,000 max out: 80,000 in: text out: text
reasoning tools vision structured temp open weights

Gemma 4 26B A4B IT

@cf/google/gemma-4-26b-a4b-it
in $0.10/M
out $0.30/M
ctx: 256,000 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Gemma Sea Lion V4 27B It

@cf/aisingapore/gemma-sea-lion-v4-27b-it
in $0.35/M
out $0.56/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Glm 5.2

@cf/zai-org/glm-5.2
in $1.40/M
out $4.40/M
cache read $0.26/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

GLM-4.7-Flash

@cf/zai-org/glm-4.7-flash
in $0.06/M
out $0.40/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GPT OSS 120B

@cf/openai/gpt-oss-120b
in $0.35/M
out $0.75/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

GPT OSS 20B

@cf/openai/gpt-oss-20b
in $0.20/M
out $0.30/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Granite 4.0 H Micro

@cf/ibm-granite/granite-4.0-h-micro
in $0.02/M
out $0.11/M
ctx: 131,000 max out: 131,000 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2.6

@cf/moonshotai/kimi-k2.6
in $0.95/M
out $4.00/M
cache read $0.16/M
ctx: 262,144 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights

Kimi K2.7 Code

@cf/moonshotai/kimi-k2.7-code
in $0.95/M
out $4.00/M
cache read $0.19/M
ctx: 262,144 max out: 262,144 in: text, image out: text
reasoning tools vision structured temp open weights

Llama 3.1 8B Instruct fp8

@cf/meta/llama-3.1-8b-instruct-fp8
in $0.15/M
out $0.29/M
ctx: 32,000 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.2 11B Vision Instruct

@cf/meta/llama-3.2-11b-vision-instruct
in $0.05/M
out $0.68/M
ctx: 128,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

Llama 3.2 1B Instruct

@cf/meta/llama-3.2-1b-instruct
in $0.03/M
out $0.20/M
ctx: 60,000 max out: 60,000 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.2 3B Instruct

@cf/meta/llama-3.2-3b-instruct
in $0.05/M
out $0.34/M
ctx: 80,000 max out: 80,000 in: text out: text
reasoning tools vision structured temp open weights

Llama 3.3 70B Instruct fp8 Fast

@cf/meta/llama-3.3-70b-instruct-fp8-fast
in $0.29/M
out $2.25/M
ctx: 24,000 max out: 24,000 in: text out: text
reasoning tools vision structured temp open weights

Llama 4 Scout 17B 16E Instruct

@cf/meta/llama-4-scout-17b-16e-instruct
in $0.27/M
out $0.85/M
ctx: 131,000 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Llama Guard 3 8B

@cf/meta/llama-guard-3-8b
in $0.48/M
out $0.03/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Mistral Small 3.1 24B Instruct

@cf/mistralai/mistral-small-3.1-24b-instruct
in $0.35/M
out $0.56/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Nemotron 3 Super 120B

@cf/nvidia/nemotron-3-120b-a12b
in $0.50/M
out $1.50/M
ctx: 256,000 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

Qwen2.5 Coder 32B Instruct

@cf/qwen/qwen2.5-coder-32b-instruct
in $0.66/M
out $1.00/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 30B A3b fp8

@cf/qwen/qwen3-30b-a3b-fp8
in $0.05/M
out $0.34/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwq 32B

@cf/qwen/qwq-32b
in $0.66/M
out $1.00/M
ctx: 24,000 max out: 24,000 in: text out: text
reasoning tools vision structured temp open weights