Hugging Face

[PROVIDER]
id: huggingface
npm: @ai-sdk/openai-compatible
env: HF_TOKEN
api: https://router.huggingface.co/v1

Models

DeepSeek V4 Flash

deepseek-ai/DeepSeek-V4-Flash
in $0.14/M
out $0.28/M
ctx: 1,048,576 max out: 384,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek V4 Pro

deepseek-ai/DeepSeek-V4-Pro
in $0.43/M
out $0.87/M
cache read $0.00/M
ctx: 1,048,576 max out: 393,216 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek-R1

deepseek-ai/DeepSeek-R1
in $0.70/M
out $2.50/M
ctx: 64,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek-R1-0528

deepseek-ai/DeepSeek-R1-0528
in $3.00/M
out $5.00/M
ctx: 163,840 max out: 163,840 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek-V3.2

deepseek-ai/DeepSeek-V3.2
in $0.28/M
out $0.40/M
ctx: 163,840 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Gemma 4 26B A4B IT

google/gemma-4-26B-A4B-it
in $0.13/M
out $0.40/M
ctx: 262,144 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

Gemma 4 31B IT

google/gemma-4-31B-it
in $0.14/M
out $0.40/M
ctx: 262,144 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

GLM-4.5

zai-org/GLM-4.5
in $0.60/M
out $2.20/M
ctx: 131,072 max out: 98,304 in: text out: text
reasoning tools vision structured temp open weights

GLM-4.5-Air

zai-org/GLM-4.5-Air
in $0.13/M
out $0.85/M
ctx: 131,072 max out: 98,304 in: text out: text
reasoning tools vision structured temp open weights

GLM-4.5V

zai-org/GLM-4.5V
in $0.60/M
out $1.80/M
ctx: 65,536 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

GLM-4.6

zai-org/GLM-4.6
in $0.55/M
out $2.20/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GLM-4.7

zai-org/GLM-4.7
in $0.60/M
out $2.20/M
cache read $0.11/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GLM-4.7-Flash

zai-org/GLM-4.7-Flash
in $0.00/M
out $0.00/M
ctx: 200,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

GLM-5

zai-org/GLM-5
in $1.00/M
out $3.20/M
cache read $0.20/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GLM-5.1

zai-org/GLM-5.1
in $1.00/M
out $3.20/M
cache read $0.20/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

GLM-5.2

zai-org/GLM-5.2
in $1.40/M
out $4.40/M
ctx: 262,144 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Kimi K2.7 Code

moonshotai/Kimi-K2.7-Code
in $0.95/M
out $4.00/M
ctx: 262,144 max out: 262,144 in: text, image out: text
reasoning tools vision structured temp open weights

Kimi-K2-Instruct

moonshotai/Kimi-K2-Instruct
in $1.00/M
out $3.00/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Kimi-K2-Instruct-0905

moonshotai/Kimi-K2-Instruct-0905
in $1.00/M
out $3.00/M
ctx: 262,144 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Kimi-K2-Thinking

moonshotai/Kimi-K2-Thinking
in $0.60/M
out $2.50/M
cache read $0.15/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Kimi-K2.5

moonshotai/Kimi-K2.5
in $0.60/M
out $3.00/M
cache read $0.10/M
ctx: 262,144 max out: 262,144 in: text, image, video out: text
reasoning tools vision structured temp open weights

Kimi-K2.6

moonshotai/Kimi-K2.6
in $0.95/M
out $4.00/M
cache read $0.16/M
ctx: 262,144 max out: 262,144 in: text, image, video out: text
reasoning tools vision structured temp open weights

Llama-3.3-70B-Instruct

meta-llama/Llama-3.3-70B-Instruct
in $0.59/M
out $0.79/M
ctx: 131,072 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

MiMo-V2-Flash

XiaomiMiMo/MiMo-V2-Flash
in $0.10/M
out $0.30/M
ctx: 262,144 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

MiMo-V2.5-Pro

XiaomiMiMo/MiMo-V2.5-Pro
in $1.00/M
out $3.00/M
ctx: 1,048,576 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax-M2

MiniMaxAI/MiniMax-M2
in $0.30/M
out $1.20/M
ctx: 204,800 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax-M2.1

MiniMaxAI/MiniMax-M2.1
in $0.30/M
out $1.20/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax-M2.5

MiniMaxAI/MiniMax-M2.5
in $0.30/M
out $1.20/M
cache read $0.03/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax-M2.7

MiniMaxAI/MiniMax-M2.7
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax-M3

MiniMaxAI/MiniMax-M3
in $0.30/M
out $1.20/M
ctx: 524,288 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen 3 Embedding 4B

Qwen/Qwen3-Embedding-4B
in $0.01/M
out $0.00/M
ctx: 32,000 max out: 2,048 in: text out: text
reasoning tools vision structured temp open weights

Qwen 3 Embedding 8B

Qwen/Qwen3-Embedding-8B
in $0.01/M
out $0.00/M
ctx: 32,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 235B-A22B

Qwen/Qwen3-235B-A22B
in $0.20/M
out $0.80/M
ctx: 40,960 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Qwen3 32B

Qwen/Qwen3-32B
in $0.29/M
out $0.59/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-235B-A22B-Thinking-2507

Qwen/Qwen3-235B-A22B-Thinking-2507
in $0.30/M
out $3.00/M
ctx: 262,144 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-Coder 30B-A3B Instruct

Qwen/Qwen3-Coder-30B-A3B-Instruct
in $0.07/M
out $0.26/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-Coder-480B-A35B-Instruct

Qwen/Qwen3-Coder-480B-A35B-Instruct
in $2.00/M
out $2.00/M
ctx: 262,144 max out: 66,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-Coder-Next

Qwen/Qwen3-Coder-Next
in $0.20/M
out $1.50/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-Next-80B-A3B-Instruct

Qwen/Qwen3-Next-80B-A3B-Instruct
in $0.25/M
out $1.00/M
ctx: 262,144 max out: 66,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen3-Next-80B-A3B-Thinking

Qwen/Qwen3-Next-80B-A3B-Thinking
in $0.30/M
out $2.00/M
ctx: 262,144 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Qwen3.5 122B-A10B

Qwen/Qwen3.5-122B-A10B
in $0.40/M
out $3.20/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.5 27B

Qwen/Qwen3.5-27B
in $0.30/M
out $2.40/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.5 35B-A3B

Qwen/Qwen3.5-35B-A3B
in $0.25/M
out $2.00/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.5 9B

Qwen/Qwen3.5-9B
in $0.17/M
out $0.25/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.5-397B-A17B

Qwen/Qwen3.5-397B-A17B
in $0.60/M
out $3.60/M
ctx: 262,144 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.6 27B

Qwen/Qwen3.6-27B
in $0.47/M
out $3.19/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen3.6 35B-A3B

Qwen/Qwen3.6-35B-A3B
in $0.15/M
out $0.95/M
ctx: 262,144 max out: 65,536 in: text, image out: text
reasoning tools vision structured temp open weights

Step 3.5 Flash

stepfun-ai/Step-3.5-Flash
in $0.10/M
out $0.30/M
ctx: 262,144 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

Step 3.7 Flash

stepfun-ai/Step-3.7-Flash
in $0.20/M
out $1.15/M
ctx: 262,144 max out: 256,000 in: text, image out: text
reasoning tools vision structured temp open weights