Kilo Gateway

[PROVIDER]
id: kilo
npm: @ai-sdk/openai-compatible
env: KILO_API_KEY
api: https://api.kilo.ai/api/gateway
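
The provider fields above are enough to sketch a request against the gateway. The example below is a minimal illustration, not the gateway's documented API: it assumes the endpoint follows the OpenAI-compatible convention of a `/chat/completions` route with Bearer authentication, and `buildChatRequest`, `ChatMessage`, and `GatewayRequest` are hypothetical names introduced here. The key is passed in as a parameter; in practice it would come from the `KILO_API_KEY` environment variable.

```typescript
// Base URL copied from the `api:` field of the provider block.
const KILO_BASE_URL = "https://api.kilo.ai/api/gateway";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface GatewayRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// ASSUMPTION: the gateway exposes an OpenAI-style `/chat/completions` route
// and accepts the key as a Bearer token. Neither is stated in this listing.
function buildChatRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[],
): GatewayRequest {
  return {
    url: `${KILO_BASE_URL}/chat/completions`,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model, messages }),
  };
}

// The model ID is one of the identifiers listed under Models.
const request = buildChatRequest(
  "<value of KILO_API_KEY>",
  "anthropic/claude-haiku-4.5",
  [{ role: "user", content: "Hello" }],
);
```

The returned object can be handed to any HTTP client, for example `fetch(request.url, request)`. Keeping request construction separate from the network call makes the assumed route and headers easy to adjust if the gateway differs.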

Models

AI21: Jamba Large 1.7

ai21/jamba-large-1.7
in $2.00/M
out $8.00/M
ctx: 256,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights
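
Every entry in this list carries a limits line of the same shape: `ctx:`, `max out:`, `in:`, `out:`. The sketch below shows one way to turn such a line into a structured record. `parseLimitsLine` and `ModelLimits` are hypothetical names introduced for illustration, and the pattern is inferred from the lines in this listing rather than from any published format.

```typescript
interface ModelLimits {
  contextTokens: number;
  maxOutputTokens: number;
  inputModalities: string[];
  outputModalities: string[];
}

// Parses a line such as
// "ctx: 256,000 max out: 4,096 in: text out: text".
// Returns null when the line does not match the expected shape.
function parseLimitsLine(line: string): ModelLimits | null {
  const pattern = /^ctx: ([\d,]+) max out: ([\d,]+) in: (.+?) out: (.+)$/;
  const match = pattern.exec(line.trim());
  if (match === null) {
    return null;
  }
  // Token counts use thousands separators, so strip commas before parsing.
  const toInt = (text: string): number => parseInt(text.replace(/,/g, ""), 10);
  // Modalities are comma-separated, e.g. "image, pdf, text, video".
  const toList = (text: string): string[] =>
    text.split(",").map((part) => part.trim());
  return {
    contextTokens: toInt(match[1]),
    maxOutputTokens: toInt(match[2]),
    inputModalities: toList(match[3]),
    outputModalities: toList(match[4]),
  };
}

const jambaLimits = parseLimitsLine("ctx: 256,000 max out: 4,096 in: text out: text");
```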

AionLabs: Aion-1.0

aion-labs/aion-1.0
in $4.00/M
out $8.00/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

AionLabs: Aion-1.0-Mini

aion-labs/aion-1.0-mini
in $0.70/M
out $1.40/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

AionLabs: Aion-2.0

aion-labs/aion-2.0
in $0.80/M
out $1.60/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

AionLabs: Aion-RP 1.0 (8B)

aion-labs/aion-rp-llama-3.1-8b
in $0.80/M
out $1.60/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros/codellama-7b-instruct-solidity
in $0.80/M
out $1.20/M
ctx: 4,096 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

AllenAI: Olmo 3 32B Think

allenai/olmo-3-32b-think
in $0.15/M
out $0.50/M
ctx: 65,536 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Amazon: Nova 2 Lite

amazon/nova-2-lite-v1
in $0.30/M
out $2.50/M
ctx: 1,000,000 max out: 65,535 in: image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Amazon: Nova Lite 1.0

amazon/nova-lite-v1
in $0.06/M
out $0.24/M
ctx: 300,000 max out: 5,120 in: image, text out: text
reasoning tools vision structured temp open weights

Amazon: Nova Micro 1.0

amazon/nova-micro-v1
in $0.04/M
out $0.14/M
ctx: 128,000 max out: 5,120 in: text out: text
reasoning tools vision structured temp open weights

Amazon: Nova Premier 1.0

amazon/nova-premier-v1
in $2.50/M
out $12.50/M
ctx: 1,000,000 max out: 32,000 in: image, text out: text
reasoning tools vision structured temp open weights

Amazon: Nova Pro 1.0

amazon/nova-pro-v1
in $0.80/M
out $3.20/M
ctx: 300,000 max out: 5,120 in: text, image out: text
reasoning tools vision structured temp open weights

Anthropic: Claude 3 Haiku

anthropic/claude-3-haiku
in $0.25/M
out $1.25/M
cache read $0.03/M
cache write $0.30/M
ctx: 200,000 max out: 4,096 in: text, image out: text
reasoning tools vision structured temp open weights
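
Prices in this list are quoted in US dollars per million tokens (`/M`), with separate rates for cache reads and writes where a model has them. As a rough sketch of how those rates combine, the function below estimates the cost of a single request. `estimateCostUsd`, `Pricing`, and `Usage` are hypothetical names, and the billing model is an assumption: cached tokens are treated as billed separately from uncached input tokens, which this listing does not confirm.

```typescript
// All rates are USD per one million tokens, as quoted in the listing.
interface Pricing {
  input: number;
  output: number;
  cacheRead?: number;
  cacheWrite?: number;
}

interface Usage {
  inputTokens: number; // uncached input tokens
  outputTokens: number;
  cacheReadTokens?: number;
  cacheWriteTokens?: number;
}

// ASSUMPTION: each token class is billed independently at its own rate.
function estimateCostUsd(pricing: Pricing, usage: Usage): number {
  const perMillionTotal =
    usage.inputTokens * pricing.input +
    usage.outputTokens * pricing.output +
    (usage.cacheReadTokens ?? 0) * (pricing.cacheRead ?? 0) +
    (usage.cacheWriteTokens ?? 0) * (pricing.cacheWrite ?? 0);
  return perMillionTotal / 1_000_000;
}

// Rates taken from the anthropic/claude-3-haiku entry above.
const claude3Haiku: Pricing = {
  input: 0.25,
  output: 1.25,
  cacheRead: 0.03,
  cacheWrite: 0.3,
};

// 100,000 uncached input tokens, 2,000 output tokens, 400,000 cache-read tokens.
const estimatedCost = estimateCostUsd(claude3Haiku, {
  inputTokens: 100_000,
  outputTokens: 2_000,
  cacheReadTokens: 400_000,
});
// estimatedCost is approximately 0.0395 (USD)
```

For models without cache pricing the optional fields can simply be omitted, and the estimate reduces to the input and output terms.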

Anthropic: Claude 3.5 Haiku

anthropic/claude-3.5-haiku
in $0.80/M
out $4.00/M
cache read $0.08/M
cache write $1.00/M
ctx: 200,000 max out: 8,192 in: text, image out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Haiku 4.5

anthropic/claude-haiku-4.5
in $1.00/M
out $5.00/M
cache read $0.10/M
cache write $1.25/M
ctx: 200,000 max out: 64,000 in: image, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Haiku Latest

~anthropic/claude-haiku-latest
in $1.00/M
out $5.00/M
cache read $0.10/M
cache write $1.25/M
ctx: 200,000 max out: 64,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4

anthropic/claude-opus-4
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.1

anthropic/claude-opus-4.1
in $15.00/M
out $75.00/M
cache read $1.50/M
cache write $18.75/M
ctx: 200,000 max out: 32,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.5

anthropic/claude-opus-4.5
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 200,000 max out: 64,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.6

anthropic/claude-opus-4.6
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.6 (Fast)

anthropic/claude-opus-4.6-fast
in $30.00/M
out $150.00/M
cache read $3.00/M
cache write $37.50/M
ctx: 1,000,000 max out: 128,000 in: image, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.7

anthropic/claude-opus-4.7
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus 4.7 (Fast)

anthropic/claude-opus-4.7-fast
in $30.00/M
out $150.00/M
cache read $3.00/M
cache write $37.50/M
ctx: 1,000,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Opus Latest

~anthropic/claude-opus-latest
in $5.00/M
out $25.00/M
cache read $0.50/M
cache write $6.25/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Sonnet 4

anthropic/claude-sonnet-4
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 200,000 max out: 64,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Sonnet 4.5

anthropic/claude-sonnet-4.5
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 1,000,000 max out: 64,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Sonnet 4.6

anthropic/claude-sonnet-4.6
in $3.00/M
out $15.00/M
ctx: 1,000,000 max out: 128,000 in: image, text out: text
reasoning tools vision structured temp open weights

Anthropic: Claude Sonnet Latest

~anthropic/claude-sonnet-latest
in $3.00/M
out $15.00/M
cache read $0.30/M
cache write $3.75/M
ctx: 1,000,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

Arcee AI: Coder Large

arcee-ai/coder-large
in $0.50/M
out $0.80/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Arcee AI: Maestro Reasoning

arcee-ai/maestro-reasoning
in $0.90/M
out $3.30/M
ctx: 131,072 max out: 32,000 in: text out: text
reasoning tools vision structured temp open weights

Arcee AI: Spotlight

arcee-ai/spotlight
in $0.18/M
out $0.18/M
ctx: 131,072 max out: 65,537 in: image, text out: text
reasoning tools vision structured temp open weights

Arcee AI: Trinity Large Thinking

arcee-ai/trinity-large-thinking
in $0.22/M
out $0.85/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Arcee AI: Trinity Mini

arcee-ai/trinity-mini
in $0.04/M
out $0.15/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Arcee AI: Virtuoso Large

arcee-ai/virtuoso-large
in $0.75/M
out $1.20/M
ctx: 131,072 max out: 64,000 in: text out: text
reasoning tools vision structured temp open weights

Auto Router

openrouter/auto
in $0.00/M
out $0.00/M
ctx: 2,000,000 max out: 32,768 in: audio, image, pdf, text, video out: image, text
reasoning tools vision structured temp open weights

Baidu: CoBuddy (free)

baidu/cobuddy:free
in $0.00/M
out $0.00/M
ctx: 131,072 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Baidu: ERNIE 4.5 21B A3B

baidu/ernie-4.5-21b-a3b
in $0.07/M
out $0.28/M
ctx: 120,000 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

Baidu: ERNIE 4.5 21B A3B Thinking

baidu/ernie-4.5-21b-a3b-thinking
in $0.07/M
out $0.28/M
ctx: 131,072 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Baidu: ERNIE 4.5 300B A47B

baidu/ernie-4.5-300b-a47b
in $0.28/M
out $1.10/M
ctx: 123,000 max out: 12,000 in: text out: text
reasoning tools vision structured temp open weights

Baidu: ERNIE 4.5 VL 28B A3B

baidu/ernie-4.5-vl-28b-a3b
in $0.14/M
out $0.56/M
ctx: 30,000 max out: 8,000 in: text, image out: text
reasoning tools vision structured temp open weights

Baidu: ERNIE 4.5 VL 424B A47B

baidu/ernie-4.5-vl-424b-a47b
in $0.42/M
out $1.25/M
ctx: 123,000 max out: 16,000 in: image, text out: text
reasoning tools vision structured temp open weights

Baidu: Qianfan-OCR-Fast

baidu/qianfan-ocr-fast
in $0.68/M
out $2.81/M
ctx: 65,536 max out: 28,672 in: image, text out: text
reasoning tools vision structured temp open weights

Body Builder (beta)

openrouter/bodybuilder
in $0.00/M
out $0.00/M
ctx: 128,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights beta

ByteDance Seed: Seed 1.6

bytedance-seed/seed-1.6
in $0.25/M
out $2.00/M
ctx: 262,144 max out: 32,768 in: image, text, video out: text
reasoning tools vision structured temp open weights

ByteDance Seed: Seed 1.6 Flash

bytedance-seed/seed-1.6-flash
in $0.07/M
out $0.30/M
ctx: 262,144 max out: 32,768 in: image, text, video out: text
reasoning tools vision structured temp open weights

ByteDance Seed: Seed-2.0-Lite

bytedance-seed/seed-2.0-lite
in $0.25/M
out $2.00/M
ctx: 262,144 max out: 131,072 in: image, text, video out: text
reasoning tools vision structured temp open weights

ByteDance Seed: Seed-2.0-Mini

bytedance-seed/seed-2.0-mini
in $0.10/M
out $0.40/M
ctx: 262,144 max out: 131,072 in: image, text, video out: text
reasoning tools vision structured temp open weights

ByteDance: UI-TARS 7B

bytedance/ui-tars-1.5-7b
in $0.10/M
out $0.20/M
ctx: 128,000 max out: 2,048 in: image, text out: text
reasoning tools vision structured temp open weights

Cohere: Command A

cohere/command-a
in $2.50/M
out $10.00/M
ctx: 256,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Cohere: Command R (08-2024)

cohere/command-r-08-2024
in $0.15/M
out $0.60/M
ctx: 128,000 max out: 4,000 in: text out: text
reasoning tools vision structured temp open weights

Cohere: Command R+ (08-2024)

cohere/command-r-plus-08-2024
in $2.50/M
out $10.00/M
ctx: 128,000 max out: 4,000 in: text out: text
reasoning tools vision structured temp open weights

Cohere: Command R7B (12-2024)

cohere/command-r7b-12-2024
in $0.04/M
out $0.15/M
ctx: 128,000 max out: 4,000 in: text out: text
reasoning tools vision structured temp open weights

Deep Cogito: Cogito v2.1 671B

deepcogito/cogito-v2.1-671b
in $1.25/M
out $1.25/M
ctx: 128,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3

deepseek/deepseek-chat
in $0.32/M
out $0.89/M
cache read $0.15/M
ctx: 163,840 max out: 163,840 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3 0324

deepseek/deepseek-chat-v3-0324
in $0.20/M
out $0.77/M
cache read $0.10/M
ctx: 163,840 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3.1

deepseek/deepseek-chat-v3.1
in $0.15/M
out $0.75/M
ctx: 32,768 max out: 7,168 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus
in $0.21/M
out $0.79/M
cache read $0.13/M
ctx: 163,840 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3.2

deepseek/deepseek-v3.2
in $0.26/M
out $0.38/M
cache read $0.13/M
ctx: 163,840 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3.2 Exp

deepseek/deepseek-v3.2-exp
in $0.27/M
out $0.41/M
ctx: 163,840 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V3.2 Speciale

deepseek/deepseek-v3.2-speciale
in $0.40/M
out $1.20/M
cache read $0.14/M
ctx: 163,840 max out: 163,840 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V4 Flash

deepseek/deepseek-v4-flash
in $0.14/M
out $0.28/M
cache read $0.00/M
ctx: 1,048,576 max out: 384,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: DeepSeek V4 Pro

deepseek/deepseek-v4-pro
in $0.43/M
out $0.87/M
cache read $0.00/M
ctx: 1,048,576 max out: 384,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: R1

deepseek/deepseek-r1
in $0.70/M
out $2.50/M
ctx: 64,000 max out: 16,000 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: R1 0528

deepseek/deepseek-r1-0528
in $0.45/M
out $2.15/M
cache read $0.20/M
ctx: 163,840 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: R1 Distill Llama 70B

deepseek/deepseek-r1-distill-llama-70b
in $0.70/M
out $0.80/M
cache read $0.01/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

DeepSeek: R1 Distill Qwen 32B

deepseek/deepseek-r1-distill-qwen-32b
in $0.29/M
out $0.29/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

EssentialAI: Rnj 1 Instruct

essentialai/rnj-1-instruct
in $0.15/M
out $0.15/M
ctx: 32,768 max out: 6,554 in: text out: text
reasoning tools vision structured temp open weights

Free Models Router

openrouter/free
in $0.00/M
out $0.00/M
ctx: 200,000 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.0 Flash

google/gemini-2.0-flash-001
in $0.10/M
out $0.40/M
cache read $0.03/M
cache write $0.08/M
ctx: 1,048,576 max out: 8,192 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.0 Flash Lite

google/gemini-2.0-flash-lite-001
in $0.07/M
out $0.30/M
ctx: 1,048,576 max out: 8,192 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Flash

google/gemini-2.5-flash
in $0.30/M
out $2.50/M
reason $2.50/M
cache read $0.03/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,535 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite
in $0.10/M
out $0.40/M
reason $0.40/M
cache read $0.01/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,535 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Flash Lite Preview 09-2025

google/gemini-2.5-flash-lite-preview-09-2025
in $0.10/M
out $0.40/M
reason $0.40/M
cache read $0.01/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Pro

google/gemini-2.5-pro
in $1.25/M
out $10.00/M
reason $10.00/M
cache read $0.13/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Pro Preview 05-06

google/gemini-2.5-pro-preview-05-06
in $1.25/M
out $10.00/M
reason $10.00/M
cache read $0.13/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,535 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 2.5 Pro Preview 06-05

google/gemini-2.5-pro-preview
in $1.25/M
out $10.00/M
reason $10.00/M
cache read $0.13/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text out: text
reasoning tools vision structured temp open weights

Google: Gemini 3 Flash Preview

google/gemini-3-flash-preview
in $0.50/M
out $3.00/M
reason $3.00/M
cache read $0.05/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite
in $0.25/M
out $1.50/M
reason $1.50/M
cache read $0.03/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview
in $0.25/M
out $1.50/M
reason $1.50/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 3.1 Pro Preview

google/gemini-3.1-pro-preview
in $2.00/M
out $12.00/M
reason $12.00/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 3.1 Pro Preview Custom Tools

google/gemini-3.1-pro-preview-customtools
in $2.00/M
out $12.00/M
reason $12.00/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini 3.5 Flash

google/gemini-3.5-flash
in $1.50/M
out $9.00/M
reason $9.00/M
cache read $0.15/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,536 in: audio, image, pdf, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemini Flash Latest

~google/gemini-flash-latest
in $0.50/M
out $3.00/M
cache read $0.05/M
cache write $0.08/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Google: Gemini Pro Latest

~google/gemini-pro-latest
in $2.00/M
out $12.00/M
cache read $0.20/M
cache write $0.38/M
ctx: 1,048,576 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Google: Gemma 2 27B

google/gemma-2-27b-it
in $0.65/M
out $0.65/M
ctx: 8,192 max out: 2,048 in: text out: text
reasoning tools vision structured temp open weights

Google: Gemma 3 12B

google/gemma-3-12b-it
in $0.04/M
out $0.13/M
cache read $0.01/M
ctx: 131,072 max out: 131,072 in: image, text out: text
reasoning tools vision structured temp open weights

Google: Gemma 3 27B

google/gemma-3-27b-it
in $0.03/M
out $0.11/M
cache read $0.02/M
ctx: 128,000 max out: 65,536 in: image, text out: text
reasoning tools vision structured temp open weights

Google: Gemma 3 4B

google/gemma-3-4b-it
in $0.04/M
out $0.08/M
ctx: 131,072 max out: 19,200 in: image, text out: text
reasoning tools vision structured temp open weights

Google: Gemma 3n 4B

google/gemma-3n-e4b-it
in $0.02/M
out $0.04/M
ctx: 32,768 max out: 6,554 in: text out: text
reasoning tools vision structured temp open weights

Google: Gemma 4 26B A4B

google/gemma-4-26b-a4b-it
in $0.12/M
out $0.40/M
ctx: 262,144 max out: 262,144 in: image, text, video out: text
reasoning tools vision structured temp open weights

Google: Gemma 4 31B

google/gemma-4-31b-it
in $0.14/M
out $0.40/M
ctx: 262,144 max out: 131,072 in: image, text, video out: text
reasoning tools vision structured temp open weights

Google: Lyria 3 Clip Preview

google/lyria-3-clip-preview
in $0.00/M
out $0.00/M
ctx: 1,048,576 max out: 65,536 in: image, text out: audio, text
reasoning tools vision structured temp open weights

Google: Lyria 3 Pro Preview

google/lyria-3-pro-preview
in $0.00/M
out $0.00/M
ctx: 1,048,576 max out: 65,536 in: image, text out: audio, text
reasoning tools vision structured temp open weights

Google: Nano Banana (Gemini 2.5 Flash Image)

google/gemini-2.5-flash-image
in $0.30/M
out $2.50/M
ctx: 32,768 max out: 32,768 in: image, text out: image, text
reasoning tools vision structured temp open weights

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

google/gemini-3.1-flash-image-preview
in $0.50/M
out $3.00/M
ctx: 65,536 max out: 65,536 in: image, text out: image, text
reasoning tools vision structured temp open weights

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

google/gemini-3-pro-image-preview
in $2.00/M
out $12.00/M
reason $12.00/M
ctx: 65,536 max out: 32,768 in: image, text out: image, text
reasoning tools vision structured temp open weights

IBM: Granite 4.0 Micro

ibm-granite/granite-4.0-h-micro
in $0.02/M
out $0.11/M
ctx: 131,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

IBM: Granite 4.1 8B

ibm-granite/granite-4.1-8b
in $0.05/M
out $0.10/M
cache read $0.05/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Inception: Mercury 2

inception/mercury-2
in $0.25/M
out $0.75/M
cache read $0.03/M
ctx: 128,000 max out: 50,000 in: text out: text
reasoning tools vision structured temp open weights

inclusionAI: Ling-2.6 Flash

inclusionai/ling-2.6-flash
in $0.08/M
out $0.24/M
cache read $0.02/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

inclusionAI: Ling-2.6-1T

inclusionai/ling-2.6-1t
in $0.30/M
out $2.50/M
cache read $0.06/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

inclusionAI: Ring-2.6-1T

inclusionai/ring-2.6-1t
in $0.07/M
out $0.63/M
cache read $0.01/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Inflection: Inflection 3 Pi

inflection/inflection-3-pi
in $2.50/M
out $10.00/M
ctx: 8,000 max out: 1,024 in: text out: text
reasoning tools vision structured temp open weights

Inflection: Inflection 3 Productivity

inflection/inflection-3-productivity
in $2.50/M
out $10.00/M
ctx: 8,000 max out: 1,024 in: text out: text
reasoning tools vision structured temp open weights

Kilo Auto Balanced

kilo-auto/balanced
in $0.60/M
out $3.00/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Kilo Auto Free

kilo-auto/free
in $0.00/M
out $0.00/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Kilo Auto Frontier

kilo-auto/frontier
in $5.00/M
out $25.00/M
ctx: 1,000,000 max out: 128,000 in: image, text out: text
reasoning tools vision structured temp open weights

Kilo Auto Small

kilo-auto/small
in $0.05/M
out $0.40/M
ctx: 400,000 max out: 128,000 in: image, text out: text
reasoning tools vision structured temp open weights

Kwaipilot: KAT-Coder-Pro V2

kwaipilot/kat-coder-pro-v2
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 256,000 max out: 80,000 in: text out: text
reasoning tools vision structured temp open weights

LiquidAI: LFM2-24B-A2B

liquid/lfm-2-24b-a2b
in $0.03/M
out $0.12/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Llama Guard 3 8B

meta-llama/llama-guard-3-8b
in $0.02/M
out $0.06/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Magnum v4 72B

anthracite-org/magnum-v4-72b
in $3.00/M
out $5.00/M
ctx: 16,384 max out: 2,048 in: text out: text
reasoning tools vision structured temp open weights

Mancer: Weaver (alpha)

mancer/weaver
in $0.75/M
out $1.00/M
ctx: 8,000 max out: 2,000 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3 70B Instruct

meta-llama/llama-3-70b-instruct
in $0.51/M
out $0.74/M
ctx: 8,192 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3 8B Instruct

meta-llama/llama-3-8b-instruct
in $0.03/M
out $0.04/M
ctx: 8,192 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.1 70B Instruct

meta-llama/llama-3.1-70b-instruct
in $0.40/M
out $0.40/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.1 8B Instruct

meta-llama/llama-3.1-8b-instruct
in $0.02/M
out $0.05/M
ctx: 16,384 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.2 11B Vision Instruct

meta-llama/llama-3.2-11b-vision-instruct
in $0.05/M
out $0.05/M
ctx: 131,072 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.2 1B Instruct

meta-llama/llama-3.2-1b-instruct
in $0.03/M
out $0.20/M
ctx: 60,000 max out: 12,000 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct
in $0.05/M
out $0.34/M
ctx: 80,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 3.3 70B Instruct

meta-llama/llama-3.3-70b-instruct
in $0.10/M
out $0.32/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Meta: Llama 4 Maverick

meta-llama/llama-4-maverick
in $0.15/M
out $0.60/M
ctx: 1,048,576 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Meta: Llama 4 Scout

meta-llama/llama-4-scout
in $0.08/M
out $0.30/M
ctx: 327,680 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Meta: Llama Guard 4 12B

meta-llama/llama-guard-4-12b
in $0.18/M
out $0.18/M
ctx: 163,840 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Microsoft: Phi 4

microsoft/phi-4
in $0.06/M
out $0.14/M
ctx: 16,384 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Microsoft: Phi 4 Mini Instruct

microsoft/phi-4-mini-instruct
in $0.08/M
out $0.35/M
cache read $0.08/M
ctx: 128,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M1

minimax/minimax-m1
in $0.40/M
out $2.20/M
ctx: 1,000,000 max out: 40,000 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M2

minimax/minimax-m2
in $0.26/M
out $1.00/M
cache read $0.03/M
ctx: 196,608 max out: 196,608 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M2-her

minimax/minimax-m2-her
in $0.30/M
out $1.20/M
ctx: 65,536 max out: 2,048 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M2.1

minimax/minimax-m2.1
in $0.27/M
out $0.95/M
cache read $0.03/M
ctx: 196,608 max out: 39,322 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M2.5

minimax/minimax-m2.5
in $0.25/M
out $1.20/M
cache read $0.03/M
ctx: 196,608 max out: 196,608 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax M2.7

minimax/minimax-m2.7
in $0.30/M
out $1.20/M
cache read $0.06/M
ctx: 204,800 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MiniMax: MiniMax-01

minimax/minimax-01
in $0.20/M
out $1.10/M
ctx: 1,000,192 max out: 1,000,192 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral Large

mistralai/mistral-large
in $2.00/M
out $6.00/M
ctx: 128,000 max out: 25,600 in: text out: text
reasoning tools vision structured temp open weights

Mistral Large 2407

mistralai/mistral-large-2407
in $2.00/M
out $6.00/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Mistral Large 2411

mistralai/mistral-large-2411
in $2.00/M
out $6.00/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Codestral 2508

mistralai/codestral-2508
in $0.30/M
out $0.90/M
ctx: 256,000 max out: 51,200 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Devstral 2 2512

mistralai/devstral-2512
in $0.40/M
out $2.00/M
cache read $0.03/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Devstral Medium

mistralai/devstral-medium
in $0.40/M
out $2.00/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Devstral Small 1.1

mistralai/devstral-small
in $0.10/M
out $0.30/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Ministral 3 14B 2512

mistralai/ministral-14b-2512
in $0.20/M
out $0.20/M
ctx: 262,144 max out: 52,429 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral: Ministral 3 3B 2512

mistralai/ministral-3b-2512
in $0.10/M
out $0.10/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Ministral 3 8B 2512

mistralai/ministral-8b-2512
in $0.15/M
out $0.15/M
ctx: 262,144 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral 7B Instruct v0.1

mistralai/mistral-7b-instruct-v0.1
in $0.11/M
out $0.19/M
ctx: 2,824 max out: 565 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Large 3 2512

mistralai/mistral-large-2512
in $0.50/M
out $1.50/M
ctx: 262,144 max out: 52,429 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Medium 3

mistralai/mistral-medium-3
in $0.40/M
out $2.00/M
ctx: 131,072 max out: 26,215 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Medium 3.1

mistralai/mistral-medium-3.1
in $0.40/M
out $2.00/M
ctx: 131,072 max out: 26,215 in: text, image out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Medium 3.5

mistralai/mistral-medium-3-5
in $1.50/M
out $7.50/M
ctx: 262,144 max out: 262,144 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Nemo

mistralai/mistral-nemo
in $0.02/M
out $0.04/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Small 3

mistralai/mistral-small-24b-instruct-2501
in $0.05/M
out $0.08/M
ctx: 32,768 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Small 3.1 24B

mistralai/mistral-small-3.1-24b-instruct
in $0.35/M
out $0.56/M
cache read $0.01/M
ctx: 128,000 max out: 131,072 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Small 3.2 24B

mistralai/mistral-small-3.2-24b-instruct
in $0.06/M
out $0.18/M
cache read $0.03/M
ctx: 131,072 max out: 131,072 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Mistral Small 4

mistralai/mistral-small-2603
in $0.15/M
out $0.60/M
cache read $0.01/M
ctx: 262,144 max out: 262,144 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Mixtral 8x22B Instruct

mistralai/mixtral-8x22b-instruct
in $2.00/M
out $6.00/M
ctx: 65,536 max out: 13,108 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Pixtral Large 2411

mistralai/pixtral-large-2411
in $2.00/M
out $6.00/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Mistral: Saba

mistralai/mistral-saba
in $0.20/M
out $0.60/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Mistral: Voxtral Small 24B 2507

mistralai/voxtral-small-24b-2507
in $0.10/M
out $0.30/M
ctx: 32,000 max out: 6,400 in: text, audio out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi K2 0711

moonshotai/kimi-k2
in $0.55/M
out $2.20/M
ctx: 131,000 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi K2 0905

moonshotai/kimi-k2-0905
in $0.40/M
out $2.00/M
cache read $0.15/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi K2 Thinking

moonshotai/kimi-k2-thinking
in $0.47/M
out $2.00/M
cache read $0.20/M
ctx: 131,072 max out: 65,535 in: text out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi K2.5

moonshotai/kimi-k2.5
in $0.45/M
out $2.20/M
ctx: 262,144 max out: 65,535 in: image, text out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi K2.6

moonshotai/kimi-k2.6
in $0.75/M
out $3.50/M
cache read $0.38/M
ctx: 262,144 max out: 65,535 in: text, image out: text
reasoning tools vision structured temp open weights

MoonshotAI: Kimi Latest

~moonshotai/kimi-latest
in $0.74/M
out $3.49/M
cache read $0.14/M
ctx: 262,142 max out: 262,142 in: text, image out: text
reasoning tools vision structured temp open weights

Morph: Morph V3 Fast

morph/morph-v3-fast
in $0.80/M
out $1.20/M
ctx: 81,920 max out: 38,000 in: text out: text
reasoning tools vision structured temp open weights

Morph: Morph V3 Large

morph/morph-v3-large
in $0.90/M
out $1.90/M
ctx: 262,144 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

MythoMax 13B

gryphe/mythomax-l2-13b
in $0.06/M
out $0.06/M
ctx: 4,096 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Nex AGI: DeepSeek V3.1 Nex N1

nex-agi/deepseek-v3.1-nex-n1
in $0.27/M
out $1.00/M
ctx: 131,072 max out: 163,840 in: text out: text
reasoning tools vision structured temp open weights

Nous: Hermes 3 405B Instruct

nousresearch/hermes-3-llama-3.1-405b
in $1.00/M
out $1.00/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Nous: Hermes 3 70B Instruct

nousresearch/hermes-3-llama-3.1-70b
in $0.30/M
out $0.30/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Nous: Hermes 4 405B

nousresearch/hermes-4-405b
in $1.00/M
out $3.00/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

Nous: Hermes 4 70B

nousresearch/hermes-4-70b
in $0.13/M
out $0.40/M
cache read $0.06/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

NousResearch: Hermes 2 Pro - Llama-3 8B

nousresearch/hermes-2-pro-llama-3-8b
in $0.14/M
out $0.14/M
ctx: 8,192 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia/llama-3.3-nemotron-super-49b-v1.5
in $0.10/M
out $0.40/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b
in $0.05/M
out $0.20/M
ctx: 262,144 max out: 52,429 in: text out: text
reasoning tools vision structured temp open weights

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
in $0.00/M
out $0.00/M
ctx: 256,000 max out: 65,536 in: text, audio, image, video out: text
reasoning tools vision structured temp open weights

NVIDIA: Nemotron 3 Super

nvidia/nemotron-3-super-120b-a12b
in $0.10/M
out $0.50/M
cache read $0.10/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free
in $0.00/M
out $0.00/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

NVIDIA: Nemotron Nano 9B V2

nvidia/nemotron-nano-9b-v2
in $0.04/M
out $0.16/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT Audio

openai/gpt-audio
in $2.50/M
out $10.00/M
ctx: 128,000 max out: 16,384 in: audio, text out: audio, text
reasoning tools vision structured temp open weights

OpenAI: GPT Audio Mini

openai/gpt-audio-mini
in $0.60/M
out $2.40/M
ctx: 128,000 max out: 16,384 in: audio, text out: audio, text
reasoning tools vision structured temp open weights

OpenAI: GPT Chat Latest

openai/gpt-chat-latest
in $5.00/M
out $30.00/M
cache read $0.50/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT Latest

~openai/gpt-latest
in $5.00/M
out $30.00/M
cache read $0.50/M
ctx: 1,050,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

OpenAI: GPT Mini Latest

~openai/gpt-mini-latest
in $0.75/M
out $4.50/M
cache read $0.07/M
ctx: 400,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-3.5 Turbo

openai/gpt-3.5-turbo
in $0.50/M
out $1.50/M
ctx: 16,385 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-3.5 Turbo (older v0613)

openai/gpt-3.5-turbo-0613
in $1.00/M
out $2.00/M
ctx: 4,095 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-3.5 Turbo 16k

openai/gpt-3.5-turbo-16k
in $3.00/M
out $4.00/M
ctx: 16,385 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-3.5 Turbo Instruct

openai/gpt-3.5-turbo-instruct
in $1.50/M
out $2.00/M
ctx: 4,095 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4

openai/gpt-4
in $30.00/M
out $60.00/M
ctx: 8,191 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4 (older v0314)

openai/gpt-4-0314
in $30.00/M
out $60.00/M
ctx: 8,191 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4 Turbo

openai/gpt-4-turbo
in $10.00/M
out $30.00/M
ctx: 128,000 max out: 4,096 in: text, image out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4 Turbo (older v1106)

openai/gpt-4-1106-preview
in $10.00/M
out $30.00/M
ctx: 128,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4 Turbo Preview

openai/gpt-4-turbo-preview
in $10.00/M
out $30.00/M
ctx: 128,000 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4.1

openai/gpt-4.1
in $2.00/M
out $8.00/M
cache read $0.50/M
ctx: 1,047,576 max out: 32,768 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4.1 Mini

openai/gpt-4.1-mini
in $0.40/M
out $1.60/M
cache read $0.10/M
ctx: 1,047,576 max out: 32,768 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4.1 Nano

openai/gpt-4.1-nano
in $0.10/M
out $0.40/M
cache read $0.03/M
ctx: 1,047,576 max out: 32,768 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o

openai/gpt-4o
in $2.50/M
out $10.00/M
cache read $1.25/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o (2024-05-13)

openai/gpt-4o-2024-05-13
in $5.00/M
out $15.00/M
ctx: 128,000 max out: 4,096 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o (2024-08-06)

openai/gpt-4o-2024-08-06
in $2.50/M
out $10.00/M
cache read $1.25/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o (2024-11-20)

openai/gpt-4o-2024-11-20
in $2.50/M
out $10.00/M
cache read $1.25/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o Audio

openai/gpt-4o-audio-preview
in $2.50/M
out $10.00/M
ctx: 128,000 max out: 16,384 in: audio, text out: audio, text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o Search Preview

openai/gpt-4o-search-preview
in $2.50/M
out $10.00/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o-mini

openai/gpt-4o-mini
in $0.15/M
out $0.60/M
cache read $0.07/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o-mini (2024-07-18)

openai/gpt-4o-mini-2024-07-18
in $0.15/M
out $0.60/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-4o-mini Search Preview

openai/gpt-4o-mini-search-preview
in $0.15/M
out $0.60/M
ctx: 128,000 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5

openai/gpt-5
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Chat

openai/gpt-5-chat
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Codex

openai/gpt-5-codex
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Image

openai/gpt-5-image
in $10.00/M
out $10.00/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: image, text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Image Mini

openai/gpt-5-image-mini
in $2.50/M
out $2.00/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: image, text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Mini

openai/gpt-5-mini
in $0.25/M
out $2.00/M
cache read $0.03/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Nano

openai/gpt-5-nano
in $0.05/M
out $0.40/M
cache read $0.01/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5 Pro

openai/gpt-5-pro
in $15.00/M
out $120.00/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.1

openai/gpt-5.1
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.1 Chat

openai/gpt-5.1-chat
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.1-Codex

openai/gpt-5.1-codex
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.1-Codex-Max

openai/gpt-5.1-codex-max
in $1.25/M
out $10.00/M
cache read $0.13/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.1-Codex-Mini

openai/gpt-5.1-codex-mini
in $0.25/M
out $2.00/M
cache read $0.03/M
ctx: 400,000 max out: 100,000 in: image, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.2

openai/gpt-5.2
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.2 Chat

openai/gpt-5.2-chat
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.2 Pro

openai/gpt-5.2-pro
in $21.00/M
out $168.00/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.2-Codex

openai/gpt-5.2-codex
in $1.75/M
out $14.00/M
cache read $0.17/M
ctx: 400,000 max out: 128,000 in: text, image out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.3 Chat

openai/gpt-5.3-chat
in $1.75/M
out $14.00/M
ctx: 128,000 max out: 16,384 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.3-Codex

openai/gpt-5.3-codex
in $1.75/M
out $14.00/M
ctx: 400,000 max out: 128,000 in: image, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.4

openai/gpt-5.4
in $2.50/M
out $15.00/M
ctx: 1,050,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.4 Image 2

openai/gpt-5.4-image-2
in $8.00/M
out $15.00/M
cache read $2.00/M
ctx: 272,000 max out: 128,000 in: image, text, pdf out: image, text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.4 Mini

openai/gpt-5.4-mini
in $0.75/M
out $4.50/M
cache read $0.07/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.4 Nano

openai/gpt-5.4-nano
in $0.20/M
out $1.25/M
cache read $0.02/M
ctx: 400,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.4 Pro

openai/gpt-5.4-pro
in $30.00/M
out $180.00/M
ctx: 1,050,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.5

openai/gpt-5.5
in $5.00/M
out $30.00/M
cache read $0.50/M
ctx: 1,050,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

OpenAI: GPT-5.5 Pro

openai/gpt-5.5-pro
in $30.00/M
out $180.00/M
ctx: 1,050,000 max out: 128,000 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

OpenAI: gpt-oss-120b

openai/gpt-oss-120b
in $0.04/M
out $0.19/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: gpt-oss-20b

openai/gpt-oss-20b
in $0.03/M
out $0.14/M
ctx: 131,072 max out: 26,215 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: gpt-oss-safeguard-20b

openai/gpt-oss-safeguard-20b
in $0.07/M
out $0.30/M
cache read $0.04/M
ctx: 131,072 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

OpenAI: o1

openai/o1
in $15.00/M
out $60.00/M
cache read $7.50/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o1-pro

openai/o1-pro
in $150.00/M
out $600.00/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o3

openai/o3
in $2.00/M
out $8.00/M
cache read $0.50/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o3 Deep Research

openai/o3-deep-research
in $10.00/M
out $40.00/M
cache read $2.50/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o3 Mini

openai/o3-mini
in $1.10/M
out $4.40/M
cache read $0.55/M
ctx: 200,000 max out: 100,000 in: pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o3 Mini High

openai/o3-mini-high
in $1.10/M
out $4.40/M
cache read $0.55/M
ctx: 200,000 max out: 100,000 in: pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o3 Pro

openai/o3-pro
in $20.00/M
out $80.00/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o4 Mini

openai/o4-mini
in $1.10/M
out $4.40/M
cache read $0.28/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o4 Mini Deep Research

openai/o4-mini-deep-research
in $2.00/M
out $8.00/M
cache read $0.50/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

OpenAI: o4 Mini High

openai/o4-mini-high
in $1.10/M
out $4.40/M
ctx: 200,000 max out: 100,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Owl Alpha

openrouter/owl-alpha
in $0.00/M
out $0.00/M
ctx: 1,048,756 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights alpha

Pareto Code Router

openrouter/pareto-code
in $0.00/M
out $0.00/M
ctx: 200,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Perceptron: Perceptron Mk1

perceptron/perceptron-mk1
in $0.15/M
out $1.50/M
ctx: 32,768 max out: 8,192 in: image, text, video out: text
reasoning tools vision structured temp open weights

Perplexity: Sonar

perplexity/sonar
in $1.00/M
out $1.00/M
ctx: 127,072 max out: 25,415 in: text, image out: text
reasoning tools vision structured temp open weights

Perplexity: Sonar Deep Research

perplexity/sonar-deep-research
in $2.00/M
out $8.00/M
ctx: 128,000 max out: 25,600 in: text out: text
reasoning tools vision structured temp open weights

Perplexity: Sonar Pro

perplexity/sonar-pro
in $3.00/M
out $15.00/M
ctx: 200,000 max out: 8,000 in: text, image out: text
reasoning tools vision structured temp open weights

Perplexity: Sonar Pro Search

perplexity/sonar-pro-search
in $3.00/M
out $15.00/M
ctx: 200,000 max out: 8,000 in: image, text out: text
reasoning tools vision structured temp open weights

Perplexity: Sonar Reasoning Pro

perplexity/sonar-reasoning-pro
in $2.00/M
out $8.00/M
ctx: 128,000 max out: 25,600 in: text, image out: text
reasoning tools vision structured temp open weights

Poolside: Laguna M.1 (free)

poolside/laguna-m.1:free
in $0.00/M
out $0.00/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Poolside: Laguna XS.2 (free)

poolside/laguna-xs.2:free
in $0.00/M
out $0.00/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Prime Intellect: INTELLECT-3

prime-intellect/intellect-3
in $0.20/M
out $1.10/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen Plus 0728

qwen/qwen-plus-2025-07-28
in $0.26/M
out $0.78/M
ctx: 1,000,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen Plus 0728 (thinking)

qwen/qwen-plus-2025-07-28:thinking
in $0.26/M
out $0.78/M
ctx: 1,000,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen-Plus

qwen/qwen-plus
in $0.40/M
out $1.20/M
cache read $0.08/M
ctx: 1,000,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen2.5 7B Instruct

qwen/qwen-2.5-7b-instruct
in $0.04/M
out $0.10/M
ctx: 32,768 max out: 6,554 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen2.5 VL 72B Instruct

qwen/qwen2.5-vl-72b-instruct
in $0.80/M
out $0.80/M
cache read $0.07/M
ctx: 32,768 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 14B

qwen/qwen3-14b
in $0.06/M
out $0.24/M
cache read $0.03/M
ctx: 40,960 max out: 40,960 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 235B A22B

qwen/qwen3-235b-a22b
in $0.46/M
out $1.82/M
cache read $0.15/M
ctx: 131,072 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 235B A22B Instruct 2507

qwen/qwen3-235b-a22b-2507
in $0.07/M
out $0.10/M
ctx: 262,144 max out: 52,429 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 235B A22B Thinking 2507

qwen/qwen3-235b-a22b-thinking-2507
in $0.11/M
out $0.60/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 30B A3B

qwen/qwen3-30b-a3b
in $0.08/M
out $0.28/M
cache read $0.03/M
ctx: 40,960 max out: 40,960 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 30B A3B Instruct 2507

qwen/qwen3-30b-a3b-instruct-2507
in $0.09/M
out $0.30/M
cache read $0.04/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 30B A3B Thinking 2507

qwen/qwen3-30b-a3b-thinking-2507
in $0.05/M
out $0.34/M
ctx: 32,768 max out: 6,554 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 32B

qwen/qwen3-32b
in $0.08/M
out $0.24/M
cache read $0.04/M
ctx: 40,960 max out: 40,960 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 8B

qwen/qwen3-8b
in $0.05/M
out $0.40/M
cache read $0.05/M
ctx: 40,960 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Coder 30B A3B Instruct

qwen/qwen3-coder-30b-a3b-instruct
in $0.07/M
out $0.27/M
ctx: 160,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Coder 480B A35B

qwen/qwen3-coder
in $0.22/M
out $1.00/M
cache read $0.02/M
ctx: 262,144 max out: 52,429 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Coder Flash

qwen/qwen3-coder-flash
in $0.20/M
out $0.97/M
cache read $0.06/M
ctx: 1,000,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Coder Next

qwen/qwen3-coder-next
in $0.12/M
out $0.75/M
cache read $0.04/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Coder Plus

qwen/qwen3-coder-plus
in $0.65/M
out $3.25/M
cache read $0.20/M
ctx: 1,000,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Max

qwen/qwen3-max
in $1.20/M
out $6.00/M
cache read $0.24/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Max Thinking

qwen/qwen3-max-thinking
in $0.78/M
out $3.90/M
ctx: 262,144 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Next 80B A3B Instruct

qwen/qwen3-next-80b-a3b-instruct
in $0.09/M
out $1.10/M
ctx: 131,072 max out: 52,429 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 Next 80B A3B Thinking

qwen/qwen3-next-80b-a3b-thinking
in $0.10/M
out $0.78/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 235B A22B Instruct

qwen/qwen3-vl-235b-a22b-instruct
in $0.20/M
out $0.88/M
cache read $0.11/M
ctx: 262,144 max out: 52,429 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 235B A22B Thinking

qwen/qwen3-vl-235b-a22b-thinking
in $0.26/M
out $2.60/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 30B A3B Instruct

qwen/qwen3-vl-30b-a3b-instruct
in $0.13/M
out $0.52/M
ctx: 131,072 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 30B A3B Thinking

qwen/qwen3-vl-30b-a3b-thinking
in $0.13/M
out $1.56/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 32B Instruct

qwen/qwen3-vl-32b-instruct
in $0.10/M
out $0.42/M
ctx: 131,072 max out: 32,768 in: text, image out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct
in $0.08/M
out $0.50/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3 VL 8B Thinking

qwen/qwen3-vl-8b-thinking
in $0.12/M
out $1.36/M
ctx: 131,072 max out: 32,768 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5 397B A17B

qwen/qwen3.5-397b-a17b
in $0.39/M
out $2.34/M
ctx: 262,144 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5 Plus 2026-02-15

qwen/qwen3.5-plus-02-15
in $0.26/M
out $1.56/M
ctx: 1,000,000 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5 Plus 2026-04-20

qwen/qwen3.5-plus-20260420
in $0.40/M
out $2.40/M
ctx: 1,000,000 max out: 65,536 in: text, image, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5-122B-A10B

qwen/qwen3.5-122b-a10b
in $0.26/M
out $2.08/M
ctx: 262,144 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5-27B

qwen/qwen3.5-27b
in $0.20/M
out $1.56/M
ctx: 262,144 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5-35B-A3B

qwen/qwen3.5-35b-a3b
in $0.16/M
out $1.30/M
ctx: 262,144 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5-9B

qwen/qwen3.5-9b
in $0.05/M
out $0.15/M
ctx: 256,000 max out: 32,768 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.5-Flash

qwen/qwen3.5-flash-02-23
in $0.10/M
out $0.40/M
ctx: 1,000,000 max out: 65,536 in: image, text, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.6 27B

qwen/qwen3.6-27b
in $0.33/M
out $3.25/M
ctx: 256,000 max out: 65,536 in: text, image, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.6 35B A3B

qwen/qwen3.6-35b-a3b
in $0.16/M
out $0.97/M
cache read $0.16/M
ctx: 262,144 max out: 65,536 in: text, image, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash
in $0.25/M
out $1.50/M
cache write $0.31/M
ctx: 1,000,000 max out: 65,536 in: text, image, video out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.6 Max Preview

qwen/qwen3.6-max-preview
in $1.04/M
out $6.24/M
cache write $1.30/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.6 Plus

qwen/qwen3.6-plus
in $0.33/M
out $1.95/M
cache read $0.03/M
cache write $0.41/M
ctx: 1,000,000 max out: 65,536 in: image, text out: text
reasoning tools vision structured temp open weights

Qwen: Qwen3.7 Max

qwen/qwen3.7-max
in $1.63/M
out $4.88/M
cache read $0.16/M
cache write $2.03/M
ctx: 1,000,000 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Qwen2.5 72B Instruct

qwen/qwen-2.5-72b-instruct
in $0.12/M
out $0.39/M
ctx: 32,768 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Qwen2.5 Coder 32B Instruct

qwen/qwen-2.5-coder-32b-instruct
in $0.20/M
out $0.20/M
cache read $0.01/M
ctx: 32,768 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Reka Edge

rekaai/reka-edge
in $0.10/M
out $0.10/M
ctx: 16,384 max out: 16,384 in: image, text, video out: text
reasoning tools vision structured temp open weights

Reka Flash 3

rekaai/reka-flash-3
in $0.10/M
out $0.20/M
ctx: 65,536 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Relace: Relace Apply 3

relace/relace-apply-3
in $0.85/M
out $1.25/M
ctx: 256,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

Relace: Relace Search

relace/relace-search
in $1.00/M
out $3.00/M
ctx: 256,000 max out: 128,000 in: text out: text
reasoning tools vision structured temp open weights

ReMM SLERP 13B

undi95/remm-slerp-l2-13b
in $0.45/M
out $0.65/M
ctx: 6,144 max out: 4,096 in: text out: text
reasoning tools vision structured temp open weights

Sao10K: Llama 3 8B Lunaris

sao10k/l3-lunaris-8b
in $0.04/M
out $0.05/M
ctx: 8,192 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Sao10K: Llama 3 Euryale 70B v2.1

sao10k/l3-euryale-70b
in $1.48/M
out $1.48/M
ctx: 8,192 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

Sao10K: Llama 3.1 70B Hanami x1

sao10k/l3.1-70b-hanami-x1
in $3.00/M
out $3.00/M
ctx: 16,000 max out: 16,000 in: text out: text
reasoning tools vision structured temp open weights

Sao10K: Llama 3.1 Euryale 70B v2.2

sao10k/l3.1-euryale-70b
in $0.85/M
out $0.85/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Sao10K: Llama 3.3 Euryale 70B

sao10k/l3.3-euryale-70b
in $0.65/M
out $0.75/M
ctx: 131,072 max out: 16,384 in: text out: text
reasoning tools vision structured temp open weights

Stealth: Claude Opus 4.6 (20% off)

stealth/claude-opus-4.6
in $4.00/M
out $20.00/M
cache read $0.40/M
cache write $5.00/M
ctx: 1,000,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Stealth: Claude Opus 4.7 (20% off)

stealth/claude-opus-4.7
in $4.00/M
out $20.00/M
cache read $0.40/M
cache write $5.00/M
ctx: 1,000,000 max out: 128,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

Stealth: Claude Sonnet 4.6 (20% off)

stealth/claude-sonnet-4.6
in $2.40/M
out $12.00/M
cache read $0.24/M
cache write $3.00/M
ctx: 1,000,000 max out: 64,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

StepFun: Step 3.5 Flash

stepfun/step-3.5-flash
in $0.10/M
out $0.30/M
cache read $0.02/M
ctx: 256,000 max out: 256,000 in: text out: text
reasoning tools vision structured temp open weights

Switchpoint Router

switchpoint/router
in $0.85/M
out $3.40/M
ctx: 131,072 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Tencent: Hunyuan A13B Instruct

tencent/hunyuan-a13b-instruct
in $0.14/M
out $0.57/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Tencent: Hy3 Preview

tencent/hy3-preview
in $0.07/M
out $0.26/M
cache read $0.03/M
ctx: 262,144 max out: 262,144 in: text out: text
reasoning tools vision structured temp open weights

TheDrummer: Cydonia 24B V4.1

thedrummer/cydonia-24b-v4.1
in $0.30/M
out $0.50/M
ctx: 131,072 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

TheDrummer: Rocinante 12B

thedrummer/rocinante-12b
in $0.17/M
out $0.43/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

TheDrummer: Skyfall 36B V2

thedrummer/skyfall-36b-v2
in $0.55/M
out $0.80/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

TheDrummer: UnslopNemo 12B

thedrummer/unslopnemo-12b
in $0.40/M
out $0.40/M
ctx: 32,768 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Upstage: Solar Pro 3

upstage/solar-pro-3
in $0.15/M
out $0.60/M
ctx: 128,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

WizardLM-2 8x22B

microsoft/wizardlm-2-8x22b
in $0.62/M
out $0.62/M
ctx: 65,535 max out: 8,000 in: text out: text
reasoning tools vision structured temp open weights

Writer: Palmyra X5

writer/palmyra-x5
in $0.60/M
out $6.00/M
ctx: 1,040,000 max out: 8,192 in: text out: text
reasoning tools vision structured temp open weights

xAI: Grok 4.20

x-ai/grok-4.20
in $2.00/M
out $6.00/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

xAI: Grok 4.20 Multi-Agent

x-ai/grok-4.20-multi-agent
in $2.00/M
out $6.00/M
cache read $0.20/M
ctx: 2,000,000 max out: 2,000,000 in: image, pdf, text out: text
reasoning tools vision structured temp open weights

xAI: Grok 4.3

x-ai/grok-4.3
in $1.25/M
out $2.50/M
cache read $0.20/M
ctx: 1,000,000 max out: 4,096 in: text, image, pdf out: text
reasoning tools vision structured temp open weights

xAI: Grok Build 0.1

x-ai/grok-build-0.1
in $1.00/M
out $2.00/M
cache read $0.20/M
ctx: 256,000 max out: 256,000 in: image, text out: text
reasoning tools vision structured temp open weights

Xiaomi: MiMo V2.5 Pro

xiaomi/mimo-v2.5-pro
in $1.00/M
out $3.00/M
cache read $0.20/M
ctx: 1,048,576 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Xiaomi: MiMo-V2-Flash

xiaomi/mimo-v2-flash
in $0.09/M
out $0.29/M
cache read $0.04/M
ctx: 262,144 max out: 65,536 in: text out: text
reasoning tools vision structured temp open weights

Xiaomi: MiMo-V2-Omni

xiaomi/mimo-v2-omni
in $0.40/M
out $2.00/M
cache read $0.08/M
ctx: 262,144 max out: 65,536 in: text, image, audio, video, pdf out: text
reasoning tools vision structured temp open weights

Xiaomi: MiMo-V2-Pro

xiaomi/mimo-v2-pro
in $1.00/M
out $3.00/M
cache read $0.20/M
ctx: 1,048,576 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Xiaomi: MiMo-V2.5

xiaomi/mimo-v2.5
in $0.40/M
out $2.00/M
cache read $0.08/M
ctx: 1,048,576 max out: 131,072 in: text, image, audio, video out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4 32B

z-ai/glm-4-32b
in $0.10/M
out $0.10/M
ctx: 128,000 max out: 32,768 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.5

z-ai/glm-4.5
in $0.60/M
out $2.20/M
cache read $0.17/M
ctx: 131,072 max out: 98,304 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.5 Air

z-ai/glm-4.5-air
in $0.13/M
out $0.85/M
cache read $0.03/M
ctx: 131,072 max out: 98,304 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.5V

z-ai/glm-4.5v
in $0.60/M
out $1.80/M
cache read $0.11/M
ctx: 65,536 max out: 16,384 in: text, image out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.6

z-ai/glm-4.6
in $0.39/M
out $1.90/M
cache read $0.17/M
ctx: 204,800 max out: 204,800 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.6V

z-ai/glm-4.6v
in $0.30/M
out $0.90/M
ctx: 131,072 max out: 131,072 in: image, text, video out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.7

z-ai/glm-4.7
in $0.38/M
out $1.98/M
cache read $0.20/M
ctx: 202,752 max out: 65,535 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash
in $0.06/M
out $0.40/M
cache read $0.01/M
ctx: 202,752 max out: 40,551 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 5

z-ai/glm-5
in $0.72/M
out $2.30/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 5 Turbo

z-ai/glm-5-turbo
in $1.20/M
out $4.00/M
cache read $0.24/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 5.1

z-ai/glm-5.1
in $1.26/M
out $3.96/M
ctx: 202,752 max out: 131,072 in: text out: text
reasoning tools vision structured temp open weights

Z.ai: GLM 5V Turbo

z-ai/glm-5v-turbo
in $1.20/M
out $4.00/M
cache read $0.24/M
ctx: 202,752 max out: 131,072 in: image, text, video out: text
reasoning tools vision structured temp open weights
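
Cost estimate (sketch)

The prices in this listing are quoted in USD per million tokens, so a request's cost is a weighted sum of its token counts. The sketch below shows that arithmetic for two entries copied from the listing. It rests on an assumption the listing does not state: cached input tokens are billed at the "cache read" rate in place of the normal input rate, and models without a cache read price bill every input token at the input rate. The names `PRICES` and `estimate_cost` are hypothetical helpers, not part of any Kilo API, and cache write charges are ignored.

```python
from decimal import Decimal

# USD per million tokens, copied from the listing above.
PRICES = {
    "openai/gpt-5-mini": {
        "in": Decimal("0.25"),
        "out": Decimal("2.00"),
        "cache_read": Decimal("0.03"),
    },
    "z-ai/glm-5": {
        "in": Decimal("0.72"),
        "out": Decimal("2.30"),
    },
}

MILLION = Decimal(1_000_000)


def estimate_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Return the estimated request cost in USD as a Decimal.

    Assumption: cached_tokens is the portion of input_tokens served from
    cache, billed at the cache read rate when the model lists one.
    """
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    # Fall back to the normal input rate when no cache read price is listed.
    cache_rate = price.get("cache_read", price["in"])
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * price["in"]
        + cached_tokens * cache_rate
        + output_tokens * price["out"]
    )
    return total / MILLION


if __name__ == "__main__":
    # 100,000 input tokens (40,000 of them cached) and 2,000 output tokens.
    print(estimate_cost("openai/gpt-5-mini", 100_000, 2_000, cached_tokens=40_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise. Actual billing may differ from this estimate, for example through cache write charges or discounts.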