Static provider page

DeepInfra public model pricing

This page turns DeepInfra's public model pricing into a static, indexable summary for search engines, AI crawlers, and humans.

85 listings · 18 families · Aggregator / reseller type · 2026-05-11 crawl

Provider summary

DeepInfra currently has 85 token-priced listings in the CheapTokenz public bundle, spanning 18 model families.

Source type: Aggregator / reseller. Crawl timestamp: 2026-05-11T06:05:43.237Z.

Pricing caveats

No extra pricing caveats are attached to this page right now.

Public listings: 85 (token-priced)
Families: 18 (including Alibaba Models, Claude, Bytedance, DeepSeek)
Crawled at: 2026-05-11 (provider-level timestamp)
Bundle generated: 2026-05-11 (public release view)

| Model | Model ID | Input | Output | Blended | Context | Link |
| --- | --- | --- | --- | --- | --- | --- |
| Llama 3.1 8B | meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.02 | $0.03 | $0.11 | 131k | Source |
| Mistral Nemo | mistralai/Mistral-Nemo-Instruct-2407 | $0.02 | $0.04 | $0.14 | 131k | Source |
| Qwen3.5 0.8B | Qwen/Qwen3.5-0.8B | $0.01 | $0.05 | $0.16 | 262k | Source |
| Llama 3.1 8B | meta-llama/Meta-Llama-3.1-8B-Instruct | $0.02 | $0.05 | $0.17 | 131k | Source |
| L3 Lunaris 8B | Sao10K/L3-8B-Lunaris-v1-Turbo | $0.04 | $0.05 | $0.19 | 8k | Source |
| Gemma 3 | google/gemma-3-4b-it | $0.04 | $0.08 | $0.28 | 131k | Source |
| Mistral Small | mistralai/Mistral-Small-24B-Instruct-2501 | $0.05 | $0.08 | $0.29 | 33k | Source |
| Qwen3.5 2B | Qwen/Qwen3.5-2B | $0.02 | $0.1 | $0.32 | 262k | Source |
| Qwen3 235B A22B | Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.071 | $0.1 | $0.371 | 262k | Source |
| Gemma 3 | google/gemma-3-12b-it | $0.04 | $0.13 | $0.43 | 131k | Source |
| GPT-OSS 20B | openai/gpt-oss-20b | $0.03 | $0.14 | $0.45 | 131k | Source |
| Qwen3.5 4B | Qwen/Qwen3.5-4B | $0.03 | $0.15 | $0.48 | 262k | Source |
| Gemini 1.5 Flash 8B | google/gemini-1.5-flash-8b | $0.037 | $0.15 | $0.487 | 1.0M | Source |
| Qwen3.5 9B | Qwen/Qwen3.5-9B | $0.04 | $0.15 | $0.49 | 262k | Source |
| Phi 4 | microsoft/phi-4 | $0.07 | $0.14 | $0.49 | 16k | Source |
| Nemotron Nano 9B | nvidia/NVIDIA-Nemotron-Nano-9B-v2 | $0.04 | $0.16 | $0.52 | 131k | Source |
| Gemma 3 27B | google/gemma-3-27b-it | $0.08 | $0.16 | $0.56 | 131k | Source |
| GPT-OSS 120B | openai/gpt-oss-120b | $0.039 | $0.19 | $0.609 | 131k | Source |
| Nemotron Nano 30B | nvidia/Nemotron-3-Nano-30B-A3B | $0.05 | $0.2 | $0.65 | 262k | Source |
| Mistral Small | mistralai/Mistral-Small-3.2-24B-Instruct-2506 | $0.075 | $0.2 | $0.675 | 128k | Source |
| Llama Guard 4 12B | meta-llama/Llama-Guard-4-12B | $0.18 | $0.18 | $0.72 | 164k | Source |
| Qwen3 14B | Qwen/Qwen3-14B | $0.12 | $0.24 | $0.84 | 41k | Source |
| Qwen3 32B | Qwen/Qwen3-32B | $0.08 | $0.28 | $0.92 | 41k | Source |
| Gemini 1.5 Flash | google/gemini-1.5-flash | $0.075 | $0.3 | $0.975 | 1.0M | Source |
| Llama 4 Scout 17B | meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.08 | $0.3 | $0.98 | 328k | Source |
| Llama 3.2 11B Vision | meta-llama/Llama-3.2-11B-Vision-Instruct | $0.245 | $0.245 | $0.98 | 131k | Source |
| DeepSeek V4 Flash | deepseek-ai/DeepSeek-V4-Flash | $0.14 | $0.28 | $0.98 | 1.0M | Source |
| Step 3.5 Flash | stepfun-ai/Step-3.5-Flash | $0.1 | $0.3 | $1 | 262k | Source |
| Llama 3.3 70B | meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.1 | $0.32 | $1.06 | 131k | Source |
| Gemma 4 26B | google/gemma-4-26B-A4B-it | $0.07 | $0.34 | $1.09 | 262k | Source |
| Hermes 3 70B | NousResearch/Hermes-3-Llama-3.1-70B | $0.3 | $0.3 | $1.2 | 131k | Source |
| GLM 4.7 Flash | zai-org/GLM-4.7-Flash | $0.06 | $0.4 | $1.26 | 203k | Source |
| Gemma 4 31B | google/gemma-4-31B-it | $0.13 | $0.38 | $1.27 | 262k | Source |
| Seed 2.0 Mini | ByteDance/Seed-2.0-mini | $0.1 | $0.4 | $1.3 | 256k | Source |
| Nemotron Super 49B | nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | $0.1 | $0.4 | $1.3 | 131k | Source |
| DeepSeek V3.2 | deepseek-ai/DeepSeek-V3.2 | $0.26 | $0.38 | $1.4 | 164k | Source |
| Qwen3 30B A3B | Qwen/Qwen3-30B-A3B | $0.09 | $0.45 | $1.44 | 41k | Source |
| Qwen 2.5 72B | Qwen/Qwen2.5-72B-Instruct | $0.36 | $0.4 | $1.56 | 33k | Source |
| MythoMax 13B | Gryphe/MythoMax-L2-13b | $0.4 | $0.4 | $1.6 | 4k | Source |
| Llama 3.1 70B | meta-llama/Meta-Llama-3.1-70B-Instruct | $0.4 | $0.4 | $1.6 | 131k | Source |
| Llama 3.1 70B | meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.4 | $0.4 | $1.6 | 131k | Source |
| Nemotron Super 120B | nvidia/NVIDIA-Nemotron-3-Super-120B-A12B | $0.1 | $0.5 | $1.6 | 262k | Source |
| Llama 4 Maverick 17B | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.15 | $0.6 | $1.95 | 1.0M | Source |
| GPT-OSS 120B | openai/gpt-oss-120b-Turbo | $0.15 | $0.6 | $1.95 | 131k | Source |
| Qwen3 VL 30B | Qwen/Qwen3-VL-30B-A3B-Instruct | $0.15 | $0.6 | $1.95 | 262k | Source |
| DeepSeek V3 | deepseek-ai/DeepSeek-V3-0324 | $0.2 | $0.77 | $2.51 | 164k | Source |
| DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1 | $0.21 | $0.79 | $2.58 | 164k | Source |
| Nemotron 3 Nano Omni 30B | nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning | $0.2 | $0.8 | $2.6 | 262k | Source |
| Qwen3 VL 235B | Qwen/Qwen3-VL-235B-A22B-Instruct | $0.2 | $0.88 | $2.84 | 262k | Source |
| DeepSeek V3 | deepseek-ai/DeepSeek-V3 | $0.32 | $0.89 | $2.99 | 164k | Source |
| Qwen3.6 35B A3B | Qwen/Qwen3.6-35B-A3B | $0.15 | $0.95 | $3 | 262k | Source |
| R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.7 | $0.8 | $3.1 | 131k | Source |
| DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1-Terminus | $0.27 | $0.95 | $3.12 | 164k | Source |
| Qwen3.5 35B | Qwen/Qwen3.5-35B-A3B | $0.14 | $1 | $3.14 | 262k | Source |
| Qwen3 Coder Turbo | Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | $0.3 | $1 | $3.3 | 262k | Source |
| Qwen3 Next 80B Thinking | Qwen/Qwen3-Next-80B-A3B-Instruct | $0.09 | $1.1 | $3.39 | 262k | Source |
| L3 Euryale 70B | Sao10K/L3.1-70B-Euryale-v2.2 | $0.85 | $0.85 | $3.4 | 131k | Source |
| MiniMax M2.5 | MiniMaxAI/MiniMax-M2.5 | $0.15 | $1.15 | $3.6 | 197k | Source |
| Hermes 3 405B | NousResearch/Hermes-3-Llama-3.1-405B | $1 | $1 | $4 | 131k | Source |
| GLM 4.6 | zai-org/GLM-4.6 | $0.43 | $1.74 | $5.65 | 203k | Source |
| GLM 4.7 | zai-org/GLM-4.7 | $0.4 | $1.75 | $5.65 | 203k | Source |
| Seed 1.8 | ByteDance/Seed-1.8 | $0.25 | $2 | $6.25 | 256k | Source |
| MiMo V2.5 | XiaomiMiMo/MiMo-V2.5 | $0.4 | $2 | $6.4 | 262k | Source |
| GLM 5 | zai-org/GLM-5 | $0.6 | $2.08 | $6.84 | 203k | Source |
| DeepSeek R1 | deepseek-ai/DeepSeek-R1-0528 | $0.5 | $2.15 | $6.95 | 164k | Source |
| Qwen3 235B A22B | Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.23 | $2.3 | $7.13 | 262k | Source |
| Kimi K2.5 | moonshotai/Kimi-K2.5 | $0.45 | $2.25 | $7.2 | 262k | Source |
| Qwen3.5 122B | Qwen/Qwen3.5-122B-A10B | $0.29 | $2.4 | $7.49 | 262k | Source |
| Gemini 2.5 Flash | google/gemini-2.5-flash | $0.3 | $2.5 | $7.8 | 1.0M | Source |
| Qwen3.5 27B | Qwen/Qwen3.5-27B | $0.26 | $2.6 | $8.06 | 262k | Source |
| Seed 2.0 Code | ByteDance/Seed-2.0-code | $0.5 | $3 | $9.5 | 256k | Source |
| Seed 2.0 Pro | ByteDance/Seed-2.0-pro | $0.5 | $3 | $9.5 | 256k | Source |
| Qwen3.6 27B | Qwen/Qwen3.6-27B | $0.32 | $3.2 | $9.92 | 262k | Source |
| DeepSeek R1 Turbo | deepseek-ai/DeepSeek-R1-0528-Turbo | $1 | $3 | $10 | 33k | Source |
| MiMo V2.5 Pro | XiaomiMiMo/MiMo-V2.5-Pro | $1 | $3 | $10 | 1.0M | Source |
| Kimi K2.6 | moonshotai/Kimi-K2.6 | $0.75 | $3.5 | $11.25 | 262k | Source |
| Qwen3.5 397B | Qwen/Qwen3.5-397B-A17B | $0.49 | $3.6 | $11.29 | 262k | Source |
| GLM 5.1 | zai-org/GLM-5.1 | $1.05 | $3.5 | $11.55 | 203k | Source |
| DeepSeek V4 Pro | deepseek-ai/DeepSeek-V4-Pro | $1.74 | $3.48 | $12.18 | 66k | Source |
| Qwen3 Max | Qwen/Qwen3-Max | $1.2 | $6 | $19.2 | 256k | Source |
| Qwen3 Max Thinking | Qwen/Qwen3-Max-Thinking | $1.2 | $6 | $19.2 | 256k | Source |
| Gemini 2.5 Pro | google/gemini-2.5-pro | $1.25 | $10 | $31.25 | 1.0M | Source |
| Claude Sonnet 3.7 | anthropic/claude-3-7-sonnet-latest | $3.3 | $16.5 | $52.8 | 200k | Source |
| Claude Sonnet 4.6 | anthropic/claude-4-sonnet | $3.3 | $16.5 | $52.8 | 200k | Source |
| Claude Opus 4.7 | anthropic/claude-4-opus | $16.5 | $82.5 | $264 | 200k | Source |
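
The page does not say how the Blended column is derived. In the rows sampled below, the figure matches input plus three times output, so the following is a minimal sketch that checks that inferred relationship. The weight of 3 and the helper names `blended` and `mismatches` are assumptions made for illustration, not something the page documents.

```python
from decimal import Decimal

# (model ID, input, output, blended) copied from rows of the table above.
ROWS = [
    ("meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo", "0.02", "0.03", "0.11"),
    ("Qwen/Qwen3-235B-A22B-Instruct-2507", "0.071", "0.1", "0.371"),
    ("google/gemini-2.5-pro", "1.25", "10", "31.25"),
    ("anthropic/claude-4-opus", "16.5", "82.5", "264"),
]

# Assumption: inferred from the listed figures, not stated by the page.
OUTPUT_WEIGHT = 3


def blended(input_price: str, output_price: str) -> Decimal:
    """Return input + OUTPUT_WEIGHT * output, using exact decimal arithmetic."""
    return Decimal(input_price) + OUTPUT_WEIGHT * Decimal(output_price)


def mismatches(rows):
    """Return the model IDs whose listed blended price differs from the formula."""
    return [
        model_id
        for model_id, inp, out, listed in rows
        if blended(inp, out) != Decimal(listed)
    ]


if __name__ == "__main__":
    print(mismatches(ROWS))
```

`Decimal` is used instead of `float` so that prices such as `0.071` compare exactly. A non-empty result from `mismatches` would mean the assumed weight does not hold for those rows.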