Compay | Model | Input Price$ | Output Price$ | Unit | MMLU Score |
|---|---|---|---|---|---|
Sonar Pro | 200 | 200 | per 1M tokens | 84.00% | |
Sonar | 200 | 200 | per 1M tokens | 82.00% | |
OpenAI GPT-4-32k | 60 | 120 | per 1M tokens | 86.60% | |
GPT-4-32k | 60 | 120 | per 1M tokens | 86.40% | |
Bedrock Claude 3 Opus | 15 | 75 | per 1M tokens | 86.80% | |
Claude 3 Opus Latest | 15 | 75 | per 1M tokens | 86.80% | |
Claude 3 Opus | 15 | 75 | per 1M tokens | 86.80% | |
OpenAI o1-preview | 15 | 60 | per 1M tokens | 90.80% | |
OpenAI GPT-4 | 30 | 60 | per 1M tokens | 86.60% | |
GPT-4 | 30 | 60 | per 1M tokens | 86.40% | |
o1-preview | 15 | 60 | per 1M tokens | 90.80% | |
OpenAI GPT-4 Turbo | 10 | 30 | per 1M tokens | 86.60% | |
GPT-4 Turbo Vision | 10 | 30 | per 1M tokens | 86.60% | |
GPT-4 Turbo | 10 | 30 | per 1M tokens | 86.60% | |
Mistral Large | 8 | 24 | per 1M tokens | 81.20% | |
Mistral Large | 8 | 24 | per 1M tokens | 81.20% | |
Bedrock Mistral Large | 8 | 24 | per 1M tokens | 81.20% | |
Claude 2 | 8 | 24 | per 1M tokens | 78.00% | |
Claude 2.1 | 8 | 24 | per 1M tokens | 78.50% | |
Gemini Ultra | 7 | 21 | per 1M tokens | 83.70% | |
Jurassic-2 Ultra | 18 | 18 | per 1M tokens | 74.00% | |
Qwen-Max | 5.56 | 16.67 | per 1M tokens | 80.00% | |
ERNIE 4.0 | 16.67 | 16.67 | per 1M tokens | 74.00% | |
Claude 3.5 Sonnet | 3 | 15 | per 1M tokens | 88.70% | |
GPT-4o | 5 | 15 | per 1M tokens | 88.70% | |
Command R+ | 3 | 15 | per 1M tokens | 78.50% | |
Bedrock Claude 3 Sonnet | 3 | 15 | per 1M tokens | 79.00% | |
OpenAI GPT-4o | 5 | 15 | per 1M tokens | 88.70% | |
Claude 3.5 Sonnet Latest | 3 | 15 | per 1M tokens | 88.70% | |
Claude 3 Sonnet | 3 | 15 | per 1M tokens | 79.00% | |
Claude 3.5 Sonnet | 3 | 15 | per 1M tokens | 88.70% | |
GPT-4o | 5 | 15 | per 1M tokens | 88.70% | |
ChatGLM-4V | 13.89 | 13.89 | per 1M tokens | 76.00% | |
ChatGLM-4 | 13.89 | 13.89 | per 1M tokens | 78.00% | |
OpenAI o1-mini | 3 | 12 | per 1M tokens | 85.20% | |
o1-mini | 3 | 12 | per 1M tokens | 85.20% | |
Gemini 1.5 Pro | 3.5 | 10.5 | per 1M tokens | 85.90% | |
Gemini 1.5 Pro 001 | 3.5 | 10.5 | per 1M tokens | 85.90% | |
Gemini 1.5 Pro | 3.5 | 10.5 | per 1M tokens | 85.90% | |
Jurassic-2 Mid | 10 | 10 | per 1M tokens | 72.00% | |
Sonar 70B | 10 | 10 | per 1M tokens | 82.00% | |
Baichuan-4 | 8.33 | 8.33 | per 1M tokens | 76.00% | |
Mistral Medium | 2.7 | 8.1 | per 1M tokens | 75.30% | |
Generate | 2 | 6 | per 1M tokens | 72.00% | |
Mixtral 8x22B | 2 | 6 | per 1M tokens | 77.80% | |
Bedrock Cohere Command | 2 | 6 | per 1M tokens | 74.00% | |
Llama 3.1 405B | 5 | 5 | per 1M tokens | 86.50% | |
Jurassic-2 Light | 5 | 5 | per 1M tokens | 68.00% | |
LLaMA 3.1 Sonar Large | 5 | 5 | per 1M tokens | 86.00% | |
Sonar 8x7B | 5 | 5 | per 1M tokens | 78.00% | |
Llama 3.1 405B | 5 | 5 | per 1M tokens | 86.50% | |
SenseNova | 4.17 | 4.17 | per 1M tokens | 75.00% | |
Spark Max | 4.17 | 4.17 | per 1M tokens | 74.00% | |
Doubao-vision | 1.39 | 4.17 | per 1M tokens | 73.00% | |
OpenAI GPT-3.5 Turbo 16k | 3 | 4 | per 1M tokens | 70.00% | |
Kimi-VL | 3.33 | 3.33 | per 1M tokens | 70.00% | |
Mistral Small | 1 | 3 | per 1M tokens | 72.00% | |
Yuncong Pro | 2.78 | 2.78 | per 1M tokens | 72.00% | |
SenseChat-5 | 2.78 | 2.78 | per 1M tokens | 73.00% | |
Yi-Large | 2.78 | 2.78 | per 1M tokens | 77.00% | |
Pangu | 2.78 | 2.78 | per 1M tokens | 69.00% | |
Kimi 128k | 2.78 | 2.78 | per 1M tokens | 72.00% | |
Hunyuan Pro | 2.78 | 2.78 | per 1M tokens | 75.00% | |
Claude Instant | 0.8 | 2.4 | per 1M tokens | 73.40% | |
SageGPT Pro | 2.22 | 2.22 | per 1M tokens | 73.00% | |
Spark Pro | 2.08 | 2.08 | per 1M tokens | 70.00% | |
Doubao-pro | 0.69 | 2.08 | per 1M tokens | 75.00% | |
LLaMA 3.1 Sonar | 2 | 2 | per 1M tokens | 82.50% | |
Command | 1 | 2 | per 1M tokens | 68.00% | |
Llama 3.1 405B | 2 | 2 | per 1M tokens | 86.50% | |
OpenAI GPT-3.5 Turbo | 1.5 | 2 | per 1M tokens | 70.00% | |
GPT-3.5 Turbo Instruct | 1.5 | 2 | per 1M tokens | 70.00% | |
Wanx | 0.56 | 1.67 | per 1M tokens | 68.00% | |
Yuncong | 1.67 | 1.67 | per 1M tokens | 68.00% | |
Pangu-NLP | 1.67 | 1.67 | per 1M tokens | 68.00% | |
Kimi Chat | 1.67 | 1.67 | per 1M tokens | 72.00% | |
Doubao-character | 0.56 | 1.67 | per 1M tokens | 72.00% | |
Qwen-Plus | 0.56 | 1.67 | per 1M tokens | 72.50% | |
ERNIE 3.5 | 1.67 | 1.67 | per 1M tokens | 68.50% | |
Bedrock Titan Express | 0.8 | 1.6 | per 1M tokens | 75.00% | |
Command R | 0.5 | 1.5 | per 1M tokens | 72.50% | |
Gemini 1.0 Pro Vision | 0.5 | 1.5 | per 1M tokens | 71.80% | |
Gemini 1.0 Pro | 0.5 | 1.5 | per 1M tokens | 71.80% | |
GPT-3.5 Turbo 16k | 0.5 | 1.5 | per 1M tokens | 70.00% | |
GPT-3.5 Turbo | 0.5 | 1.5 | per 1M tokens | 70.00% | |
SenseChat-4 | 1.39 | 1.39 | per 1M tokens | 70.00% | |
Pangu-E | 1.39 | 1.39 | per 1M tokens | 66.00% | |
Spark Desk | 1.39 | 1.39 | per 1M tokens | 72.00% | |
Baichuan-NPC | 1.39 | 1.39 | per 1M tokens | 74.00% | |
abab6.5 | 1.39 | 1.39 | per 1M tokens | 75.00% | |
Hunyuan | 1.39 | 1.39 | per 1M tokens | 71.00% | |
Bedrock Claude 3 Haiku | 0.25 | 1.25 | per 1M tokens | 75.20% | |
Claude 3 Haiku Latest | 0.25 | 1.25 | per 1M tokens | 75.20% | |
Claude 3 Haiku | 0.25 | 1.25 | per 1M tokens | 75.20% | |
SageGPT | 1.11 | 1.11 | per 1M tokens | 69.00% | |
Pangu-R | 1.11 | 1.11 | per 1M tokens | 67.00% | |
Qwen-VL | 1.11 | 1.11 | per 1M tokens | 75.00% | |
Qwen1.5-110B | 1.11 | 1.11 | per 1M tokens | 80.00% | |
Gemini 1.5 Flash 001 | 0.35 | 1.05 | per 1M tokens | 78.90% | |
Gemini 1.5 Flash | 0.35 | 1.05 | per 1M tokens | 78.90% | |
Jamba | 0.5 | 1 | per 1M tokens | 80.00% | |
Rerank | 1 | 1 | per 1M tokens | N/A | |
Qwen-72B-Chat | 0.9 | 0.9 | per 1M tokens | 77.00% | |
Llama 3 70B | 0.9 | 0.9 | per 1M tokens | 82.00% | |
Llama 3.1 70B | 0.9 | 0.9 | per 1M tokens | 82.00% | |
Mixtral 8x22B | 0.9 | 0.9 | per 1M tokens | 77.80% | |
Llama 3 70B | 0.9 | 0.9 | per 1M tokens | 82.00% | |
Qwen 72B | 0.9 | 0.9 | per 1M tokens | 77.00% | |
Llama 3 70B | 0.9 | 0.9 | per 1M tokens | 82.00% | |
Codestral | 0.3 | 0.9 | per 1M tokens | 78.00% | |
DeepSeek 67B | 0.9 | 0.9 | per 1M tokens | 76.00% | |
Qwen 72B | 0.9 | 0.9 | per 1M tokens | 77.00% | |
Mixtral 8x22B | 0.9 | 0.9 | per 1M tokens | 77.80% | |
Llama 3 70B | 0.9 | 0.9 | per 1M tokens | 82.00% | |
DeepSeek-67B | 0.9 | 0.9 | per 1M tokens | 76.00% | |
Yi-Spark | 0.28 | 0.83 | per 1M tokens | 70.00% | |
InternLM-XComposer | 0.83 | 0.83 | per 1M tokens | 72.00% | |
XVERSE-65B | 0.83 | 0.83 | per 1M tokens | 74.00% | |
Yi-VL-34B | 0.83 | 0.83 | per 1M tokens | 70.00% | |
Doubao-lite | 0.42 | 0.83 | per 1M tokens | 68.00% | |
Qwen-Turbo | 0.28 | 0.83 | per 1M tokens | 70.00% | |
Yi 34B | 0.8 | 0.8 | per 1M tokens | 73.00% | |
Gemma 2 27B | 0.8 | 0.8 | per 1M tokens | 76.00% | |
Gemma 2 27B | 0.8 | 0.8 | per 1M tokens | 76.00% | |
Gemma 2 27B | 0.8 | 0.8 | per 1M tokens | 76.00% | |
Llama 3.1 70B | 0.59 | 0.79 | per 1M tokens | 82.00% | |
Llama 3 70B | 0.59 | 0.79 | per 1M tokens | 82.00% | |
Bedrock Llama 3 70B | 0.76 | 0.76 | per 1M tokens | 82.00% | |
Mixtral 8x7B | 0.7 | 0.7 | per 1M tokens | 71.30% | |
Mixtral 8x7B | 0.7 | 0.7 | per 1M tokens | 71.30% | |
ChatGLM-3-Turbo | 0.69 | 0.69 | per 1M tokens | 66.00% | |
Hunyuan Standard | 0.69 | 0.69 | per 1M tokens | 68.00% | |
Phi 3 Medium | 0.3 | 0.6 | per 1M tokens | 78.00% | |
Command Light | 0.3 | 0.6 | per 1M tokens | 65.00% | |
Bedrock Titan Lite | 0.3 | 0.6 | per 1M tokens | 70.00% | |
OpenAI GPT-4o-mini | 0.15 | 0.6 | per 1M tokens | 82.00% | |
GPT-4o-mini | 0.15 | 0.6 | per 1M tokens | 82.00% | |
InternLM2-20B | 0.56 | 0.56 | per 1M tokens | 71.00% | |
Yi-Medium | 0.56 | 0.56 | per 1M tokens | 72.00% | |
abab5.5 | 0.56 | 0.56 | per 1M tokens | 68.00% | |
Hunyuan Role | 0.56 | 0.56 | per 1M tokens | 70.00% | |
Qwen-Audio | 0.56 | 0.56 | per 1M tokens | N/A | |
Llava 13B | 0.5 | 0.5 | per 1M tokens | 68.00% | |
XVERSE-MoE | 0.42 | 0.42 | per 1M tokens | 76.00% | |
Yi-34B | 0.42 | 0.42 | per 1M tokens | 73.00% | |
Baichuan2-13B | 0.42 | 0.42 | per 1M tokens | 70.00% | |
ERNIE Character | 0.42 | 0.42 | per 1M tokens | 70.00% | |
Jamba Mini | 0.2 | 0.4 | per 1M tokens | 76.00% | |
Bedrock Llama 3 8B | 0.38 | 0.38 | per 1M tokens | 68.00% | |
CodeGeeX-4 | 0.28 | 0.28 | per 1M tokens | 65.00% | |
InternLM2.5-7B | 0.28 | 0.28 | per 1M tokens | 73.00% | |
XVERSE-13B | 0.28 | 0.28 | per 1M tokens | 68.00% | |
Embedding | 0.28 | 0.28 | per 1M tokens | N/A | |
Spark Lite | 0.28 | 0.28 | per 1M tokens | 65.00% | |
Baichuan2-7B | 0.28 | 0.28 | per 1M tokens | 68.00% | |
Baichuan-3-Turbo | 0.28 | 0.28 | per 1M tokens | 72.00% | |
abab5.5s | 0.28 | 0.28 | per 1M tokens | 65.00% | |
CodeGeeX-4 | 0.28 | 0.28 | per 1M tokens | 65.00% | |
Hunyuan Lite | 0.28 | 0.28 | per 1M tokens | 65.00% | |
ERNIE Speed | 0.28 | 0.28 | per 1M tokens | 65.00% | |
DeepSeek V2 | 0.14 | 0.28 | per 1M tokens | 78.50% | |
DeepSeek-Coder-V2 | 0.14 | 0.28 | per 1M tokens | 76.00% | |
DeepSeek-V2-Chat | 0.14 | 0.28 | per 1M tokens | 78.50% | |
DeepSeek-Coder | 0.14 | 0.28 | per 1M tokens | 74.00% | |
DeepSeek-V2 | 0.14 | 0.28 | per 1M tokens | 78.50% | |
Mixtral 8x7B | 0.27 | 0.27 | per 1M tokens | 71.30% | |
Mistral 7B | 0.25 | 0.25 | per 1M tokens | 60.00% | |
Llama 3 8B | 0.2 | 0.2 | per 1M tokens | 68.00% | |
Mistral 7B | 0.2 | 0.2 | per 1M tokens | 72.00% | |
Llama 3 8B | 0.2 | 0.2 | per 1M tokens | 68.00% | |
Mistral 7B | 0.2 | 0.2 | per 1M tokens | 72.00% | |
Llama 3 8B | 0.2 | 0.2 | per 1M tokens | 68.00% | |
Gemma 2 9B | 0.2 | 0.2 | per 1M tokens | 72.00% | |
PaLM 2 | 0.1 | 0.2 | per 1M tokens | 78.30% | |
Mistral NeMo | 0.15 | 0.15 | per 1M tokens | 72.00% | |
Bedrock Mistral 7B | 0.15 | 0.15 | per 1M tokens | 72.00% | |
CodeGeeX-2 | 0.14 | 0.14 | per 1M tokens | 60.00% | |
Embedding | 0.14 | 0.14 | per 1M tokens | N/A | |
abab6 | 0.14 | 0.14 | per 1M tokens | 70.00% | |
ChatGLM2-6B | 0.14 | 0.14 | per 1M tokens | 56.00% | |
Doubao-embedding | 0.14 | 0.14 | per 1M tokens | N/A | |
DeepSeek-MoE | 0.07 | 0.14 | per 1M tokens | 76.50% | |
Neural Chat | 0.1 | 0.1 | per 1M tokens | 68.00% | |
Embed Multilingual | 0.1 | 0.1 | per 1M tokens | N/A | |
Embed English | 0.1 | 0.1 | per 1M tokens | N/A | |
Mistral Embed | 0.1 | 0.1 | per 1M tokens | N/A | |
Llama 3.1 8B | 0.05 | 0.1 | per 1M tokens | 68.00% | |
Gemma 7B | 0.1 | 0.1 | per 1M tokens | 64.00% | |
Llama 3 8B | 0.05 | 0.1 | per 1M tokens | 68.00% | |
Text Embedding | 0.1 | 0.1 | per 1M tokens | N/A | |
Stable Image Ultra | 0.08 | 0.08 | per 1M tokens | N/A | |
DALL-E 3 | 0.04 | 0.08 | per 1M tokens | N/A | |
Embeding | 0.07 | 0.07 | per 1M tokens | N/A | |
ERNIE Tiny | 0.07 | 0.07 | per 1M tokens | 58.00% | |
SD 3.5 Large | 0.065 | 0.065 | per 1M tokens | N/A | |
Stable Diffusion 3 | 0.065 | 0.065 | per 1M tokens | N/A | |
Stable Diffusion 3 | 0.05 | 0.05 | per 1M tokens | N/A | |
SDXL 1.0 | 0.04 | 0.04 | per 1M tokens | N/A | |
SD 3.5 Medium | 0.035 | 0.035 | per 1M tokens | N/A | |
Sdxl | 0.03 | 0.03 | per 1M tokens | N/A | |
TTS HD | 0.03 | 0.03 | per 1M tokens | N/A | |
Codey | 0.025 | 0.025 | per 1M tokens | 65.00% | |
TTS | 0.015 | 0.015 | per 1M tokens | N/A | |
Whisper | 0.006 | 0.006 | per 1M tokens | N/A | |
ERNIE Lite | 0 | 0 | per 1M tokens | 62.00% | |
Llama 3.1 8B | 0 | 0 | per 1M tokens | 68.00% | |
Llama 3.1 70B | 0 | 0 | per 1M tokens | 82.00% | |
Llama 3.1 405B | 0 | 0 | per 1M tokens | 86.50% | |
Code Llama 34B | 0 | 0 | per 1M tokens | 52.00% | |
Code Llama 70B | 0 | 0 | per 1M tokens | 53.00% | |
Llama 2 7B | 0 | 0 | per 1M tokens | 45.00% | |
Llama 2 13B | 0 | 0 | per 1M tokens | 54.00% | |
Llama 2 70B | 0 | 0 | per 1M tokens | 69.00% | |
Llama 3 8B | 0 | 0 | per 1M tokens | 68.00% | |
Llama 3 70B | 0 | 0 | per 1M tokens | 82.00% |