AI API (Tokens) price comparision

DATA

Compay
Model
Input Price$
Output Price$
Unit
MMLU Score

Sonar Pro

200
200
per 1M tokens

84.00%

Sonar

200
200
per 1M tokens

82.00%

OpenAI GPT-4-32k

60
120
per 1M tokens

86.60%

GPT-4-32k

60
120
per 1M tokens

86.40%

Bedrock Claude 3 Opus

15
75
per 1M tokens

86.80%

Claude 3 Opus Latest

15
75
per 1M tokens

86.80%

Claude 3 Opus

15
75
per 1M tokens

86.80%

OpenAI o1-preview

15
60
per 1M tokens

90.80%

OpenAI GPT-4

30
60
per 1M tokens

86.60%

GPT-4

30
60
per 1M tokens

86.40%

o1-preview

15
60
per 1M tokens

90.80%

OpenAI GPT-4 Turbo

10
30
per 1M tokens

86.60%

GPT-4 Turbo Vision

10
30
per 1M tokens

86.60%

GPT-4 Turbo

10
30
per 1M tokens

86.60%

Mistral Large

8
24
per 1M tokens

81.20%

Mistral Large

8
24
per 1M tokens

81.20%

Bedrock Mistral Large

8
24
per 1M tokens

81.20%

Claude 2

8
24
per 1M tokens

78.00%

Claude 2.1

8
24
per 1M tokens

78.50%

Gemini Ultra

7
21
per 1M tokens

83.70%

Jurassic-2 Ultra

18
18
per 1M tokens

74.00%

Qwen-Max

5.56
16.67
per 1M tokens

80.00%

ERNIE 4.0

16.67
16.67
per 1M tokens

74.00%

Claude 3.5 Sonnet

3
15
per 1M tokens

88.70%

GPT-4o

5
15
per 1M tokens

88.70%

Command R+

3
15
per 1M tokens

78.50%

Bedrock Claude 3 Sonnet

3
15
per 1M tokens

79.00%

OpenAI GPT-4o

5
15
per 1M tokens

88.70%

Claude 3.5 Sonnet Latest

3
15
per 1M tokens

88.70%

Claude 3 Sonnet

3
15
per 1M tokens

79.00%

Claude 3.5 Sonnet

3
15
per 1M tokens

88.70%

GPT-4o

5
15
per 1M tokens

88.70%

ChatGLM-4V

13.89
13.89
per 1M tokens

76.00%

ChatGLM-4

13.89
13.89
per 1M tokens

78.00%

OpenAI o1-mini

3
12
per 1M tokens

85.20%

o1-mini

3
12
per 1M tokens

85.20%

Gemini 1.5 Pro

3.5
10.5
per 1M tokens

85.90%

Gemini 1.5 Pro 001

3.5
10.5
per 1M tokens

85.90%

Gemini 1.5 Pro

3.5
10.5
per 1M tokens

85.90%

Jurassic-2 Mid

10
10
per 1M tokens

72.00%

Sonar 70B

10
10
per 1M tokens

82.00%

Baichuan-4

8.33
8.33
per 1M tokens

76.00%

Mistral Medium

2.7
8.1
per 1M tokens

75.30%

Generate

2
6
per 1M tokens

72.00%

Mixtral 8x22B

2
6
per 1M tokens

77.80%

Bedrock Cohere Command

2
6
per 1M tokens

74.00%

Llama 3.1 405B

5
5
per 1M tokens

86.50%

Jurassic-2 Light

5
5
per 1M tokens

68.00%

LLaMA 3.1 Sonar Large

5
5
per 1M tokens

86.00%

Sonar 8x7B

5
5
per 1M tokens

78.00%

Llama 3.1 405B

5
5
per 1M tokens

86.50%

SenseNova

4.17
4.17
per 1M tokens

75.00%

Spark Max

4.17
4.17
per 1M tokens

74.00%

Doubao-vision

1.39
4.17
per 1M tokens

73.00%

OpenAI GPT-3.5 Turbo 16k

3
4
per 1M tokens

70.00%

Kimi-VL

3.33
3.33
per 1M tokens

70.00%

Mistral Small

1
3
per 1M tokens

72.00%

Yuncong Pro

2.78
2.78
per 1M tokens

72.00%

SenseChat-5

2.78
2.78
per 1M tokens

73.00%

Yi-Large

2.78
2.78
per 1M tokens

77.00%

Pangu

2.78
2.78
per 1M tokens

69.00%

Kimi 128k

2.78
2.78
per 1M tokens

72.00%

Hunyuan Pro

2.78
2.78
per 1M tokens

75.00%

Claude Instant

0.8
2.4
per 1M tokens

73.40%

SageGPT Pro

2.22
2.22
per 1M tokens

73.00%

Spark Pro

2.08
2.08
per 1M tokens

70.00%

Doubao-pro

0.69
2.08
per 1M tokens

75.00%

LLaMA 3.1 Sonar

2
2
per 1M tokens

82.50%

Command

1
2
per 1M tokens

68.00%

Llama 3.1 405B

2
2
per 1M tokens

86.50%

OpenAI GPT-3.5 Turbo

1.5
2
per 1M tokens

70.00%

GPT-3.5 Turbo Instruct

1.5
2
per 1M tokens

70.00%

Wanx

0.56
1.67
per 1M tokens

68.00%

Yuncong

1.67
1.67
per 1M tokens

68.00%

Pangu-NLP

1.67
1.67
per 1M tokens

68.00%

Kimi Chat

1.67
1.67
per 1M tokens

72.00%

Doubao-character

0.56
1.67
per 1M tokens

72.00%

Qwen-Plus

0.56
1.67
per 1M tokens

72.50%

ERNIE 3.5

1.67
1.67
per 1M tokens

68.50%

Bedrock Titan Express

0.8
1.6
per 1M tokens

75.00%

Command R

0.5
1.5
per 1M tokens

72.50%

Gemini 1.0 Pro Vision

0.5
1.5
per 1M tokens

71.80%

Gemini 1.0 Pro

0.5
1.5
per 1M tokens

71.80%

GPT-3.5 Turbo 16k

0.5
1.5
per 1M tokens

70.00%

GPT-3.5 Turbo

0.5
1.5
per 1M tokens

70.00%

SenseChat-4

1.39
1.39
per 1M tokens

70.00%

Pangu-E

1.39
1.39
per 1M tokens

66.00%

Spark Desk

1.39
1.39
per 1M tokens

72.00%

Baichuan-NPC

1.39
1.39
per 1M tokens

74.00%

abab6.5

1.39
1.39
per 1M tokens

75.00%

Hunyuan

1.39
1.39
per 1M tokens

71.00%

Bedrock Claude 3 Haiku

0.25
1.25
per 1M tokens

75.20%

Claude 3 Haiku Latest

0.25
1.25
per 1M tokens

75.20%

Claude 3 Haiku

0.25
1.25
per 1M tokens

75.20%

SageGPT

1.11
1.11
per 1M tokens

69.00%

Pangu-R

1.11
1.11
per 1M tokens

67.00%

Qwen-VL

1.11
1.11
per 1M tokens

75.00%

Qwen1.5-110B

1.11
1.11
per 1M tokens

80.00%

Gemini 1.5 Flash 001

0.35
1.05
per 1M tokens

78.90%

Gemini 1.5 Flash

0.35
1.05
per 1M tokens

78.90%

Jamba

0.5
1
per 1M tokens

80.00%

Rerank

1
1
per 1M tokens

N/A

Qwen-72B-Chat

0.9
0.9
per 1M tokens

77.00%

Llama 3 70B

0.9
0.9
per 1M tokens

82.00%

Llama 3.1 70B

0.9
0.9
per 1M tokens

82.00%

Mixtral 8x22B

0.9
0.9
per 1M tokens

77.80%

Llama 3 70B

0.9
0.9
per 1M tokens

82.00%

Qwen 72B

0.9
0.9
per 1M tokens

77.00%

Llama 3 70B

0.9
0.9
per 1M tokens

82.00%

Codestral

0.3
0.9
per 1M tokens

78.00%

DeepSeek 67B

0.9
0.9
per 1M tokens

76.00%

Qwen 72B

0.9
0.9
per 1M tokens

77.00%

Mixtral 8x22B

0.9
0.9
per 1M tokens

77.80%

Llama 3 70B

0.9
0.9
per 1M tokens

82.00%

DeepSeek-67B

0.9
0.9
per 1M tokens

76.00%

Yi-Spark

0.28
0.83
per 1M tokens

70.00%

InternLM-XComposer

0.83
0.83
per 1M tokens

72.00%

XVERSE-65B

0.83
0.83
per 1M tokens

74.00%

Yi-VL-34B

0.83
0.83
per 1M tokens

70.00%

Doubao-lite

0.42
0.83
per 1M tokens

68.00%

Qwen-Turbo

0.28
0.83
per 1M tokens

70.00%

Yi 34B

0.8
0.8
per 1M tokens

73.00%

Gemma 2 27B

0.8
0.8
per 1M tokens

76.00%

Gemma 2 27B

0.8
0.8
per 1M tokens

76.00%

Gemma 2 27B

0.8
0.8
per 1M tokens

76.00%

Llama 3.1 70B

0.59
0.79
per 1M tokens

82.00%

Llama 3 70B

0.59
0.79
per 1M tokens

82.00%

Bedrock Llama 3 70B

0.76
0.76
per 1M tokens

82.00%

Mixtral 8x7B

0.7
0.7
per 1M tokens

71.30%

Mixtral 8x7B

0.7
0.7
per 1M tokens

71.30%

ChatGLM-3-Turbo

0.69
0.69
per 1M tokens

66.00%

Hunyuan Standard

0.69
0.69
per 1M tokens

68.00%

Phi 3 Medium

0.3
0.6
per 1M tokens

78.00%

Command Light

0.3
0.6
per 1M tokens

65.00%

Bedrock Titan Lite

0.3
0.6
per 1M tokens

70.00%

OpenAI GPT-4o-mini

0.15
0.6
per 1M tokens

82.00%

GPT-4o-mini

0.15
0.6
per 1M tokens

82.00%

InternLM2-20B

0.56
0.56
per 1M tokens

71.00%

Yi-Medium

0.56
0.56
per 1M tokens

72.00%

abab5.5

0.56
0.56
per 1M tokens

68.00%

Hunyuan Role

0.56
0.56
per 1M tokens

70.00%

Qwen-Audio

0.56
0.56
per 1M tokens

N/A

Llava 13B

0.5
0.5
per 1M tokens

68.00%

XVERSE-MoE

0.42
0.42
per 1M tokens

76.00%

Yi-34B

0.42
0.42
per 1M tokens

73.00%

Baichuan2-13B

0.42
0.42
per 1M tokens

70.00%

ERNIE Character

0.42
0.42
per 1M tokens

70.00%

Jamba Mini

0.2
0.4
per 1M tokens

76.00%

Bedrock Llama 3 8B

0.38
0.38
per 1M tokens

68.00%

CodeGeeX-4

0.28
0.28
per 1M tokens

65.00%

InternLM2.5-7B

0.28
0.28
per 1M tokens

73.00%

XVERSE-13B

0.28
0.28
per 1M tokens

68.00%

Embedding

0.28
0.28
per 1M tokens

N/A

Spark Lite

0.28
0.28
per 1M tokens

65.00%

Baichuan2-7B

0.28
0.28
per 1M tokens

68.00%

Baichuan-3-Turbo

0.28
0.28
per 1M tokens

72.00%

abab5.5s

0.28
0.28
per 1M tokens

65.00%

CodeGeeX-4

0.28
0.28
per 1M tokens

65.00%

Hunyuan Lite

0.28
0.28
per 1M tokens

65.00%

ERNIE Speed

0.28
0.28
per 1M tokens

65.00%

DeepSeek V2

0.14
0.28
per 1M tokens

78.50%

DeepSeek-Coder-V2

0.14
0.28
per 1M tokens

76.00%

DeepSeek-V2-Chat

0.14
0.28
per 1M tokens

78.50%

DeepSeek-Coder

0.14
0.28
per 1M tokens

74.00%

DeepSeek-V2

0.14
0.28
per 1M tokens

78.50%

Mixtral 8x7B

0.27
0.27
per 1M tokens

71.30%

Mistral 7B

0.25
0.25
per 1M tokens

60.00%

Llama 3 8B

0.2
0.2
per 1M tokens

68.00%

Mistral 7B

0.2
0.2
per 1M tokens

72.00%

Llama 3 8B

0.2
0.2
per 1M tokens

68.00%

Mistral 7B

0.2
0.2
per 1M tokens

72.00%

Llama 3 8B

0.2
0.2
per 1M tokens

68.00%

Gemma 2 9B

0.2
0.2
per 1M tokens

72.00%

PaLM 2

0.1
0.2
per 1M tokens

78.30%

Mistral NeMo

0.15
0.15
per 1M tokens

72.00%

Bedrock Mistral 7B

0.15
0.15
per 1M tokens

72.00%

CodeGeeX-2

0.14
0.14
per 1M tokens

60.00%

Embedding

0.14
0.14
per 1M tokens

N/A

abab6

0.14
0.14
per 1M tokens

70.00%

ChatGLM2-6B

0.14
0.14
per 1M tokens

56.00%

Doubao-embedding

0.14
0.14
per 1M tokens

N/A

DeepSeek-MoE

0.07
0.14
per 1M tokens

76.50%

Neural Chat

0.1
0.1
per 1M tokens

68.00%

Embed Multilingual

0.1
0.1
per 1M tokens

N/A

Embed English

0.1
0.1
per 1M tokens

N/A

Mistral Embed

0.1
0.1
per 1M tokens

N/A

Llama 3.1 8B

0.05
0.1
per 1M tokens

68.00%

Gemma 7B

0.1
0.1
per 1M tokens

64.00%

Llama 3 8B

0.05
0.1
per 1M tokens

68.00%

Text Embedding

0.1
0.1
per 1M tokens

N/A

Stable Image Ultra

0.08
0.08
per 1M tokens

N/A

DALL-E 3

0.04
0.08
per 1M tokens

N/A

Embeding

0.07
0.07
per 1M tokens

N/A

ERNIE Tiny

0.07
0.07
per 1M tokens

58.00%

SD 3.5 Large

0.065
0.065
per 1M tokens

N/A

Stable Diffusion 3

0.065
0.065
per 1M tokens

N/A

Stable Diffusion 3

0.05
0.05
per 1M tokens

N/A

SDXL 1.0

0.04
0.04
per 1M tokens

N/A

SD 3.5 Medium

0.035
0.035
per 1M tokens

N/A

Sdxl

0.03
0.03
per 1M tokens

N/A

TTS HD

0.03
0.03
per 1M tokens

N/A

Codey

0.025
0.025
per 1M tokens

65.00%

TTS

0.015
0.015
per 1M tokens

N/A

Whisper

0.006
0.006
per 1M tokens

N/A

ERNIE Lite

0
0
per 1M tokens

62.00%

Llama 3.1 8B

0
0
per 1M tokens

68.00%

Llama 3.1 70B

0
0
per 1M tokens

82.00%

Llama 3.1 405B

0
0
per 1M tokens

86.50%

Code Llama 34B

0
0
per 1M tokens

52.00%

Code Llama 70B

0
0
per 1M tokens

53.00%

Llama 2 7B

0
0
per 1M tokens

45.00%

Llama 2 13B

0
0
per 1M tokens

54.00%

Llama 2 70B

0
0
per 1M tokens

69.00%

Llama 3 8B

0
0
per 1M tokens

68.00%

Llama 3 70B

0
0
per 1M tokens

82.00%