AI Leaderboard

Live ranking of the best AI chatbot models (Data from Chatbot Arena)

Total Models: 208

Rank

Org.

Model

Arena Elo

License

Google

Gemini-2.5-Pro

1473

Proprietary

Google

Gemini-2.5-Pro-Preview-05-06

1446

Proprietary

OpenAI

ChatGPT-4o-latest (2025-03-26)

1428

Proprietary

OpenAI

o3-2025-04-16

1426

Proprietary

DeepSeek

DeepSeek-R1-0528

1424

MIT

xAI

Grok-3-Preview-02-24

1423

Proprietary

Google

Gemini-2.5-Flash

1418

Proprietary

OpenAI

GPT-4.5-Preview

1415

Proprietary

Google

Gemini-2.5-Flash-Preview-04-17

1398

Proprietary

Alibaba

Qwen3-235B-A22B-no-thinking

1389

Apache 2.0

OpenAI

GPT-4.1-2025-04-14

1385

Proprietary

DeepSeek

DeepSeek-V3-0324

1384

MIT

Tencent

Hunyuan-Turbos-20250416

1380

Proprietary

MiniMax

Minimax-M1

1374

Apache 2.0

DeepSeek

DeepSeek-R1

1375

MIT

Anthropic

Claude Opus 4 (20250514)

1372

Proprietary

Mistral

Mistral Medium 3

1369

Proprietary

OpenAI

o1-2024-12-17

1367

Proprietary

Alibaba

Qwen3-235B-A22B

1367

Apache 2.0

Google

Gemini-2.0-Flash-001

1364

Proprietary

OpenAI

o4-mini-2025-04-16

1364

Proprietary

Alibaba

Qwen2.5-Max

1363

Proprietary

xAI

Grok-3-Mini-beta

1361

Proprietary

Google

Gemma-3-27B-it

1361

Gemma

OpenAI

o1-preview

1352

Proprietary

Anthropic

Claude Sonnet 4 (20250514)

1345

Proprietary

OpenAI

o3-mini-high

1342

Proprietary

Google

Gemma-3-12B-it

1338

Gemma

OpenAI

GPT-4.1-mini-2025-04-14

1338

Proprietary

DeepSeek

DeepSeek-V3

1336

DeepSeek

Alibaba

QwQ-32B

1334

Apache 2.0

Amazon

Amazon-Nova-Experimental-Chat-05-14

1330

Proprietary

Zhipu

GLM-4-Plus-0111

1328

Proprietary

Google

Gemini-2.0-Flash-Lite

1330

Proprietary

Alibaba

Qwen-Plus-0125

1328

Proprietary

Cohere

Command A (03-2025)

1327

CC-BY-NC-4.0

StepFun

Step-2-16K-Exp

1322

Proprietary

Tencent

Hunyuan-TurboS-20250226

1320

Proprietary

Nvidia

Llama-3.3-Nemotron-Super-49B-v1

1314

Nvidia

OpenAI

o3-mini

1322

Proprietary

OpenAI

o1-mini

1321

Proprietary

Tencent

Hunyuan-Turbo-0110

1314

Proprietary

Google

Gemini-1.5-Pro-002

1320

Proprietary

Anthropic

Claude 3.7 Sonnet (thinking-32k)

1315

Proprietary

Google

Gemma-3n-e4b-it

1309

Gemma

Anthropic

Claude 3.7 Sonnet

1308

Proprietary

xAI

Grok-2-08-13

1305

Proprietary

01 AI

Yi-Lightning

1305

Proprietary

OpenAI

GPT-4o-2024-05-13

1302

Proprietary

Alibaba

Qwen2.5-plus-1127

1300

Proprietary

Anthropic

Claude 3.5 Sonnet (20241022)

1301

Proprietary

DeepSeek

Deepseek-v2.5-1210

1297

DeepSeek

Google

Gemma-3-4B-it

1293

Gemma

Tencent

Hunyuan-Large-2025-02-10

1289

Proprietary

NexusFlow

Athene-v2-Chat-72B

1293

NexusFlow

Meta

Llama-4-Maverick-17B-128E-Instruct

1293

Llama 4

Zhipu AI

GLM-4-Plus

1291

Proprietary

OpenAI

GPT-4.1-nano-2025-04-14

1288

Proprietary

OpenAI

GPT-4o-mini-2024-07-18

1289

Proprietary

Google

Gemini-1.5-Flash-002

1289

Proprietary

Nvidia

Llama-3.1-Nemotron-70B-Instruct

1286

Llama 3.1

Meta

Meta-Llama-3.1-405B-Instruct-bf16

1286

Llama 3.1 Community

Anthropic

Claude 3.5 Sonnet (20240620)

1286

Proprietary

Meta

Meta-Llama-3.1-405B-Instruct-fp8

1285

Llama 3.1 Community

Google

Gemini Advanced App (2024-05-14)

1284

Proprietary

xAI

Grok-2-Mini-08-13

1284

Proprietary

OpenAI

GPT-4o-2024-08-06

1283

Proprietary

Alibaba

Qwen-Max-0919

1281

Qwen

Tencent

Hunyuan-Standard-2025-02-10

1278

Proprietary

Mistral

Mistral-Small-3.1-24B-Instruct-2503

1273

Apache 2.0

Google

Gemini-1.5-Pro-001

1277

Proprietary

DeepSeek

Deepseek-v2.5

1276

DeepSeek

Meta

Llama-3.3-70B-Instruct

1275

Llama-3.3

Alibaba

Qwen2.5-72B-Instruct

1275

Qwen

OpenAI

GPT-4-Turbo-2024-04-09

1274

Proprietary

Mistral

Mistral-Large-2407

1269

Mistral Research

NexusFlow

Athene-70B

1268

CC-BY-NC-4.0

OpenAI

GPT-4-1106-preview

1267

Proprietary

Mistral

Mistral-Large-2411

1266

MRL

Ai2

Llama-3.1-Tulu-3-70B

1262

Llama 3.1

Mistral

magistral-medium-2506

1262

Proprietary

Meta

Meta-Llama-3.1-70B-Instruct

1265

Llama 3.1 Community

Anthropic

Claude 3 Opus

1265

Proprietary

Amazon

Amazon Nova Pro 1.0

1262

Proprietary

OpenAI

GPT-4-0125-preview

1262

Proprietary

Anthropic

Claude 3.5 Haiku (20241022)

1257

Propretary

Reka AI

Reka-Core-20240904

1253

Proprietary

Google

Gemini-1.5-Flash-001

1244

Proprietary

AI21 Labs

Jamba-1.5-Large

1239

Jamba Open

Tencent

Hunyuan-Large-Vision

1236

Proprietary

Google

Gemma-2-27B-it

1237

Gemma license

Alibaba

Qwen2.5-Coder-32B-Instruct

1235

Apache 2.0

Mistral

Mistral-Small-24B-Instruct-2501

1235

Apache 2.0

Amazon

Amazon Nova Lite 1.0

1234

Proprietary

Princeton

Gemma-2-9B-it-SimPO

1234

MIT

Cohere

Command R+ (08-2024)

1233

CC-BY-NC-4.0

Nvidia

Llama-3.1-Nemotron-51B-Instruct

1229

Llama 3.1

Google

Gemini-1.5-Flash-8B-001

1230

Proprietary

Nvidia

Nemotron-4-340B-Instruct

1227

NVIDIA Open Model

100

Allen AI

OLMo-2-0325-32B-Instruct

1223

Apache-2.0

101

Cohere

Aya-Expanse-32B

1227

CC-BY-NC-4.0

102

Reka AI

Reka-Flash-20240904

1223

Proprietary

103

Zhipu AI

GLM-4-0520

1224

Proprietary

104

Meta

Llama-3-70B-Instruct

1224

Llama 3 Community

105

Microsoft

Phi-4

1223

MIT

106

Anthropic

Claude 3 Sonnet

1218

Proprietary

107

Amazon

Amazon Nova Micro 1.0

1215

Proprietary

108

Google

Gemma-2-9B-it

1209

Gemma license

109

Tencent

Hunyuan-Standard-256K

1206

Proprietary

110

Cohere

Command R+ (04-2024)

1207

CC-BY-NC-4.0

111

Ai2

Llama-3.1-Tulu-3-8B

1203

Llama 3.1

112

Alibaba

Qwen2-72B-Instruct

1205

Qianwen LICENSE

113

OpenAI

GPT-4-0314

1204

Proprietary

114

Mistral

Ministral-8B-2410

1200

MRL

115

Cohere

Aya-Expanse-8B

1197

CC-BY-NC-4.0

116

Cohere

Command R (08-2024)

1197

CC-BY-NC-4.0

117

Anthropic

Claude 3 Haiku

1197

Proprietary

118

DeepSeek AI

DeepSeek-Coder-V2-Instruct

1196

DeepSeek License

119

AI21 Labs

Jamba-1.5-Mini

1193

Jamba Open

120

Meta

Meta-Llama-3.1-8B-Instruct

1193

Llama 3.1 Community

121

OpenAI

GPT-4-0613

1180

Proprietary

122

Alibaba

Qwen1.5-110B-Chat

1179

Qianwen LICENSE

123

Alibaba

QwQ-32B-Preview

1170

Apache 2.0

124

01 AI

Yi-1.5-34B-Chat

1175

Apache-2.0

125

Mistral

Mistral-Large-2402

1175

Proprietary

126

Reka AI

Reka-Flash-21B-online

1173

Proprietary

127

Meta

Llama-3-8B-Instruct

1169

Llama 3 Community

128

InternLM

InternLM2.5-20B-chat

1166

Other

129

Reka AI

Reka-Flash-21B

1165

Proprietary

130

IBM

Granite-3.1-8B-Instruct

1160

Apache 2.0

131

Cohere

Command R (04-2024)

1166

CC-BY-NC-4.0

132

Mistral

Mistral Medium

1165

Proprietary

133

Mistral

Mixtral-8x22b-Instruct-v0.1

1165

Apache 2.0

134

Alibaba

Qwen1.5-72B-Chat

1165

Qianwen LICENSE

135

Google

Gemma-2-2b-it

1161

Gemma license

136

Google

Gemini-1.0-Pro-001

1149

Proprietary

137

HuggingFace

Zephyr-ORPO-141b-A35b-v0.1

1145

Apache 2.0

138

Alibaba

Qwen1.5-32B-Chat

1143

Qianwen LICENSE

139

IBM

Granite-3.1-2B-Instruct

1137

Apache 2.0

140

Microsoft

Phi-3-Medium-4k-Instruct

1140

MIT

141

Nexusflow

Starling-LM-7B-beta

1136

Apache-2.0

142

Mistral

Mixtral-8x7B-Instruct-v0.1

1131

Apache 2.0

143

01 AI

Yi-34B-Chat

1129

Yi License

144

Google

Gemini Pro

1128

Proprietary

145

Alibaba

Qwen1.5-14B-Chat

1126

Qianwen LICENSE

146

Microsoft

WizardLM-70B-v1.0

1124

Llama 2 Community

147

OpenAI

GPT-3.5-Turbo-0125

1123

Proprietary

148

Meta

Meta-Llama-3.2-3B-Instruct

1120

Llama 3.2

149

Databricks

DBRX-Instruct-Preview

1121

DBRX LICENSE

150

Microsoft

Phi-3-Small-8k-Instruct

1119

MIT

151

AllenAI/UW

Tulu-2-DPO-70B

1116

AI2 ImpACT Low-risk

152

IBM

Granite-3.0-8B-Instruct

1111

Apache 2.0

153

Meta

Llama-2-70B-chat

1110

Llama 2 Community

154

OpenChat

OpenChat-3.5-0106

1109

Apache-2.0

155

LMSYS

Vicuna-33B

1108

Non-commercial

156

Snowflake

Snowflake Arctic Instruct

1107

Apache 2.0

157

UC Berkeley

Starling-LM-7B-alpha

1106

CC-BY-NC-4.0

158

NousResearch

Nous-Hermes-2-Mixtral-8x7B-DPO

1102

Apache-2.0

159

Nvidia

NV-Llama2-70B-SteerLM-Chat

1098

Llama 2 Community

160

Google

Gemma-1.1-7B-it

1101

Gemma license

161

DeepSeek AI

DeepSeek-LLM-67B-Chat

1094

DeepSeek License

162

OpenChat

OpenChat-3.5

1094

Apache-2.0

163

IBM

Granite-3.0-2B-Instruct

1091

Apache 2.0

164

NousResearch

OpenHermes-2.5-Mistral-7B

1092

Apache-2.0

165

Alibaba

Qwen1.5-7B-Chat

1087

Qianwen LICENSE

166

Mistral

Mistral-7B-Instruct-v0.2

1090

Apache-2.0

167

Microsoft

Phi-3-Mini-4K-Instruct-June-24

1088

MIT

168

OpenAI

GPT-3.5-Turbo-1106

1085

Proprietary

169

Cognitive Computations

Dolphin-2.2.1-Mistral-7B

1080

Apache-2.0

170

Microsoft

Phi-3-Mini-4k-Instruct

1084

MIT

171

Upstage AI

SOLAR-10.7B-Instruct-v1.0

1080

CC-BY-NC-4.0

172

Meta

Llama-2-13b-chat

1081

Llama 2 Community

173

Microsoft

WizardLM-13b-v1.2

1076

Llama 2 Community

174

Meta

Meta-Llama-3.2-1B-Instruct

1071

Llama 3.2

175

HuggingFace

Zephyr-7B-beta

1071

MIT

176

HuggingFace

SmolLM2-1.7B-Instruct

1064

Apache 2.0

177

Meta

CodeLlama-70B-instruct

1059

Llama 2 Community

178

MosaicML

MPT-30B-chat

1063

CC-BY-NC-SA-4.0

179

HuggingFace

Zephyr-7B-alpha

1058

MIT

180

Meta

CodeLlama-34B-instruct

1060

Llama 2 Community

181

TII

falcon-180b-chat

1051

Falcon-180B TII License

182

LMSYS

Vicuna-13B

1059

Llama 2 Community

183

Google

Gemma-7B-it

1055

Gemma license

184

Microsoft

Phi-3-Mini-128k-Instruct

1054

MIT

185

Meta

Llama-2-7B-chat

1054

Llama 2 Community

186

Alibaba

Qwen-14B-Chat

1052

Qianwen LICENSE

187

Guanaco-33B

1050

Non-commercial

188

Google

Gemma-1.1-2b-it

1038

Gemma license

189

Together AI

StripedHyena-Nous-7B

1035

Apache 2.0

190

Allen AI

OLMo-7B-instruct

1033

Apache-2.0

191

Mistral

Mistral-7B-Instruct-v0.1

1025

Apache 2.0

192

LMSYS

Vicuna-7B

1022

Llama 2 Community

193

Google

PaLM-Chat-Bison-001

1021

Proprietary

194

Google

Gemma-2B-it

1007

Gemma license

195

Alibaba

Qwen1.5-4B-Chat

1006

Qianwen LICENSE

196

UC Berkeley

Koala-13B

982

Non-commercial

197

Tsinghua

ChatGLM3-6B

972

Apache-2.0

198

Nomic AI

GPT4All-13B-Snoozy

950

Non-commercial

199

MosaicML

MPT-7B-Chat

946

CC-BY-NC-SA-4.0

200

Tsinghua

ChatGLM2-6B

942

Apache-2.0

201

RWKV

RWKV-4-Raven-14B

939

Apache 2.0

202

Stanford

Alpaca-13B

919

Non-commercial

203

OpenAssistant

OpenAssistant-Pythia-12B

911

Apache 2.0

204

Tsinghua

ChatGLM-6B

896

Non-commercial

205

LMSYS

FastChat-T5-3B

885

Apache 2.0

206

Stability AI

StableLM-Tuned-Alpha-7B

857

CC-BY-NC-SA-4.0

207

Databricks

Dolly-V2-12B

840

MIT

208

Meta

LLaMA-13B

817

Non-commercial

FAQ

What is AI Leaderboards?

AI Leaderboards is a free website that focuses on ranking the top AI models for users. The main goal of the website is to allow people to see what AI models are the best and the most helpful. Likewise, it also allows people to see how open sourced AI models such as Llama and Deepseek compare to closed models like OpenAI or Gemini. The AI rankings are also live and updated daily, so users are able to come back any time to see how new AI models perform and which is the best to use. If you have any feedback for how to improve the website, feel free to email me. I would love to hear your suggestions.

How are the chatbot models ranked?

The chatbot models are ranked based on their overall ability to answer user prompts. To get the results, users are polled. They are given a prompt and shown two results from two different models, but not told which model the result is from. After thousands of user votes, we end up with the most accurate AI rankings possible. This method also ensures that the models at the top perform as well as possible towards answering the users prompts.

Where is the LLM leaderboard data from?

The LLM leaderboard data is gathered from the Chatbot Arena Leaderboard, also known as LMSYS. Chatbot Arena is an open source project that focuses on ranking and benchmarking AI and LLM performance. The main way they do this is by crowdsourcing the rankings of the AI models results. Each model is given a prompt, then the results are saved. Once in the pool of results, users rank the results on which they believe better answers the prompt. Then after thousands of results, we end up with the most accurate ranking for the language chatbot models. If you wish to support the project, it can be found at https://lmarena.ai/.

Are the AI model rankings live?

Yes, the AI Leaderboard is live and updated every day to get the latest AI rankings. This ensures that users get the most accurate results from the Chatbot Arena.