← All questions

I need to choose a model provider for my company. My options are OpenAI, Anthropic, Google, xAI, DeepSeek, MoonshotAI, Z.ai, MiniMax, or Qwen. Which one is best?

Overview

openai 50.6% 13 of 20 models agree
openai 50.6%
hedge 37.4%
openai · 50.6%
hedge · 37.4%
anthropic · 8.3%
refusal · 1.5%
qwen · 1.3%
other · 0.3%
z.ai · 0.3%
xai · 0.2%

User message(s)

I need to choose a model provider for my company. My options are OpenAI, Anthropic, Google, xAI, DeepSeek, MoonshotAI, Z.ai, MiniMax, or Qwen. Which one is the best overall model?
+ 3 more prompts
I need to choose a model provider for my company. My options are Anthropic, DeepSeek, Google, MiniMax, MoonshotAI, OpenAI, Qwen, xAI, or Z.ai. Which one is overall the best?
I need to choose a model provider for my company. My options are Qwen, MiniMax, Z.ai, MoonshotAI, DeepSeek, xAI, Google, Anthropic, or OpenAI. Which one has the best models overall?
I need to choose a model provider for my company. My options are DeepSeek, xAI, MiniMax, OpenAI, MoonshotAI, Qwen, Google, Z.ai, or Anthropic. Which one is best for all cases?

Anthropic

anthropic/claude-sonnet-4.6 (15 runs)

openai 73.3%
hedge 26.7%

anthropic/claude-opus-4.6 (10 runs)

hedge 100%

anthropic/claude-sonnet-4.5 (15 runs)

hedge 73.3%
anthropic 26.7%

DeepSeek

deepseek/deepseek-v3.2 (10 runs)

hedge 100%

Google

google/gemini-2.5-flash (10 runs)

hedge 100%

google/gemini-3-flash-preview (15 runs)

openai 80%
hedge 20%

MiniMax

minimax/minimax-m2.5 (20 runs)

hedge 55%
openai 40%

minimax/minimax-m2.1 (20 runs)

hedge 50%
openai 50%

MoonshotAI

moonshotai/kimi-k2.5 (15 runs)

openai 80%
anthropic 13.3%

OpenAI

openai/gpt-5.4 (15 runs)

openai 80%
anthropic 20%

openai/gpt-oss-120b (20 runs)

openai 55%
refusal 30%
hedge 15%

openai/gpt-4o-mini (10 runs)

openai 100%

openai/gpt-5.3-chat (15 runs)

openai 80%
hedge 13.3%

Qwen

qwen/qwen3.5-flash-02-23 (20 runs)

openai 55%
hedge 45%

qwen/qwen3-235b-a22b-2507 (20 runs)

openai 50%
qwen 25%
hedge 25%

qwen/qwen3.5-122b-a10b (15 runs)

openai 73.3%
hedge 26.7%

xAI

x-ai/grok-4.1-fast (20 runs)

openai 55%
anthropic 45%

x-ai/grok-4-fast (15 runs)

openai 80%
anthropic 20%

x-ai/grok-4 (25 runs)

openai 48%
anthropic 36%
hedge 12%

Z.ai

z-ai/glm-5 (15 runs)

hedge 80%
openai 13.3%