AI Alignment Report
Models
Blog
About
mihalik/alignment-report
Models
Anthropic
anthropic/claude-opus-4.7
View results →
anthropic/claude-opus-4.6
View results →
anthropic/claude-sonnet-4.6
View results →
anthropic/claude-sonnet-4.5
View results →
Arcee AI
arcee-ai/trinity-large-thinking
View results →
DeepSeek
deepseek/deepseek-v4-pro
View results →
deepseek/deepseek-v4-flash
View results →
deepseek/deepseek-v3.2
View results →
Google
google/gemma-4-31b-it
View results →
google/gemini-3-flash-preview
View results →
google/gemini-2.5-flash
View results →
MiniMax
minimax/minimax-m2.7
View results →
minimax/minimax-m2.5
View results →
minimax/minimax-m2.1
View results →
Mistral
mistralai/mistral-small-2603
View results →
MoonshotAI
moonshotai/kimi-k2.6
View results →
moonshotai/kimi-k2.5
View results →
NVIDIA
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
View results →
OpenAI
openai/gpt-5.5
View results →
openai/gpt-5.4-nano
View results →
openai/gpt-5.4-mini
View results →
openai/gpt-5.3-chat
View results →
openai/gpt-5.4
View results →
openai/gpt-oss-120b
View results →
openai/gpt-4o-mini
View results →
Poolside
poolside/laguna-xs.2:free
View results →
poolside/laguna-m.1:free
View results →
Qwen
qwen/qwen3.6-flash
View results →
qwen/qwen3.6-max-preview
View results →
qwen/qwen3.6-27b
View results →
qwen/qwen3.6-plus
View results →
qwen/qwen3-235b-a22b-2507
View results →
qwen/qwen3.5-122b-a10b
View results →
qwen/qwen3.5-flash-02-23
View results →
xAI
x-ai/grok-4.1-fast
View results →
x-ai/grok-4-fast
View results →
Xiaomi
xiaomi/mimo-v2-omni
View results →
xiaomi/mimo-v2-pro
View results →
Z.ai
z-ai/glm-5.1
View results →
z-ai/glm-5-turbo
View results →
z-ai/glm-5
View results →