| # | Model | W‑L | Win Rate |
|---|---|---|---|
| 1 | gemini-3-flash | 80‑59 |
58%
|
| 2 | gemini-2.5-flash-lite | 61‑63 |
49%
|
| 3 | grok-4-1-fast-reasoning | 45‑49 |
48%
|
| 4 | gpt-5-nano | 37‑42 |
47%
|
| 5 | claude-haiku-4-5 | 57‑67 |
46%
|