qwen2.5:14b on A100, H100 — 2026-04-18
2026-04-18T06:31:18.000Z → 2026-04-18T06:32:00.194Z
On 2026-04-18, qwen2.5:14b ran across A100, H100. H100 finished first at 106.5 tok/s, 14× faster than A100. The per-million-token cost: $6.49 (H100) vs $27.78 (A100) — the surprise: the headline GPU H100 was also the cheapest per-million tokens, coming in 4.3× less than A100.
Podium
H100
1st
thunder-h100
- peak tok/s
- 106.5
- avg tok/s
- 106.5
- $ / 1M tok
- $6.49
A100
2nd
thunder-a100
- peak tok/s
- 7.8
- avg tok/s
- 7.8
- $ / 1M tok
- $27.78