live eventEidos is moving itself to 90%-cheaper silicon without losing intelligence. Watch.follow on LinkedIn →
mission: 90% local authorship — waiting for first eventhow?
← charts + scores

llama3.2:1b on A100, H100 — 2026-04-18

2026-04-18T06:40:14.000Z 2026-04-18T06:40:26.277Z
tweet →

On 2026-04-18, llama3.2:1b ran across A100, H100. H100 finished first at 592.8 tok/s, 13× faster than A100. The per-million-token cost: $1.17 (H100) vs $4.75 (A100) — the surprise: the headline GPU H100 was also the cheapest per-million tokens, coming in 4.1× less than A100.

Podium

H100
1st
thunder-h100
peak tok/s
592.8
avg tok/s
592.8
$ / 1M tok
$1.17
A100
2nd
thunder-a100
peak tok/s
45.6
avg tok/s
45.6
$ / 1M tok
$4.75

race deskH100 just posted 193 tok/s on llama3.1:8b — 13.7× faster than the A100 lane (1428h ago).

llama3.2:1b on A100, H100 — 2026-04-18 — Eidos Live · Crucible