Signed benchmark leaderboard

52 cryptographically signed, hardware-fingerprinted benchmark envelopes across 11 suites. Every row ties to a real run you can re-verify with bench verify. Sourced from the public corpus on Hugging Face.

| Model | Engine | Hardware | pass@1 | TTFT p50 | Latency p50 | Tokens out | Timeout | OK rate | Signed |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Phi-3.5-mini-instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 12 ms | 975 ms | 1,166 | 0% | 100% | |
| DeepSeek-Coder-V2-Lite-Instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 58 ms | 1244 ms | 789 | 0% | 100% | |
| Llama-3.1-8B-Instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 16 ms | 1949 ms | 1,589 | 0% | 100% | |
| Qwen2.5-Coder-7B-Instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 15 ms | 1141 ms | 944 | 0% | 100% | |
| Qwen2.5-7B-Instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 15 ms | 1699 ms | 1,281 | 0% | 100% | |
| Llama-3.1-70B-Instruct | vllm | 8x NVIDIA H100 80GB HBM3 | 80% | 28 ms | 4338 ms | 1,653 | 0% | 100% | |
| Mistral-7B-Instruct-v0.3 | vllm | 8x NVIDIA H100 80GB HBM3 | 80% | 15 ms | 1243 ms | 824 | 0% | 100% | |
| gemma-2-9b-it | vllm | 8x NVIDIA H100 80GB HBM3 | 100% | 18 ms | 2989 ms | 1,521 | 0% | 100% | |
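
For readers unfamiliar with the column names, here is a minimal sketch of how figures like "TTFT p50" and "pass@1" are commonly derived. It assumes p50 means the median over per-request samples and pass@1 means the share of tasks solved on a single attempt. The page does not spell out its exact definitions, and the function names and sample values below are illustrative, not taken from any envelope.

```python
from statistics import median


def p50(samples_ms):
    """Median (50th percentile) of per-request timings in milliseconds."""
    return median(samples_ms)


def pass_at_1(outcomes):
    """Percentage of tasks whose single attempt passed."""
    return 100.0 * sum(outcomes) / len(outcomes)


# Illustrative values only; these are not from the leaderboard corpus.
ttft_ms = [14, 15, 15, 16, 40]
outcomes = [True, True, True, True, False]

print(f"TTFT p50: {p50(ttft_ms)} ms")
print(f"pass@1: {pass_at_1(outcomes):.0f}%")
```

A median is insensitive to a single slow outlier (the 40 ms sample above), which is one reason p50 is often reported alongside tail percentiles.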

Every signed row carries a hardware fingerprint, software provenance, dataset hash, and seed. Re-verify any envelope:

pip install inferencebench
bench fetch hf://datasets/Yobitel/marathon-keyless-v0.0.2/<file>
bench verify ~/.cache/inferencebench/fetched/*.json \
  --require-issuer https://token.actions.githubusercontent.com \
  --require-identity-pattern 'github\.com/yobitelcomm/bench/'
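
To illustrate what the identity pattern in the command above accepts, here is a small sketch using Python's `re` module. It assumes the signer identity is a GitHub Actions workflow URL and that the pattern is applied as an unanchored regular-expression search. Neither detail is confirmed by this page, and the identity strings and the `identity_matches` helper are hypothetical.

```python
import re

# Pattern copied from the bench verify command above.
IDENTITY_PATTERN = r"github\.com/yobitelcomm/bench/"


def identity_matches(identity: str, pattern: str = IDENTITY_PATTERN) -> bool:
    # Assumption: the pattern is searched anywhere in the identity string.
    return re.search(pattern, identity) is not None


# Hypothetical identity strings, for illustration only.
expected = "https://github.com/yobitelcomm/bench/.github/workflows/release.yml@refs/heads/main"
other_repo = "https://github.com/someone-else/bench/.github/workflows/release.yml@refs/heads/main"
lookalike = "https://githubXcom/yobitelcomm/bench/"

for candidate in (expected, other_repo, lookalike):
    print(candidate, identity_matches(candidate))
```

The escaped dot in `github\.com` matters: an unescaped `.` would match any character and let the lookalike host through.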

Source + methodology on GitHub