Benchmark Management

Submit, review, and manage community benchmark datasets.

Total Datasets: 146
Total Datapoints: 4053
Models Covered: 73
GPUs Covered: 10

Submit Benchmark Results

Contribute your benchmark data for model + GPU combinations.

ISL | OSL | Concurrency | Throughput (t/s) | TTFT (ms)
Trust Score: 45/100
Model ID is required
GPU ID is required
Framework is required
Precision is required
Invalid throughput: 0
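
The validation messages above suggest required-field checks plus a range check on throughput. The following is a minimal Python sketch of such checks. The field names (`model_id`, `gpu_id`, `framework`, `precision`, `throughput`) and the `Submission` class are hypothetical, since the page does not show the real form schema, and the Trust Score calculation is left out because the page does not describe it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Submission:
    # Hypothetical field names; the actual form schema is not shown on the page.
    model_id: str = ""
    gpu_id: str = ""
    framework: str = ""
    precision: str = ""
    throughput: float = 0.0  # tokens per second


def validate(sub: Submission) -> List[str]:
    """Return a list of error messages; an empty list means the submission passes."""
    errors: List[str] = []
    required = [
        ("Model ID", sub.model_id),
        ("GPU ID", sub.gpu_id),
        ("Framework", sub.framework),
        ("Precision", sub.precision),
    ]
    for label, value in required:
        if not value.strip():
            errors.append(f"{label} is required")
    # Assumption: throughput must be strictly positive to be meaningful.
    if sub.throughput <= 0:
        errors.append(f"Invalid throughput: {sub.throughput:g}")
    return errors


if __name__ == "__main__":
    for message in validate(Submission()):
        print(message)
```

Run against an empty submission, this sketch yields the same five messages listed above, in the same order.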

Existing Benchmark Datasets

| Model | GPU | GPUs/Node | Nodes | Framework | Precision | Datapoints | Run Date |
|---|---|---|---|---|---|---|---|
| snowflake/arctic-480b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| baai/bge-large-en-v1.5 | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/codellama-13b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/codellama-34b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/codellama-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| cohere/command-r-plus | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| deepseek/deepseek-coder-v2-236b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| deepseek/deepseek-r1 | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-02-20 |
| deepseek/deepseek-r1 | nvidia-h200-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| deepseek/deepseek-v3 | nvidia-a100-80gb-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| deepseek/deepseek-v3 | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-02-15 |
| deepseek-ai/deepseek-v3 | nvidia-h200 | 8 | 1 | vllm | bf16 | 18 | 2025-02-10 |
| intfloat/e5-mistral-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| falcon/falcon-180b | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| falcon/falcon-40b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| falcon/falcon-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-2-27b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-2-27b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-28 |
| google/gemma-2-27b | nvidia-h200-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-2-2b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-2-9b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-2-9b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-3-12b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-3-1b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| google/gemma-3-4b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| falcon/falcon-7b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| falcon/falcon-7b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| google/gemma-1.1-2b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| google/gemma-1.1-2b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| google/gemma-2-2b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| google/gemma-2-2b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| google/gemma-2-9b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| sensetime/internlm-20b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| meta-llama/llama-2-7b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| meta-llama/llama-2-7b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| meta-llama/llama-3-8b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| meta-llama/llama-3.1-8b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| mistral/mistral-7b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| mistral/mistral-7b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| mistral/mistral-7b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| microsoft/phi-1.5 | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| microsoft/phi-1.5 | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| microsoft/phi-1.5 | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| microsoft/phi-3-mini-3.8b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| microsoft/phi-3-mini-3.8b | nvidia-t4 | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| alibaba/qwen-1.5-moe-a2.7b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-0.5b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-0.5b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-0.5b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-1.5b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-1.5b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-1.5b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| alibaba/qwen-2.5-14b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| alibaba/qwen-2.5-32b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-7b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-7b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-2.5-7b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-3-4b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| qwen/qwen-3-4b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| google/recurrentgemma-2b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| google/recurrentgemma-2b | nvidia-t4 | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| stabilityai/stablelm-2-12b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| stabilityai/stablelm-zephyr-3b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| stabilityai/stablelm-zephyr-3b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| 01-ai/yi-1.5-34b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| 01-ai/yi-1.5-9b | nvidia-a10g | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| 01-ai/yi-1.5-9b | nvidia-a100-80gb-sxm | 1 | 1 | pytorch | bf16 | 48 | 2024-12-13 |
| 01-ai/yi-1.5-9b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| huggingface/zephyr-7b | nvidia-a10g | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| huggingface/zephyr-7b | nvidia-t4 | 1 | 1 | pytorch | fp16 | 48 | 2024-12-13 |
| internlm/internlm2.5-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| ai21/jamba-1.5-large | nvidia-h100-sxm | 8 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| kimi-k2.5 | nvidia-b200 | 8 | 1 | vllm | bf16 | 90 | 2025-03-01 |
| meta-llama/llama-3.1-405b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 19 | 2025-01-20 |
| meta-llama/llama-3.1-405b | nvidia-h200-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| meta-llama/llama-3.1-70b | nvidia-a100-80gb-sxm | 2 | 1 | vllm | bf16 | 19 | 2025-01-22 |
| meta-llama/llama-3.1-70b | nvidia-h100-sxm | 8 | 1 | vllm | bf16 | 18 | 2025-01-15 |
| meta-llama/llama-3.1-70b | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 19 | 2025-02-10 |
| meta-llama/llama-3.1-70b | nvidia-h200-sxm | 4 | 1 | vllm | bf16 | 19 | 2025-02-18 |
| meta-llama/llama-3.1-70b | nvidia-l40s | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.1-8b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-22 |
| meta-llama/llama-3.1-8b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| meta-llama/llama-3.1-8b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-20 |
| meta-llama/llama-3.1-8b | nvidia-h200-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-02-18 |
| meta-llama/llama-3.1-8b | nvidia-l40s | 1 | 1 | vllm | bf16 | 15 | 2025-02-05 |
| meta-llama/llama-3.1-8b | nvidia-rtx-3090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.1-8b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 17 | 2025-01-25 |
| meta-llama/llama-3.2-11b-vision | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.2-1b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.2-1b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.2-1b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.2-3b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.2-90b-vision | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.3-70b | nvidia-a100-80gb-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-3.3-70b | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 19 | 2025-02-12 |
| meta-llama/llama-3.3-70b | nvidia-h200-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| meta-llama/llama-4-maverick-400b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| meta-llama/llama-4-scout-17b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| nvidia/minitron-8b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mistral-7b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-20 |
| mistral/mistral-7b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| mistral/mistral-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-18 |
| mistral/mistral-7b | nvidia-h200-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mistral-7b | nvidia-l40s | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mistral-7b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mistral-large | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mistral-nemo-12b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-01-30 |
| mistral/mistral-small-24b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mixtral-8x22b | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mixtral-8x7b | nvidia-a100-80gb-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| mistral/mixtral-8x7b | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 19 | 2025-01-25 |
| nvidia/nemotron-15b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| nvidia/nemotron-340b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| nvidia/nemotron-70b | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-2 | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-2 | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-3-mini-3.8b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-4 | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-4 | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-02-01 |
| microsoft/phi-4 | nvidia-h200-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-4 | nvidia-l40s | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| microsoft/phi-4 | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-14b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-32b | nvidia-a100-80gb-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-32b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-3b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-3b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| alibaba/qwen-2.5-3b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-72b | nvidia-a100-80gb-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-72b | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 19 | 2025-02-05 |
| qwen/qwen-2.5-72b | nvidia-h200-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-7b | nvidia-a100-80gb-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-02-08 |
| qwen/qwen-2.5-7b | nvidia-h200-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-7b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-2.5-vl-72b | nvidia-h100-sxm | 4 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-3-235b | nvidia-h100-sxm | 8 | 1 | vllm | fp8 | 18 | 2025-03-15 |
| qwen/qwen-3-4b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| qwen/qwen-3-8b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| bigcode/starcoder2-15b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| bigcode/starcoder2-7b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| tinyllama/tinyllama-1.1b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| tinyllama/tinyllama-1.1b | nvidia-rtx-4090 | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| nvidia/vila-1.5-40b | nvidia-h100-sxm | 2 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| 01-ai/yi-1.5-34b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
| 01-ai/yi-1.5-9b | nvidia-h100-sxm | 1 | 1 | vllm | bf16 | 18 | 2025-03-15 |
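
The summary figures at the top of the page (Total Datasets, Total Datapoints, Models Covered, GPUs Covered) look like simple aggregates over the dataset rows above. The Python sketch below shows one way such figures could be derived. The `DatasetRow` type and `summarize` function are hypothetical, the sample uses only four rows copied from the table, and the dashboard may normalise model or GPU IDs differently before counting.

```python
from typing import Dict, List, NamedTuple


class DatasetRow(NamedTuple):
    # Hypothetical record type covering only the columns needed for the summary.
    model: str
    gpu: str
    datapoints: int


def summarize(rows: List[DatasetRow]) -> Dict[str, int]:
    """Aggregate dataset rows into the four summary figures."""
    return {
        "total_datasets": len(rows),
        "total_datapoints": sum(r.datapoints for r in rows),
        "models_covered": len({r.model for r in rows}),  # distinct model IDs
        "gpus_covered": len({r.gpu for r in rows}),      # distinct GPU IDs
    }


# Four sample rows taken from the table above (not the full dataset list).
sample = [
    DatasetRow("deepseek/deepseek-r1", "nvidia-h100-sxm", 18),
    DatasetRow("deepseek/deepseek-r1", "nvidia-h200-sxm", 18),
    DatasetRow("falcon/falcon-7b", "nvidia-t4", 48),
    DatasetRow("kimi-k2.5", "nvidia-b200", 90),
]

if __name__ == "__main__":
    print(summarize(sample))
```

Counting distinct raw IDs treats `deepseek/deepseek-v3` and `deepseek-ai/deepseek-v3` as separate models, so a real implementation would likely need an ID normalisation step first.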