Skip to content

Cost to run

Gemini 2.0 Flash

proprietary/gemini-2.0-flash

Family
Gemini
Context
1,048,576 tokens

Gemini 2.0 Flash can be run self-hosted (rent a GPU + run vLLM/TGI) or through a serverless API (pay per token). Live pricing comparisons:

Need to benchmark this model against another? Try the calculator or see where it ranks on the InferenceScore leaderboard.