Fill the Data
per hour:
or over 3 years:
per hour
or over 3 years
per hour
or over 3 years
Break-Even Analysis
% of Reuses/Hours to Break Even:
0.00%
Cost Savings ($/3y)
% Cache Hits
Model
Parameters
KV Cache Size
β
Tokens/sec/GPU
β
Prefill vs Cache Speed Ratio
β
Storage
Calculations
Additional Storage Cost/Hour
β
Additional Storage Cost/3y
β
Processing
Calculations
Number of Tokens cached
β
GPU Hours to Prefill
β
GPU Cost to Prefill
β
GPU Cost with Cache
β