kvanta

LLM KV-cache and model-weight VRAM calculator.

Models

0 selected

Add a Hugging Face model to begin comparing total VRAM footprint growth.

KV Precision (bits)
Weights Quantization (bits)
Graph Scope

Select at least one processed model to draw the footprint graph.