GGUF Memory Estimator

Estimate VRAM requirements for any GGUF model from HuggingFace GitHub ↗

📂 Model Input

📊 KV Cache Quantization

Hardware (tokens/sec)


Fetching model metadata...
📦

Enter a HuggingFace model path and click Resolve to get started.

🧦 Model Info
💰 Model Weights
Total weight size -
Quantization Tensors Elements Size
📋 KV Cache
K cache (F16) -
V cache (F16) -
KV layers -
KV heads (GQA) -
Activations
Activation memory (FP32) -
📊 Memory Requirements
VRAM required -
Weights -
KV cache -
Activations -

RAM required -

Total system memory -
🔋 System Fit Check
VRAM -
-
-