Ollama by VRAM
Pick a command from the clean-fit tier that matches your GPU, then open the calculator when context length, quantization, or purpose matters.
Small local models and quick assistants.
Coding helpers, agents, and stronger local chat models.
Larger local models and heavier experiments.
Clean Q4 fit for 8 GB VRAM mainstream GPU.
Clean Q4 fit for 8 GB VRAM mainstream GPU.
Clean Q4 fit for 8 GB VRAM mainstream GPU.
Clean Q4 fit for 8 GB VRAM mainstream GPU.
Clean Q4 fit for 12 GB VRAM local agent GPU.
Clean Q4 fit for 12 GB VRAM local agent GPU.
Clean Q4 fit for 12 GB VRAM local agent GPU.
Clean Q4 fit for 12 GB VRAM local agent GPU.
Clean Q4 fit for 24 GB VRAM homelab workstation.
Clean Q4 fit for 24 GB VRAM homelab workstation.
Clean Q4 fit for 24 GB VRAM homelab workstation.
Clean Q4 fit for 24 GB VRAM homelab workstation.
Clean Q4 fit for 48 GB VRAM workstation.
Clean Q4 fit for 48 GB VRAM workstation.
Clean Q4 fit for 48 GB VRAM workstation.
Clean Q4 fit for 48 GB VRAM workstation.