Local AI Compatibility

Can my GPU run this LLM?

Choose a purpose, check your GPU, and get a practical local or cloud route.

Choose a setup

Pick what you want to run. The traffic light checks the selected GPU and RAM.
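
As a rough illustration of how a traffic-light check like this could work, here is a minimal sketch. The function name `traffic_light` and the three thresholds are assumptions made for the example, not the app's actual rules.

```python
def traffic_light(need_gb: float, vram_gb: float, ram_gb: float) -> str:
    """Classify a setup by memory fit.

    Hypothetical thresholds for illustration only; the real check may
    weigh speed, headroom, and backend overhead differently.
    """
    if need_gb <= vram_gb:
        return "green"   # model fits entirely in GPU memory
    if need_gb <= vram_gb + ram_gb:
        return "yellow"  # may run with partial CPU offloading, but slower
    return "red"         # does not fit locally; a cloud route is more practical


if __name__ == "__main__":
    # Example: a model needing 12 GB on an 8 GB GPU with 16 GB system RAM.
    print(traffic_light(need_gb=12, vram_gb=8, ram_gb=16))
```

In this sketch, yellow means the model only fits once system RAM is counted, which usually implies offloading some layers to the CPU.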

Start with hardware and purpose. Fine-tune model settings only when needed.

Choose a purpose. The app fit check loads with the application dataset.
Local fit --
VRAM target --
Route --
Best local -- --
Stretch -- --
Fallback -- --
Advanced model settings
Q4 / 4-bit

Default local inference balance.
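
To see why 4-bit is a common default, it helps to estimate weight memory from parameter count and bits per weight. The sketch below uses a hypothetical helper, `weight_memory_gb`, and counts weights only. It ignores KV cache, activations, and runtime overhead. Real 4-bit quantization files often average somewhat more than 4 bits per weight because they also store scaling data.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same bit width.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # Compare a 7B-parameter model at 16-bit and 4-bit precision.
    for bits in (16, 4):
        print(f"{bits}-bit: {weight_memory_gb(7, bits):.1f} GB")
```

Under these assumptions a 7B model needs about 14 GB at 16-bit and about 3.5 GB at 4-bit, which is why the lower precision brings many models within reach of consumer GPUs.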

Context planning

Long context increases KV-cache pressure.
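
The pressure from long context can be approximated with the standard KV-cache size formula: two tensors (keys and values) per layer, each holding one vector per KV head per token. The sketch below is an estimate under stated assumptions. The function name `kv_cache_gib` and the model shape in the example are illustrative and not taken from the calculator.

```python
def kv_cache_gib(
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    context_len: int,
    bytes_per_elem: int = 2,  # assumes a 16-bit cache
) -> float:
    """Approximate KV-cache size in GiB for one sequence."""
    # Factor of 2 covers the separate key and value tensors.
    total_bytes = (
        2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem
    )
    return total_bytes / 2**30


if __name__ == "__main__":
    # Hypothetical model shape: 32 layers, 8 KV heads, head dimension 128.
    for ctx in (8192, 16384, 32768):
        print(f"context {ctx}: {kv_cache_gib(32, 8, 128, ctx):.1f} GiB")
```

The cache grows linearly with context length, so doubling the context doubles this part of the memory budget on top of the model weights.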

Ollama
ollama run ...
Technical details
Need -- GB
VRAM fit -- GB
Speed --

Compatibility is an estimate for planning. Real memory and speed depend on backend, context length, KV cache, drivers, quantization file, and offloading settings.