GPU compatibility
Ultra high-end consumer workstation for 30B+ models with extra headroom. Examples: RTX 5090.
This preset has enough clean Q4 headroom for most curated local models in this dataset.
22 models fit inside the clean planning capacity.
2 models can run with RAM/offload tradeoffs.
Examples to avoid locally: None in this dataset.
Small multimodal local assistant and low-resource setups
Small local coding assistant and agent tool generation
Fast local chat and simple agent tasks
Fast local chat, lightweight agents, low-cost local testing