Run LLMs on your own hardware. Find the right launcher, engine, and configuration for your setup.
NVIDIA-first. Mac-strong. Pick your GPU, get your stack.
| Name | Role | Backends | Formats | Score | Install |
|---|---|---|---|---|---|
| GGUF GPT-Generated Unified Format for efficient LLM storage | Format | cuda, metal, rocm... | - | — | |
| safetensors Safe and fast tensor serialization format by Hugging Face | Format | cuda, metal, rocm... | - | — |