Local LLM Hub

Run LLMs on your own hardware. Find the right launcher, engine, and configuration for your setup.

NVIDIA-first. Mac-strong. Pick your GPU, get your stack.

Quick Start: CPU Only (16GB RAM)

Beginner (gguf)
ollama + llama.cpp
Quant: Q4_K_M
3B-7B models. Slow, but it works.
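A minimal first run in Python, as a sketch: it assumes Ollama is installed and serving locally, that the ollama pip package is available, and that a small Q4_K_M model such as llama3.2:3b has been pulled (the tag is an example, not part of this listing). At Q4_K_M (roughly 4.5 bits per weight) even a 7B model needs only about 4-5 GB for weights, so it fits easily in 16GB of RAM.

  # pip install ollama -- client for the local Ollama server (default port 11434)
  import ollama

  # Chat with a small quantized model; Ollama falls back to CPU if no GPU is found.
  reply = ollama.chat(
      model="llama3.2:3b",  # assumed example tag; any pulled gguf model works
      messages=[{"role": "user", "content": "In one sentence, what is GGUF?"}],
  )
  print(reply.message.content)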
GUI (gguf)
lm-studio + llama.cpp
Quant: Q4_K_M
Runs in CPU inference mode.
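LM Studio is GUI-driven, but its local server (OpenAI-compatible, default http://localhost:1234/v1) can also be scripted. A sketch assuming the openai pip package and a model already loaded in LM Studio; the model identifier is a placeholder.

  # pip install openai -- LM Studio exposes an OpenAI-compatible endpoint
  from openai import OpenAI

  # No real key is needed for the local server; "lm-studio" is the conventional dummy value.
  client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

  resp = client.chat.completions.create(
      model="local-model",  # placeholder; LM Studio serves whichever model is loaded
      messages=[{"role": "user", "content": "Hello from CPU inference!"}],
  )
  print(resp.choices[0].message.content)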
Local LLM tools (1)
Name: LLM (Python CLI)
Description: Access large language models from the command line
Role: Tool
Backends: cuda, metal, cpu
Formats: gguf
Score: 52 (C-)
Install: macOS 🍎, Linux 🐧
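The listed tool is also usable as a Python library, mirroring its CLI. A sketch assuming a local-model plugin such as llm-gpt4all has been added (llm install llm-gpt4all); the model ID below is illustrative, not a recommendation from this listing.

  # pip install llm -- same tool as the CLI, importable as a library
  import llm

  # Resolve a model by ID; with a local plugin this loads a gguf file,
  # downloading it on first use.
  model = llm.get_model("orca-mini-3b-gguf2-q4_0")  # assumed example ID
  response = model.prompt("Three facts about penguins")
  print(response.text())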