Compare Runtimes


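| Metric | GGUF | Python (CPython) | llama.cpp | ExLlamaV2 |
|---|---|---|---|---|
| Score | A- 83 | B 71 | C+ 63 | D 47 |
| Execution | aot | interpreted | aot | aot |
| Interface | embedded | cli | cli | sdk |
| Cold Start | <1 ms | 50 ms | 100 ms | 1000 ms |
| Memory | 0 MB | 15 MB | 50 MB | 300 MB |
| Startup | <1 ms | 10 ms | 10 ms | 200 ms |
| Isolation | process | process | process | process |
| Maturity | production | production | production | stable |
| Languages | Any | Python | C, C++ | Python, C++, CUDA |
| License | MIT | Other | MIT | MIT |

Type: Language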