Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricGGUFsafetensorsExLlamaV2
ScoreA-83A-83D47
Type
Executionaotaotaot
Interfaceembeddedembeddedsdk
Cold Start<1ms<1ms1000ms
Memory0MB0MB300MB
Startup<1ms<1ms200ms
Isolationprocessprocessprocess
Maturityproductionproductionstable
LanguagesAnyAnyPython, C++, CUDA
LicenseMITApache-2.0MIT
Links