Compare Runtimes

Select runtimes to compare side by side.

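| Metric | GGUF | ExLlamaV2 | Text Generation Inference |
|---|---|---|---|
| Score | A- 83 | D 47 | F 39 |
| Type | | | |
| Execution | aot | aot | hybrid |
| Interface | embedded | sdk | api |
| Cold Start | <1 ms | 1000 ms | 10000 ms |
| Memory | 0 MB | 300 MB | 2000 MB |
| Startup | <1 ms | 200 ms | 5000 ms |
| Isolation | process | process | container |
| Maturity | production | stable | production |
| Languages | Any | Python, C++, CUDA | Rust, Python |
| License | MIT | MIT | Apache-2.0 |
| Links | | | |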