Compare Runtimes

Metric     | ExLlamaV2         | Open WebUI         | Text Generation Inference
Score      | D47               | F41                | F39
Execution  | aot               | hybrid             | hybrid
Interface  | sdk               | gui                | api
Cold Start | 1000 ms           | 3000 ms            | 10000 ms
Memory     | 300 MB            | 500 MB             | 2000 MB
Startup    | 200 ms            | 1000 ms            | 5000 ms
Isolation  | process           | container          | container
Maturity   | stable            | stable             | production
Languages  | Python, C++, CUDA | Python, TypeScript | Rust, Python
License    | MIT               | MIT                | Apache-2.0