Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricExLlamaV2Text Generation InferencevLLM
ScoreD47F39F35
Type
Executionaothybridjit
Interfacesdkapiapi
Cold Start1000ms10000ms5000ms
Memory300MB2000MB2000MB
Startup200ms5000ms3000ms
Isolationprocesscontainerprocess
Maturitystableproductionproduction
LanguagesPython, C++, CUDARust, PythonPython
LicenseMITApache-2.0Apache-2.0
Links