Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricExLlamaV2Text Generation InferenceLocalAI
ScoreD47F39F37
Type
Executionaothybridhybrid
Interfacesdkapiapi
Cold Start1000ms10000ms3000ms
Memory300MB2000MB800MB
Startup200ms5000ms1000ms
Isolationprocesscontainercontainer
Maturitystableproductionstable
LanguagesPython, C++, CUDARust, PythonGo, Python
LicenseMITApache-2.0MIT
Links