Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricExLlamaV2Text Generation InferenceKoboldCpp
ScoreD47F39F38
Type
Executionaothybridhybrid
Interfacesdkapigui
Cold Start1000ms10000ms1500ms
Memory300MB2000MB400MB
Startup200ms5000ms300ms
Isolationprocesscontainerprocess
Maturitystableproductionstable
LanguagesPython, C++, CUDARust, PythonC++, Python
LicenseMITApache-2.0AGPL-3.0
Links