Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricPython (CPython)llama.cppExLlamaV2Text Generation Inference
ScoreB71C+63D47F39
TypeLanguage
Executioninterpretedaotaothybrid
Interfacecliclisdkapi
Cold Start50ms100ms1000ms10000ms
Memory15MB50MB300MB2000MB
Startup10ms10ms200ms5000ms
Isolationprocessprocessprocesscontainer
Maturityproductionproductionstableproduction
LanguagesPythonC, C++Python, C++, CUDARust, Python
LicenseOtherMITMITApache-2.0
Links