Compare Runtimes


Metric      llama.cpp   ExLlamaV2          Text Generation Inference
Score       C+ (63)     D (47)             F (39)
Type
Execution   aot         aot                hybrid
Interface   cli         sdk                api
Cold Start  100 ms      1000 ms            10000 ms
Memory      50 MB       300 MB             2000 MB
Startup     10 ms       200 ms             5000 ms
Isolation   process     process            container
Maturity    production  stable             production
Languages   C, C++      Python, C++, CUDA  Rust, Python
License     MIT         MIT                Apache-2.0