Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

Metricllama.cppExLlamaV2vLLM
ScoreC+63D47F35
Type
Executionaotaotjit
Interfaceclisdkapi
Cold Start100ms1000ms5000ms
Memory50MB300MB2000MB
Startup10ms200ms3000ms
Isolationprocessprocessprocess
Maturityproductionstableproduction
LanguagesC, C++Python, C++, CUDAPython
LicenseMITMITApache-2.0
Links