Compare Runtimes


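| Metric | CUDA Runtime | ExLlamaV2 | vLLM |
|---|---|---|---|
| Score | B- 65 | D 47 | F 35 |
| Type | | | |
| Execution | aot | aot | jit |
| Interface | sdk | sdk | api |
| Cold Start | 100 ms | 1000 ms | 5000 ms |
| Memory | 500 MB | 300 MB | 2000 MB |
| Startup | 50 ms | 200 ms | 3000 ms |
| Isolation | hardware | process | process |
| Maturity | production | stable | production |
| Languages | C, C++, Python | Python, C++, CUDA | Python |
| License | Proprietary | MIT | Apache-2.0 |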