Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeDockerExLlamaV2
ScoreB-65C-54D47
TypeContainer
Executionaothybridaot
Interfacesdkclisdk
Cold Start100ms500ms1000ms
Memory500MB50MB300MB
Startup50ms200ms200ms
Isolationhardwarecontainerprocess
Maturityproductionproductionstable
LanguagesC, C++, PythonAnyPython, C++, CUDA
LicenseProprietaryApache-2.0MIT
Links