Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricPython (CPython)CUDA RuntimeVulkanExLlamaV2
ScoreB71B-65C+64D47
TypeLanguage
Executioninterpretedaotaotaot
Interfaceclisdksdksdk
Cold Start50ms100ms100ms1000ms
Memory15MB500MB200MB300MB
Startup10ms50ms30ms200ms
Isolationprocesshardwarehardwareprocess
Maturityproductionproductionproductionstable
LanguagesPythonC, C++, PythonC, C++Python, C++, CUDA
LicenseOtherProprietaryApache-2.0MIT
Links