Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeVulkanExLlamaV2
ScoreB-65C+64D47
Type
Executionaotaotaot
Interfacesdksdksdk
Cold Start100ms100ms1000ms
Memory500MB200MB300MB
Startup50ms30ms200ms
Isolationhardwarehardwareprocess
Maturityproductionproductionstable
LanguagesC, C++, PythonC, C++Python, C++, CUDA
LicenseProprietaryApache-2.0MIT
Links