Compare Runtimes

Metric      GGUF        Python (CPython)  CUDA Runtime    Vulkan
Score       A- (83)     B (71)            B- (65)         C+ (64)
Type        -           Language          -               -
Execution   AOT         interpreted       AOT             AOT
Interface   embedded    CLI               SDK             SDK
Cold Start  < 1 ms      50 ms             100 ms          100 ms
Memory      0 MB        15 MB             500 MB          200 MB
Startup     < 1 ms      10 ms             50 ms           30 ms
Isolation   process     process           hardware        hardware
Maturity    production  production        production      production
Languages   Any         Python            C, C++, Python  C, C++
License     MIT         Other             Proprietary     Apache-2.0