Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeNode.jsExLlamaV2
ScoreB-65B-65D47
TypeLanguage
Executionaotjitaot
Interfacesdkclisdk
Cold Start100ms50ms1000ms
Memory500MB40MB300MB
Startup50ms20ms200ms
Isolationhardwareprocessprocess
Maturityproductionproductionstable
LanguagesC, C++, PythonJavaScript, TypeScriptPython, C++, CUDA
LicenseProprietaryMITMIT
Links