Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeNode.jsText Generation InferenceKoboldCpp
ScoreB-65B-65F39F38
TypeLanguage
Executionaotjithybridhybrid
Interfacesdkcliapigui
Cold Start100ms50ms10000ms1500ms
Memory500MB40MB2000MB400MB
Startup50ms20ms5000ms300ms
Isolationhardwareprocesscontainerprocess
Maturityproductionproductionproductionstable
LanguagesC, C++, PythonJavaScript, TypeScriptRust, PythonC++, Python
LicenseProprietaryMITApache-2.0AGPL-3.0
Links