Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeNode.jsLLM (Python CLI)Text Generation Inference
ScoreB-65B-65D49F39
TypeLanguage
Executionaotjithybridhybrid
Interfacesdkclicliapi
Cold Start100ms50ms500ms10000ms
Memory500MB40MB100MB2000MB
Startup50ms20ms100ms5000ms
Isolationhardwareprocessprocesscontainer
Maturityproductionproductionstableproduction
LanguagesC, C++, PythonJavaScript, TypeScriptPythonRust, Python
LicenseProprietaryMITApache-2.0Apache-2.0
Links