Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeNode.jsllama.cppText Generation Inference
ScoreB-65B-65C+63F39
TypeLanguage
Executionaotjitaothybrid
Interfacesdkclicliapi
Cold Start100ms50ms100ms10000ms
Memory500MB40MB50MB2000MB
Startup50ms20ms10ms5000ms
Isolationhardwareprocessprocesscontainer
Maturityproductionproductionproductionproduction
LanguagesC, C++, PythonJavaScript, TypeScriptC, C++Rust, Python
LicenseProprietaryMITMITApache-2.0
Links