Compare Runtimes

Metric      Python (CPython)  CUDA Runtime    llama.cpp   Text Generation Inference
Score       B (71)            B- (65)         C+ (63)     F (39)
Type        Language
Execution   interpreted       aot             aot         hybrid
Interface   cli               sdk             cli         api
Cold Start  50 ms             100 ms          100 ms      10000 ms
Memory      15 MB             500 MB          50 MB       2000 MB
Startup     10 ms             50 ms           10 ms       5000 ms
Isolation   process           hardware        process     container
Maturity    production        production      production  production
Languages   Python            C, C++, Python  C, C++      Rust, Python
License     Other             Proprietary     MIT         Apache-2.0
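
The numeric rows of the comparison can also be ranked programmatically. The following is a minimal sketch, not part of the original page: the `RUNTIMES` dictionary, its key names, and the helpers `to_number` and `rank_by` are hypothetical names introduced for illustration, and the values are transcribed from the table above.

```python
# Values copied from the comparison table; the dict layout and key names
# are assumptions made for this example only.
RUNTIMES = {
    "Python (CPython)": {"score": "71", "cold_start": "50ms", "memory": "15MB", "startup": "10ms"},
    "CUDA Runtime": {"score": "65", "cold_start": "100ms", "memory": "500MB", "startup": "50ms"},
    "llama.cpp": {"score": "63", "cold_start": "100ms", "memory": "50MB", "startup": "10ms"},
    "Text Generation Inference": {"score": "39", "cold_start": "10000ms", "memory": "2000MB", "startup": "5000ms"},
}


def to_number(value: str) -> int:
    """Keep only the digits of a cell such as '50ms' or '2000MB'."""
    return int("".join(ch for ch in value if ch.isdigit()))


def rank_by(metric: str, descending: bool = False) -> list[str]:
    """Return runtime names ordered by the given metric (ascending by default).

    sorted() is stable, so runtimes with equal values keep table order.
    """
    return sorted(
        RUNTIMES,
        key=lambda name: to_number(RUNTIMES[name][metric]),
        reverse=descending,
    )


if __name__ == "__main__":
    for metric in ("cold_start", "memory", "startup"):
        print(metric, "->", rank_by(metric))
    print("score ->", rank_by("score", descending=True))
```

Lower is better for cold start, memory, and startup, so those rank ascending; the score ranks descending.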