Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricPython (CPython)CUDA RuntimellamafileText Generation Inference
ScoreB71B-65C-53F39
TypeLanguage
Executioninterpretedaotaothybrid
Interfaceclisdkcliapi
Cold Start50ms100ms500ms10000ms
Memory15MB500MB100MB2000MB
Startup10ms50ms50ms5000ms
Isolationprocesshardwareprocesscontainer
Maturityproductionproductionstableproduction
LanguagesPythonC, C++, PythonC, C++Rust, Python
LicenseOtherProprietaryApache-2.0Apache-2.0
Links