Compare Runtimes

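| Metric | containerd | llama.cpp | Text Generation Inference |
| --- | --- | --- | --- |
| Score | B- (66) | C+ (63) | F (39) |
| Type | Container | | |
| Execution | hybrid | aot | hybrid |
| Interface | platform | cli | api |
| Cold Start | 100 ms | 100 ms | 10000 ms |
| Memory | 20 MB | 50 MB | 2000 MB |
| Startup | 20 ms | 10 ms | 5000 ms |
| Isolation | container | process | container |
| Maturity | production | production | production |
| Languages | Any | C, C++ | Rust, Python |
| License | Apache-2.0 | MIT | Apache-2.0 |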