Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricGGUFPython (CPython)llama.cppText Generation Inference
ScoreA-83B71C+63F39
TypeLanguage
Executionaotinterpretedaothybrid
Interfaceembeddedclicliapi
Cold Start<1ms50ms100ms10000ms
Memory0MB15MB50MB2000MB
Startup<1ms10ms10ms5000ms
Isolationprocessprocessprocesscontainer
Maturityproductionproductionproductionproduction
LanguagesAnyPythonC, C++Rust, Python
LicenseMITOtherMITApache-2.0
Links