Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricGGUFPython (CPython)CUDA RuntimeText Generation Inference
ScoreA-83B71B-65F39
TypeLanguage
Executionaotinterpretedaothybrid
Interfaceembeddedclisdkapi
Cold Start<1ms50ms100ms10000ms
Memory0MB15MB500MB2000MB
Startup<1ms10ms50ms5000ms
Isolationprocessprocesshardwarecontainer
Maturityproductionproductionproductionproduction
LanguagesAnyPythonC, C++, PythonRust, Python
LicenseMITOtherProprietaryApache-2.0
Links