Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricExLlamaV2CTransformersText Generation Inference
ScoreD47D46F39
Type
Executionaothybridhybrid
Interfacesdksdkapi
Cold Start1000ms800ms10000ms
Memory300MB200MB2000MB
Startup200ms100ms5000ms
Isolationprocessprocesscontainer
Maturitystablestableproduction
LanguagesPython, C++, CUDAPython, C++Rust, Python
LicenseMITMITApache-2.0
Links