Compare Runtimes

Metric       llamafile    ExLlamaV2           Text Generation Inference
Score        C- (53)      D (47)              F (39)
Type
Execution    aot          aot                 hybrid
Interface    cli          sdk                 api
Cold Start   500 ms       1000 ms             10000 ms
Memory       100 MB       300 MB              2000 MB
Startup      50 ms        200 ms              5000 ms
Isolation    process      process             container
Maturity     stable       stable              production
Languages    C, C++       Python, C++, CUDA   Rust, Python
License      Apache-2.0   MIT                 Apache-2.0
Links