Compare Runtimes

Side-by-side comparison of three runtimes: LLM (Python CLI), ExLlamaV2, and vLLM.

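| Metric     | LLM (Python CLI) | ExLlamaV2         | vLLM       |
|------------|------------------|-------------------|------------|
| Score      | D49              | D47               | F35        |
| Execution  | hybrid           | aot               | jit        |
| Interface  | cli              | sdk               | api        |
| Cold Start | 500 ms           | 1000 ms           | 5000 ms    |
| Memory     | 100 MB           | 300 MB            | 2000 MB    |
| Startup    | 100 ms           | 200 ms            | 3000 ms    |
| Isolation  | process          | process           | process    |
| Maturity   | stable           | stable            | production |
| Languages  | Python           | Python, C++, CUDA | Python     |
| License    | Apache-2.0       | MIT               | Apache-2.0 |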