Compare Runtimes


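| Metric | llama.cpp | LLM (Python CLI) | ExLlamaV2 |
| --- | --- | --- | --- |
| Score | C+ (63) | D (49) | D (47) |
| Type | | | |
| Execution | aot | hybrid | aot |
| Interface | cli | cli | sdk |
| Cold Start | 100 ms | 500 ms | 1000 ms |
| Memory | 50 MB | 100 MB | 300 MB |
| Startup | 10 ms | 100 ms | 200 ms |
| Isolation | process | process | process |
| Maturity | production | stable | stable |
| Languages | C, C++ | Python | Python, C++, CUDA |
| License | MIT | Apache-2.0 | MIT |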