Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricPython (CPython)llama.cppONNX RuntimeExLlamaV2
ScoreB71C+63C-50D47
TypeLanguage
Executioninterpretedaothybridaot
Interfacecliclisdksdk
Cold Start50ms100ms500ms1000ms
Memory15MB50MB300MB300MB
Startup10ms10ms100ms200ms
Isolationprocessprocessprocessprocess
Maturityproductionproductionproductionstable
LanguagesPythonC, C++Python, C++, C#, JavaPython, C++, CUDA
LicenseOtherMITMITMIT
Links