Compare Runtimes


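| Metric | Python (CPython) | CUDA Runtime | llama.cpp | ExLlamaV2 |
|---|---|---|---|---|
| Score | B (71) | B- (65) | C+ (63) | D (47) |
| Type | Language | | | |
| Execution | interpreted | aot | aot | aot |
| Interface | cli | sdk | cli | sdk |
| Cold Start | 50 ms | 100 ms | 100 ms | 1000 ms |
| Memory | 15 MB | 500 MB | 50 MB | 300 MB |
| Startup | 10 ms | 50 ms | 10 ms | 200 ms |
| Isolation | process | hardware | process | process |
| Maturity | production | production | production | stable |
| Languages | Python | C, C++, Python | C, C++ | Python, C++, CUDA |
| License | Other | Proprietary | MIT | MIT |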