Compare Runtimes

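| Metric | Python (CPython) | CUDA Runtime | ONNX Runtime | ExLlamaV2 |
| --- | --- | --- | --- | --- |
| Score | B 71 | B- 65 | C- 50 | D 47 |
| Type | Language | | | |
| Execution | interpreted | aot | hybrid | aot |
| Interface | cli | sdk | sdk | sdk |
| Cold Start | 50 ms | 100 ms | 500 ms | 1000 ms |
| Memory | 15 MB | 500 MB | 300 MB | 300 MB |
| Startup | 10 ms | 50 ms | 100 ms | 200 ms |
| Isolation | process | hardware | process | process |
| Maturity | production | production | production | stable |
| Languages | Python | C, C++, Python | Python, C++, C#, Java | Python, C++, CUDA |
| License | Other | Proprietary | MIT | MIT |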
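
To compare the numeric rows programmatically, the figures can be loaded into a small structure and sorted. The following is a minimal sketch, not part of the original page: the dictionary keys (`cold_start`, `memory`, `startup`) and the helper names `parse_quantity` and `rank_by` are hypothetical, and it assumes every value in a row uses the same unit, as is the case in this comparison.

```python
import re

# Values transcribed from the comparison; key names are illustrative only.
RUNTIMES = {
    "Python (CPython)": {"cold_start": "50ms", "memory": "15MB", "startup": "10ms"},
    "CUDA Runtime": {"cold_start": "100ms", "memory": "500MB", "startup": "50ms"},
    "ONNX Runtime": {"cold_start": "500ms", "memory": "300MB", "startup": "100ms"},
    "ExLlamaV2": {"cold_start": "1000ms", "memory": "300MB", "startup": "200ms"},
}


def parse_quantity(text):
    """Split a value such as '500ms' into a (number, unit) pair."""
    match = re.fullmatch(r"(\d+)\s*([A-Za-z]+)", text)
    if match is None:
        raise ValueError(f"unrecognized quantity: {text!r}")
    return int(match.group(1)), match.group(2)


def rank_by(metric):
    """Return runtime names sorted ascending by the given metric.

    Assumes all values for the metric share one unit, so only the
    numeric part is compared. Ties keep their original order.
    """
    return sorted(RUNTIMES, key=lambda name: parse_quantity(RUNTIMES[name][metric])[0])


if __name__ == "__main__":
    for metric in ("cold_start", "memory", "startup"):
        print(metric, "->", ", ".join(rank_by(metric)))
```

Because the sort compares only the numeric part, mixing units within one row (for example `ms` and `s`) would need a conversion step before ranking.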