Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA Runtimellama.cppExLlamaV2
ScoreB-65C+63D47
Type
Executionaotaotaot
Interfacesdkclisdk
Cold Start100ms100ms1000ms
Memory500MB50MB300MB
Startup50ms10ms200ms
Isolationhardwareprocessprocess
Maturityproductionproductionstable
LanguagesC, C++, PythonC, C++Python, C++, CUDA
LicenseProprietaryMITMIT
Links