Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimellamafileExLlamaV2
ScoreB-65C-53D47
Type
Executionaotaotaot
Interfacesdkclisdk
Cold Start100ms500ms1000ms
Memory500MB100MB300MB
Startup50ms50ms200ms
Isolationhardwareprocessprocess
Maturityproductionstablestable
LanguagesC, C++, PythonC, C++Python, C++, CUDA
LicenseProprietaryApache-2.0MIT
Links