Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeExLlamaV2CTransformers
ScoreB-65D47D46
Type
Executionaotaothybrid
Interfacesdksdksdk
Cold Start100ms1000ms800ms
Memory500MB300MB200MB
Startup50ms200ms100ms
Isolationhardwareprocessprocess
Maturityproductionstablestable
LanguagesC, C++, PythonPython, C++, CUDAPython, C++
LicenseProprietaryMITMIT
Links