Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricGGUFCUDA RuntimeCTransformers
ScoreA-83B-65D46
Type
Executionaotaothybrid
Interfaceembeddedsdksdk
Cold Start<1ms100ms800ms
Memory0MB500MB200MB
Startup<1ms50ms100ms
Isolationprocesshardwareprocess
Maturityproductionproductionstable
LanguagesAnyC, C++, PythonPython, C++
LicenseMITProprietaryMIT
Links