Compare Runtimes


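| Metric     | GGUF       | CUDA Runtime   | llama.cpp  |
|------------|------------|----------------|------------|
| Score      | A- (83)    | B- (65)        | C+ (63)    |
| Type       |            |                |            |
| Execution  | aot        | aot            | aot        |
| Interface  | embedded   | sdk            | cli        |
| Cold Start | <1 ms      | 100 ms         | 100 ms     |
| Memory     | 0 MB       | 500 MB         | 50 MB      |
| Startup    | <1 ms      | 50 ms          | 10 ms      |
| Isolation  | process    | hardware       | process    |
| Maturity   | production | production     | production |
| Languages  | Any        | C, C++, Python | C, C++     |
| License    | MIT        | Proprietary    | MIT        |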