Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA Runtimellama.cppONNX Runtime
ScoreB-65C+63C-50
Type
Executionaotaothybrid
Interfacesdkclisdk
Cold Start100ms100ms500ms
Memory500MB50MB300MB
Startup50ms10ms100ms
Isolationhardwareprocessprocess
Maturityproductionproductionproduction
LanguagesC, C++, PythonC, C++Python, C++, C#, Java
LicenseProprietaryMITMIT
Links