Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeONNX RuntimeLLM (Python CLI)KoboldCpp
ScoreB-65C-50D49F38
Type
Executionaothybridhybridhybrid
Interfacesdksdkcligui
Cold Start100ms500ms500ms1500ms
Memory500MB300MB100MB400MB
Startup50ms100ms100ms300ms
Isolationhardwareprocessprocessprocess
Maturityproductionproductionstablestable
LanguagesC, C++, PythonPython, C++, C#, JavaPythonC++, Python
LicenseProprietaryMITApache-2.0AGPL-3.0
Links