Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeAWS LambdaText Generation InferenceKoboldCpp
ScoreB-65B-65F39F38
TypeServerless
Executionaothybridhybridhybrid
Interfacesdkplatformapigui
Cold Start100ms200ms10000ms1500ms
Memory500MB128MB2000MB400MB
Startup50ms100ms5000ms300ms
Isolationhardwaremicrovmcontainerprocess
Maturityproductionproductionproductionstable
LanguagesC, C++, PythonJavaScript, TypeScript, Python, Java, Go, Ruby, .NET, RustRust, PythonC++, Python
LicenseProprietaryProprietaryApache-2.0AGPL-3.0
Links