Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricsafetensorsCUDA RuntimeText Generation Inference
ScoreA-83B-65F39
Type
Executionaotaothybrid
Interfaceembeddedsdkapi
Cold Start<1ms100ms10000ms
Memory0MB500MB2000MB
Startup<1ms50ms5000ms
Isolationprocesshardwarecontainer
Maturityproductionproductionproduction
LanguagesAnyC, C++, PythonRust, Python
LicenseApache-2.0ProprietaryApache-2.0
Links