Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricCUDA RuntimeMLXText Generation Inference
ScoreB-65D47F39
Type
Executionaotjithybrid
Interfacesdksdkapi
Cold Start100ms500ms10000ms
Memory500MB200MB2000MB
Startup50ms100ms5000ms
Isolationhardwareprocesscontainer
Maturityproductionstableproduction
LanguagesC, C++, PythonPython, C++, SwiftRust, Python
LicenseProprietaryMITApache-2.0
Links