Compare Runtimes

Side-by-side comparison of three runtimes: Metal, ExLlamaV2, and Text Generation Inference.

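| Metric | Metal | ExLlamaV2 | Text Generation Inference |
|---|---|---|---|
| Score | B 74 | D 47 | F 39 |
| Type | | | |
| Execution | aot | aot | hybrid |
| Interface | sdk | sdk | api |
| Cold Start | 50 ms | 1000 ms | 10000 ms |
| Memory | 100 MB | 300 MB | 2000 MB |
| Startup | 20 ms | 200 ms | 5000 ms |
| Isolation | hardware | process | container |
| Maturity | production | stable | production |
| Languages | Swift, Objective-C, C++ | Python, C++, CUDA | Rust, Python |
| License | Proprietary | MIT | Apache-2.0 |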