| Metric | containerd | CUDA Runtime | ExLlamaV2 |
|---|---|---|---|
| Score | B- (66) | B- (65) | D (47) |
| Type | Container | | |
| Execution | hybrid | aot | aot |
| Interface | platform | sdk | sdk |
| Cold Start | 100 ms | 100 ms | 1000 ms |
| Memory | 20 MB | 500 MB | 300 MB |
| Startup | 20 ms | 50 ms | 200 ms |
| Isolation | container | hardware | process |
| Maturity | production | production | stable |
| Languages | Any | C, C++, Python | Python, C++, CUDA |
| License | Apache-2.0 | Proprietary | MIT |