feat: replace cuda13 flavour with per-GPU-generation packages
All checks were successful
deploy-ui / build-and-deploy (push) Successful in 44s

Build separate packages for each GPU generation instead of a single
cuda13 package:
- mistralrs-ampere (sm_86, RTX 3060)
- mistralrs-ada (sm_89, RTX 4090)
- mistralrs-blackwell (sm_120, RTX 5090)

All use the same CUDA 13.0 toolkit and features (cuda, cudnn, flash-attn,
nccl), varying only the compute capability target.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
2026-05-11 14:45:28 +03:00
parent 661cf574f2
commit ace6037a2d
5 changed files with 53 additions and 10 deletions

View File

@@ -30,7 +30,23 @@ jobs:
fail-fast: false
matrix:
include:
- name: cuda13
- name: ampere
fedora_version: "43"
runner: cuda-13.0
cuda_home: /usr/local/cuda-13.0
cargo_features: "cuda cudnn flash-attn nccl"
compute_caps: "86"
build_jobs: 12
nvcc_threads: 4
- name: ada
fedora_version: "43"
runner: cuda-13.0
cuda_home: /usr/local/cuda-13.0
cargo_features: "cuda cudnn flash-attn nccl"
compute_caps: "89"
build_jobs: 12
nvcc_threads: 4
- name: blackwell
fedora_version: "43"
runner: cuda-13.0
cuda_home: /usr/local/cuda-13.0
@@ -89,7 +105,11 @@ jobs:
fail-fast: false
matrix:
include:
- name: cuda13
- name: ampere
fedora_version: "43"
- name: ada
fedora_version: "43"
- name: blackwell
fedora_version: "43"
steps:
- uses: actions/checkout@v4