ci(torch): Build against CUDA 12.6; remove 12.3, 12.1, and lower builds
Additionally, this change switches to base images with updated
NCCL (up to v2.23.4-1) and HPC-X (v2.20) versions for torch:nccl.
Eta0 committed Oct 11, 2024
1 parent 72c9ec2 commit ebbd1db
Showing 3 changed files with 20 additions and 45 deletions.
12 changes: 1 addition & 11 deletions .github/configurations/torch-base.yml
@@ -1,15 +1,5 @@
-cuda: [ 12.4.1, 12.3.2, 12.2.2, 12.0.1, 11.8.0 ]
+cuda: [ 12.6.1, 12.4.1, 12.2.2 ]
 os: [ ubuntu22.04, ubuntu20.04 ]
-exclude:
-  # Not a supported combination
-  - cuda: 11.8.0
-    os: ubuntu22.04
-  - cuda: 11.8.0
-    os: ubuntu20.04
-  - cuda: 12.0.1
-    os: ubuntu20.04
-  - cuda: 12.0.1
-    os: ubuntu22.04
 include:
   - torch: 2.4.1
     vision: 0.19.1
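
As a rough sanity check on the matrix change above, the following Python sketch expands the cuda and os lists the way a GitHub Actions-style matrix would, dropping excluded pairs. That the CI consumes this file with those semantics is an assumption, and `expand_matrix` is a hypothetical helper, not part of the repository.

```python
from itertools import product

# Values copied from the old and new versions of torch-base.yml.
OS_VERSIONS = ["ubuntu22.04", "ubuntu20.04"]
OLD_CUDA = ["12.4.1", "12.3.2", "12.2.2", "12.0.1", "11.8.0"]
NEW_CUDA = ["12.6.1", "12.4.1", "12.2.2"]

# The exclude list that this commit deletes.
OLD_EXCLUDE = [
    {"cuda": "11.8.0", "os": "ubuntu22.04"},
    {"cuda": "11.8.0", "os": "ubuntu20.04"},
    {"cuda": "12.0.1", "os": "ubuntu20.04"},
    {"cuda": "12.0.1", "os": "ubuntu22.04"},
]


def expand_matrix(cuda, os_list, exclude):
    """Hypothetical helper: every cuda/os pair not listed in `exclude`."""
    return [
        {"cuda": c, "os": o}
        for c, o in product(cuda, os_list)
        if {"cuda": c, "os": o} not in exclude
    ]


old_matrix = expand_matrix(OLD_CUDA, OS_VERSIONS, OLD_EXCLUDE)
new_matrix = expand_matrix(NEW_CUDA, OS_VERSIONS, [])

for combo in new_matrix:
    print(combo["cuda"], combo["os"])
```

Under that assumption, both the old and new configurations yield six builds. The old exclude list already removed every 11.8.0 and 12.0.1 combination, so the net effect is that 12.6.1 replaces 12.3.2 and the dead entries disappear.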
49 changes: 17 additions & 32 deletions .github/configurations/torch-nccl.yml
@@ -1,51 +1,36 @@
 image:
   # Ubuntu 22.04
-  - cuda: 12.4.1
+  - cuda: 12.6.1
     cudnn: cudnn
     os: ubuntu22.04
-    nccl: 2.21.5-1
-    nccl-tests-hash: 85f9143
-  - cuda: 12.3.2
-    cudnn: cudnn9
+    nccl: 2.23.4-1
+    nccl-tests-hash: 2ff05b2
+  - cuda: 12.4.1
+    cudnn: cudnn
     os: ubuntu22.04
-    nccl: 2.20.3-1
-    nccl-tests-hash: 85f9143
+    nccl: 2.23.4-1
+    nccl-tests-hash: 2ff05b2
   - cuda: 12.2.2
     cudnn: cudnn8
     os: ubuntu22.04
-    nccl: 2.19.3-1
-    nccl-tests-hash: 85f9143
-  # - cuda: 12.0.1
-  #   cudnn: cudnn8
-  #   os: ubuntu22.04
-  #   nccl: 2.18.5-1
-  #   nccl-tests-hash: 85f9143
+    nccl: 2.23.4-1
+    nccl-tests-hash: 2ff05b2
   # Ubuntu 20.04
-  - cuda: 12.4.1
+  - cuda: 12.6.1
     cudnn: cudnn
     os: ubuntu20.04
-    nccl: 2.21.5-1
-    nccl-tests-hash: 85f9143
-  - cuda: 12.3.2
-    cudnn: cudnn9
+    nccl: 2.23.4-1
+    nccl-tests-hash: 2ff05b2
+  - cuda: 12.4.1
+    cudnn: cudnn
     os: ubuntu20.04
-    nccl: 2.20.3-1
-    nccl-tests-hash: 85f9143
+    nccl: 2.23.4-1
+    nccl-tests-hash: 2ff05b2
   - cuda: 12.2.2
     cudnn: cudnn8
     os: ubuntu20.04
     nccl: 2.21.5-1
-    nccl-tests-hash: 85f9143
-  # - cuda: 12.0.1
-  #   cudnn: cudnn8
-  #   os: ubuntu20.04
-  #   nccl: 2.19.3-1
-  #   nccl-tests-hash: 85f9143
-  # - cuda: 11.8.0
-  #   cudnn: cudnn8
-  #   os: ubuntu20.04
-  #   nccl: 2.16.5-1
-  #   nccl-tests-hash: 868dc3d
+    nccl-tests-hash: 2ff05b2
 include:
   - torch: 2.4.1
     vision: 0.19.1
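
To see at a glance which NCCL version each image ends up on after this hunk, the entries can be grouped by version. This is a minimal sketch: the tuples are transcribed by hand from the post-change side of the diff, read using the hunk's stated counts of 17 additions and 32 deletions. It does not parse the real file.

```python
from collections import defaultdict

# (cuda, os, nccl) for each image entry after the change, transcribed
# by hand from the diff above rather than loaded from torch-nccl.yml.
images = [
    ("12.6.1", "ubuntu22.04", "2.23.4-1"),
    ("12.4.1", "ubuntu22.04", "2.23.4-1"),
    ("12.2.2", "ubuntu22.04", "2.23.4-1"),
    ("12.6.1", "ubuntu20.04", "2.23.4-1"),
    ("12.4.1", "ubuntu20.04", "2.23.4-1"),
    ("12.2.2", "ubuntu20.04", "2.21.5-1"),
]

# Group the cuda/os combinations under the NCCL version they pin.
by_nccl = defaultdict(list)
for cuda, os_name, nccl in images:
    by_nccl[nccl].append(f"{cuda}/{os_name}")

for nccl, combos in sorted(by_nccl.items()):
    print(nccl, combos)
```

Read this way, five of the six images move to 2.23.4-1, while the CUDA 12.2.2 image on Ubuntu 20.04 keeps 2.21.5-1 and only has its nccl-tests hash updated. That matches the commit message's wording of NCCL "up to" v2.23.4-1.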
4 changes: 2 additions & 2 deletions torch/Dockerfile
@@ -1,6 +1,6 @@
 # syntax=docker/dockerfile:1.4
-ARG BUILDER_BASE_IMAGE="nvidia/cuda:12.0.1-devel-ubuntu22.04"
-ARG FINAL_BASE_IMAGE="nvidia/cuda:12.0.1-base-ubuntu22.04"
+ARG BUILDER_BASE_IMAGE="nvidia/cuda:12.4.1-devel-ubuntu22.04"
+ARG FINAL_BASE_IMAGE="nvidia/cuda:12.4.1-base-ubuntu22.04"
 
 ARG BUILD_TORCH_VERSION="2.4.1"
 ARG BUILD_TORCH_VISION_VERSION="0.19.1"
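
The two `ARG` lines above are only defaults, so a build for another matrix entry would presumably override them. The sketch below shows one way to derive the `--build-arg` flags from a cuda/os pair, following the `nvidia/cuda:<cuda>-devel-<os>` and `-base-` tag pattern visible in the defaults. `base_image_args` is a hypothetical helper, and whether the repository's CI passes the arguments this way (or uses these exact upstream images) is an assumption.

```python
def base_image_args(cuda: str, os_name: str) -> list[str]:
    """Hypothetical helper: docker build flags overriding both base images."""
    builder = f"nvidia/cuda:{cuda}-devel-{os_name}"
    final = f"nvidia/cuda:{cuda}-base-{os_name}"
    return [
        "--build-arg", f"BUILDER_BASE_IMAGE={builder}",
        "--build-arg", f"FINAL_BASE_IMAGE={final}",
    ]


# Build the command line for the newly added CUDA 12.6.1 entry.
args = base_image_args("12.6.1", "ubuntu22.04")
command = ["docker", "build", *args, "torch/"]
print(" ".join(command))
```

Passing the list form to a process runner avoids shell quoting issues; the joined string is printed here only for inspection.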