torch-nccl #115

Manually triggered October 12, 2024 21:35
Status Cancelled
Total duration 1h 35m 21s

torch-nccl.yml

on: workflow_dispatch
Get torch:nccl Config / Read Configuration File: 9s
Matrix: Build torch:nccl

Annotations

11 errors and 5 warnings
Build torch:nccl (12.6.1, cudnn, ubuntu20.04, 2.23.4-1, 2ff05b2) / Build torch-extras / Build torch-extras via Workflow Call / Build Images
buildx failed with: ERROR: failed to solve: process "/bin/sh -c export CUDA_MAJOR_VERSION=$(echo $CUDA_VERSION | cut -d. -f1) CUDA_MINOR_VERSION=$(echo $CUDA_VERSION | cut -d. -f2) && export CUDA_PACKAGE_VERSION=\"${CUDA_MAJOR_VERSION}-${CUDA_MINOR_VERSION}\" && apt-get install -y --no-install-recommends cuda-nvcc-${CUDA_PACKAGE_VERSION} cuda-nvml-dev-${CUDA_PACKAGE_VERSION} libcurand-dev-${CUDA_PACKAGE_VERSION} libcublas-dev-${CUDA_PACKAGE_VERSION} libcusparse-dev-${CUDA_PACKAGE_VERSION} libcusolver-dev-${CUDA_PACKAGE_VERSION} cuda-nvprof-${CUDA_PACKAGE_VERSION} cuda-profiler-api-${CUDA_PACKAGE_VERSION} cuda-nvtx-${CUDA_PACKAGE_VERSION} cuda-nvrtc-dev-${CUDA_PACKAGE_VERSION} libaio-dev ninja-build && apt-get clean" did not complete successfully: exit code: 100
Build torch:nccl (12.6.1, cudnn, ubuntu22.04, 2.23.4-1, 2ff05b2) / Build torch / Build Images
FailFast: cancelling since parallel instance has failed
Build torch:nccl (12.2.2, cudnn8, ubuntu20.04, 2.21.5-1, 2ff05b2) / Build torch / Build Images
FailFast: cancelling since parallel instance has failed
Build torch:nccl (12.2.2, cudnn8, ubuntu22.04, 2.23.4-1, 2ff05b2) / Build torch / Build Images
FailFast: cancelling since parallel instance has failed
Get torch:nccl Config / Read Configuration File
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
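
The failing step in the first annotation derives an apt package suffix from `CUDA_VERSION` before calling `apt-get install`. The sketch below reproduces only that derivation, so the resulting package names can be checked by hand. The helper name `cuda_package_version` is hypothetical and not part of the workflow, and the value `12.6.1` is taken from the failing matrix entry. The log does not show which package apt rejected, so this is a starting point for investigation rather than a diagnosis.

```shell
#!/bin/sh
# Hypothetical helper: mirrors the cut-based logic from the failing build step.
cuda_package_version() {
    # $1 is a full CUDA version string such as 12.6.1
    major=$(echo "$1" | cut -d. -f1)
    minor=$(echo "$1" | cut -d. -f2)
    echo "${major}-${minor}"
}

# CUDA version from the matrix entry that failed (12.6.1, cudnn, ubuntu20.04).
CUDA_VERSION="12.6.1"
CUDA_PACKAGE_VERSION=$(cuda_package_version "$CUDA_VERSION")

# Print two of the package names the step would request from apt.
echo "cuda-nvcc-${CUDA_PACKAGE_VERSION}"
echo "cuda-nvprof-${CUDA_PACKAGE_VERSION}"
```

For `12.6.1` this prints `cuda-nvcc-12-6` and `cuda-nvprof-12-6`. Inside the base image, running `apt-cache policy` on each printed name would show whether the configured repositories offer a candidate for it. An apt exit code of 100 is consistent with a package that could not be located, though the annotation alone does not confirm that.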