build: Update HPC-X to v2.16 for CUDA 12 builds #25

Merged 2 commits on Aug 25, 2023

Eta0 (Collaborator) commented Aug 25, 2023

HPC-X v2.16

This change updates all CUDA 12 build configurations to use HPC-X v2.16, and adds a build argument to the Dockerfile that controls another part of the distribution name, which changed in the latest release (from MLNX_OFED_LINUX-5 to mlnx_ofed).
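
To illustrate why a separate build argument helps, here is a minimal Python sketch of a distribution name assembled from independently configurable parts. The function name, parameter names, and the naming pattern are all hypothetical and are not taken from the repository's Dockerfile or from the actual HPC-X download names. It also assumes the parenthetical above describes a change from MLNX_OFED_LINUX-5 to mlnx_ofed.

```python
def hpcx_distribution_name(hpcx_version: str, ofed_part: str,
                           os_name: str, cuda_major: int,
                           arch: str = "x86_64") -> str:
    """Assemble a distribution name from separately configurable parts.

    NOTE: this pattern is an illustrative assumption, not the real
    HPC-X naming scheme or the repository's build logic.
    """
    return f"hpcx-v{hpcx_version}-gcc-{ofed_part}-{os_name}-cuda{cuda_major}-{arch}"


# With the OFED component exposed as its own parameter, a rename in a
# new release needs only a different argument value, not a new template.
old_name = hpcx_distribution_name("2.15", "MLNX_OFED_LINUX-5", "ubuntu20.04", 12)
new_name = hpcx_distribution_name("2.16", "mlnx_ofed", "ubuntu20.04", 12)
print(old_name)
print(new_name)
```

In a Dockerfile, the same idea would typically be expressed with one `ARG` per variable part of the name, so each build configuration can override only the parts that differ.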

Eta0 added the "enhancement" label on Aug 25, 2023
Eta0 requested a review from salanki on Aug 25, 2023, 16:52
Eta0 self-assigned this on Aug 25, 2023
@github-actions

@Eta0 Build complete, success: https://github.com/coreweave/nccl-tests/actions/runs/5978363390
Image: ghcr.io/coreweave/nccl-tests:11.8.0-cudnn8-devel-ubuntu20.04-nccl2.16.2-1-253a5b1

@github-actions

@Eta0 Build complete, success: https://github.com/coreweave/nccl-tests/actions/runs/5978363390
Image: ghcr.io/coreweave/nccl-tests:11.7.1-cudnn8-devel-ubuntu20.04-nccl2.14.3-1-253a5b1

@github-actions

@Eta0 Build complete, success: https://github.com/coreweave/nccl-tests/actions/runs/5978363390
Image: ghcr.io/coreweave/nccl-tests:12.1.1-cudnn8-devel-ubuntu20.04-nccl2.18.3-1-253a5b1

@github-actions

@Eta0 Build complete, success: https://github.com/coreweave/nccl-tests/actions/runs/5978363390
Image: ghcr.io/coreweave/nccl-tests:12.0.1-cudnn8-devel-ubuntu20.04-nccl2.18.3-1-253a5b1

@github-actions

@Eta0 Build complete, success: https://github.com/coreweave/nccl-tests/actions/runs/5978363390
Image: ghcr.io/coreweave/nccl-tests:12.2.0-devel-ubuntu20.04-nccl2.18.3-1-253a5b1

salanki (Contributor) commented Aug 25, 2023

Please merge and update README. Can I also get an updated build of ghcr.io/coreweave/ml-containers/torch:eca8c09-nccl-cuda12.0.1-nccl2.18.3-1-torch2.0.1-vision0.15.2-audio2.0.2 as a follow on based on this new base image?

Eta0 (Collaborator, Author) commented Aug 25, 2023

> Please merge and update README. Can I also get an updated build of ghcr.io/coreweave/ml-containers/torch:eca8c09-nccl-cuda12.0.1-nccl2.18.3-1-torch2.0.1-vision0.15.2-audio2.0.2 as a follow on based on this new base image?

@salanki Done, and updated in ml-containers in coreweave/ml-containers#36.
