Mount NCCL_TOPO_FILE in NCCL test #337

Merged: 1 commit, Jan 14, 2025

Conversation

@TaekyungHeo (Member) commented Jan 14, 2025

Summary

This PR mounts the file referenced by NCCL_TOPO_FILE into the container whenever NCCL_TOPO_FILE is set. Previously, a pair of environment variables was used to set NCCL_TOPO_FILE, which was unnecessary and confusing.
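
For illustration only, the behavior described above might be sketched as follows. This is not the code from this PR: the function name `nccl_topo_mounts`, the `env_vars` dictionary, and the `src:dst` mount format are all assumptions made for the example.

```python
from typing import Dict, List


def nccl_topo_mounts(env_vars: Dict[str, str]) -> List[str]:
    """Return container mount specs for NCCL_TOPO_FILE, if it is set.

    Hypothetical helper: the name and the "src:dst" mount format are
    assumptions for this sketch, not the project's actual API.
    """
    topo_file = env_vars.get("NCCL_TOPO_FILE", "").strip()
    if not topo_file:
        # Variable unset or empty: nothing to mount.
        return []
    # Mount the file at the same path inside the container so that the
    # value of NCCL_TOPO_FILE remains valid there.
    return [f"{topo_file}:{topo_file}"]
```

With this shape, a caller would only append the returned entries to its existing mount list, and no second environment variable is needed to say where the file lives.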

Test Plan

Ran it on a server.

@TaekyungHeo added the bug label Jan 14, 2025
@TaekyungHeo merged commit c6ffbf8 into NVIDIA:main Jan 14, 2025
2 checks passed
Labels
bug Something isn't working