[bug] Can't import smdistributed from the pytorch-training:2.4.0-gpu-py311 container #4914

Open
gui-miotto opened this issue Oct 30, 2024 · 1 comment
@gui-miotto

Summary

smdistributed cannot be imported in the new PyTorch 2.4 training image.

To reproduce

Running

import smdistributed.dataparallel.torch.torch_smddp

will result in an error:

ModuleNotFoundError: No module named 'smdistributed'

This happens with the PyTorch 2.4 image, specifically 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:2.4.0-gpu-py311-cu124-ubuntu22.04-sagemaker

smdistributed can still be imported in the older images (PyTorch 2.3 and 2.2).
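
To help confirm the report on a given image, here is a minimal sketch that checks whether the package can be located before attempting the import. The helper names `module_available` and `pick_backend` are hypothetical and not part of any library. The fallback to the "nccl" backend is only an assumption about what a training script might do when smdistributed is missing, not a confirmed fix for this issue.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the top-level module `name` can be located (it is not imported)."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec can raise for broken or half-initialised modules; treat as missing.
        return False


def pick_backend(preferred_module: str = "smdistributed") -> str:
    """Hypothetical helper: choose a torch.distributed backend name.

    Assumption: "smddp" is the backend name used with smdistributed.dataparallel,
    and "nccl" is an acceptable fallback when the package is absent.
    """
    return "smddp" if module_available(preferred_module) else "nccl"


if __name__ == "__main__":
    print("smdistributed importable:", module_available("smdistributed"))
    print("backend to use:", pick_backend())
```

Running this inside both the 2.4 image and an older image (2.3 or 2.2) should show whether the package is actually absent from the newer image, as opposed to being shadowed by a path or environment problem.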

@gui-miotto gui-miotto added the bug label Oct 30, 2024
@gui-miotto
Author

Related to this
