Fix spmd sharding visualization when device index is >= 10 #9475

Open: jeffhataws wants to merge 1 commit into master from jeffhataws_dtensor2

jeffhataws
Collaborator

Previously, trying to visualize the sharding spec raised an error when a device index was >= 10. This PR fixes that problem and also prints the spec itself as an additional debugging aid.
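
For illustration only: the diff is not reproduced in this conversation, so the following is a minimal sketch of how a spec string with multi-digit device indices could be parsed. It assumes a spec of the form `{devices=[2,8]0,1,...,15}`; the helper name `parse_tile_devices` is hypothetical and is not part of torch_xla. The point it shows is that indices should be split on commas rather than read one character at a time, since character-wise parsing breaks as soon as an index has two digits.

```python
import re


def parse_tile_devices(spec: str) -> tuple[list[int], list[int]]:
    """Hypothetical helper: split a tiled sharding spec into (dims, devices).

    Assumes the explicit-list form '{devices=[d0,d1,...]i0,i1,...}'.
    Other forms (for example iota-style specs) are rejected.
    """
    match = re.search(r"devices=\[([\d,]+)\]([\d,]+)", spec)
    if match is None:
        raise ValueError(f"unrecognized sharding spec: {spec!r}")
    # Splitting on commas keeps multi-digit indices such as 10 or 15 intact.
    dims = [int(d) for d in match.group(1).split(",")]
    devices = [int(d) for d in match.group(2).split(",")]
    return dims, devices


# Character-wise parsing, by contrast, splits "10,11" into single digits:
naive = [int(c) for c in "10,11" if c.isdigit()]  # [1, 0, 1, 1]
```

With a 16-device spec, the helper returns the mesh dimensions `[2, 8]` and the device indices 0 through 15 as whole integers.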

@jeffhataws jeffhataws requested a review from ManfeiBai July 11, 2025 21:09
@jeffhataws jeffhataws force-pushed the jeffhataws_dtensor2 branch from fa3a464 to 7b2c4b7 Compare July 12, 2025 05:14
@jeffhataws jeffhataws requested review from rpsilva-aws and bfolie July 15, 2025 17:16
@jeffhataws
Collaborator Author

jeffhataws commented Jul 15, 2025

@bfolie do you know why torchprime tests are failing?

@jeffhataws
Collaborator Author

torchprime testing is being fixed #9481

@bfolie
Collaborator

bfolie commented Jul 15, 2025

> torchprime testing is being fixed #9481

That might not fix it -- the torchprime e2e test has several issues. I wouldn't worry about it for a change like this.

You've also got the TPU tests not running, which is being worked on by @bhavya01
