@guptaaka guptaaka commented Dec 8, 2025

Add elaborate instructions to validate that the service components are running.

@guptaaka guptaaka requested a review from shauryagup December 9, 2025 02:10
(Detailed instructions are <a href="https://docs.cloud.google.com/ai-hypercomputer/docs/workloads/pathways-on-cloud/troubleshooting-pathways#health_monitoring" target="_blank">here</a>)

```
# Set the environment variables
```
Collaborator

Maybe find these programmatically since you already have the jobset name?

Collaborator Author

I can get the pod names using `kubectl get pods --selector=jobset.sigs.k8s.io/jobset-name=<your-jobset-name> -o name`, but I'm not sure how to filter the pods for a specific container type.
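
One possible way to do that filtering, sketched here as an illustration and not necessarily what the PR ended up adding: ask `kubectl` for each pod's name together with its container names via a JSONPath template, then keep only the pods that list the wanted container. The function name `pods_with_container` and both of its arguments are hypothetical, and the sketch assumes the container is a regular container (init containers live under `.spec.initContainers` and would not match).

```shell
# Hypothetical helper: print the pods of a JobSet that run a given container.
# Usage: pods_with_container JOBSET_NAME CONTAINER_NAME
pods_with_container() {
  jobset_name="$1"
  container_name="$2"

  # Emit one line per pod: "<pod-name><TAB><space-separated container names>".
  kubectl get pods \
    --selector="jobset.sigs.k8s.io/jobset-name=${jobset_name}" \
    -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.spec.containers[*].name}{"\n"}{end}' |
    # Keep only the pods whose container list contains an exact match.
    awk -F '\t' -v want="$container_name" '
      {
        n = split($2, names, " ")
        for (i = 1; i <= n; i++) {
          if (names[i] == want) { print $1; break }
        }
      }'
}
```

Calling `pods_with_container` with your JobSet name and a container name prints the matching pod names, one per line, which can then be passed to commands such as `kubectl logs`.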

Collaborator Author

Added the commands.

@lukebaumann
Collaborator

Make sure to merge your commits before merging the PR.

@lukebaumann lukebaumann self-requested a review December 17, 2025 20:34
@guptaaka guptaaka merged commit d4bd5c1 into main Dec 17, 2025
29 checks passed
@guptaaka guptaaka deleted the sps branch December 17, 2025 22:06