Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hang detection solely based on lack of output to stdout/stderr not good enough #132

Open
boegel opened this issue Jul 6, 2017 · 2 comments

Comments

@boegel
Copy link
Member

boegel commented Jul 6, 2017

The hang detection that was implemented in #88 only looks at the (lack of) output to stdout/stderr to detect whether the MPI program is hanging or not.

This may lead to too many false positives, since for CP2K for example the output can be redirected via the -o option of CP2K, meaning there is no output generated to stdout/stderr anymore?

Can we do better?

Workaround in case this occurs: use mympirun --disable-output-check-fatal

@boegel
Copy link
Member Author

boegel commented Jul 7, 2017

@stdweird Thoughts on this, can we do better, e.g. detect somehow (reliably...) whether or not the program is creating output files?

@boegel boegel changed the title hang detection solely based on lack of output to stdout/stderr not good enough? hang detection solely based on lack of output to stdout/stderr not good enough Jul 7, 2017
@stdweird
Copy link
Member

stdweird commented Jul 8, 2017

yes, don't enable the check by default. i was never a big fan of this default, even with your anecdotal evidence.
replace the current default with a mode that only logs to syslog, we will evaluate in 3 to 6 months how many times this actually reported an issue. worst case, we can couple it with the monitoring to notify us and/or the user.
what you are trying to solve can only be handled at the MPI lib level.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants