Question : When disable-timeout is enabled, the timeout behavior is different for each fence_agent. #431

Open
HideoYamauchi opened this issue Aug 31, 2021 · 4 comments

@HideoYamauchi
Contributor

Hi All,

If disable-timeout is enabled, a failing operation should run until Pacemaker's own timeout expires, but the actual behavior differs depending on the fence_agent.
As a result, the Pacemaker pcmk_xxxx_retries parameter may or may not take effect, depending on the fence_agent.
Shouldn't every failure end in a timeout when disable-timeout is enabled?

Here are examples of fence_ipmitool and fence_vmware_rest, with the differences we're talking about in Bugzilla below.

I haven't checked all the fence_agents, but it seems that many of them behave differently in this respect.

Best Regards,
Hideo Yamauchi.

@oalbrigt
Collaborator

oalbrigt commented Sep 1, 2021

It should. So it seems we need to update it to use -N/-R, or a combination of them, to work optimally in this case:
https://access.redhat.com/discussions/5474551

@oalbrigt oalbrigt self-assigned this Sep 1, 2021
@HideoYamauchi
Contributor Author

Hi Oyvind,

Thanks!

Are the other fence_agents okay?
I think we need to review them comprehensively.

Best Regards,
Hideo Yamauchi.

@oalbrigt
Collaborator

oalbrigt commented Sep 3, 2021

If you run grep -e "run_command" -e "pexpect" agents/*/*.py you should get most of the agents that might have similar issues, which arise when the tool they call has its own internal timeout.
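
For anyone who prefers to script the survey, the grep above can be approximated in Python. This is only a rough sketch: the function name find_agents_with_external_tools is hypothetical, and it assumes it is run from the root of a checkout that has the agents/*/*.py layout mentioned above. Like the grep, it only flags candidates; it does not tell you whether the underlying tool really has an internal timeout.

```python
import re
from pathlib import Path

# Same two search terms as the grep command in the comment above.
PATTERN = re.compile(r"run_command|pexpect")


def find_agents_with_external_tools(repo_root="."):
    """Return sorted paths of agent scripts that mention run_command or pexpect.

    Hypothetical helper: repo_root is assumed to contain agents/<name>/<file>.py.
    """
    hits = []
    for path in sorted(Path(repo_root).glob("agents/*/*.py")):
        text = path.read_text(encoding="utf-8", errors="replace")
        if PATTERN.search(text):
            hits.append(path.as_posix())
    return hits


if __name__ == "__main__":
    # Print one candidate agent per line, similar to `grep -l`.
    for name in find_agents_with_external_tools():
        print(name)
```

Each agent it lists would still need a manual check of how the external tool behaves when it gives up on its own.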

@HideoYamauchi
Contributor Author

Hi Oyvind,

> If you run grep -e "run_command" -e "pexpect" agents/*/*.py you should get most of the agents that might have similar issues if the tool they use uses internal timeout.

Okay!

Let's go through all the fence_agents once.

However, if the agents execute a wide variety of external commands, handling and verifying each of them will be very troublesome.
I think it would be preferable if the timeout could be ignored purely on the fence_agent side (in the main body of the agent), but I don't know whether that is possible.

Best Regards,
Hideo Yamauchi.
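
To make the "agent side only" idea concrete, here is a minimal sketch of what ignoring the timeout in the agent's own code could look like. It is not the fencing library's implementation: run_tool, timeout_s and disable_timeout are hypothetical names used for illustration, and the real run_command helper may work differently. The sketch also shows the limit of the approach: the agent can stop enforcing its own deadline, but it cannot change a timeout built into the external tool.

```python
import subprocess
import sys


def run_tool(argv, timeout_s, disable_timeout=False):
    """Run an external command and return (returncode, stdout).

    Hypothetical helper, not part of fence-agents. When disable_timeout is
    set, the agent side waits indefinitely and leaves it to the caller
    (Pacemaker, in this thread) to enforce the overall deadline.
    """
    effective = None if disable_timeout else timeout_s
    try:
        proc = subprocess.run(argv, capture_output=True, text=True,
                              timeout=effective)
    except subprocess.TimeoutExpired:
        # The agent-side timer fired; report this as (None, "").
        return None, ""
    # Note: if the tool gives up because of its OWN internal timeout,
    # it simply exits here with a non-zero code, whatever `effective` is.
    return proc.returncode, proc.stdout


if __name__ == "__main__":
    # Stand-in for a slow external tool: sleeps 2 seconds, then prints.
    slow = [sys.executable, "-c", "import time; time.sleep(2); print('done')"]
    print(run_tool(slow, timeout_s=1))                        # agent-side timer applies
    print(run_tool(slow, timeout_s=1, disable_timeout=True))  # agent waits for the tool
```

With this shape, a tool that exits early on its own timer still surfaces as an ordinary failure, which is why each external command would still need to be checked individually.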
