Re-promoting a peer with an active connection fails. #2572
Comments
Let's try to reproduce it in our e2e simulation test.
I encountered an instance of this issue today while looking at the logs of the deployed nodes running PeerSharing. After a mini-protocol error, I see the node being shut down with a DemoteAsynchronous trace, but I do not see a clean exit message anywhere before the node is promoted again, which in turn leads to a ConnectionExists error.
I analysed the logs and extracted the following trace sequence, from node start to the first ConnectionExists error:
I believe this is fixed by #4734 and #4555. Failing hot demotions are a bit more frequent than we expected, and the node wasn't handling these cases very well. In particular, it wasn't waiting for the connection to actually terminate before removing the peers from the appropriate sets.
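To illustrate the idea of waiting for termination before updating the peer sets, here is a minimal Python sketch. It is not the code from #4734 or #4555. The class `ToyPeerSets`, its `closing` set, and all method names are hypothetical, and the model assumes that a peer only becomes promotable again once its old connection has been reported as terminated.

```python
class ToyPeerSets:
    """Hypothetical model of the peer sets; not the project's data types."""

    def __init__(self):
        self.hot = set()
        self.cold = set()
        # Peers whose hot demotion failed and whose connection is still closing.
        self.closing = set()

    def hot_demotion_failed(self, peer):
        # Do NOT move the peer to `cold` yet: the connection may still exist.
        self.hot.discard(peer)
        self.closing.add(peer)

    def connection_terminated(self, peer):
        # Only once the connection is really gone does the peer become cold.
        if peer in self.closing:
            self.closing.remove(peer)
            self.cold.add(peer)

    def promotable(self):
        # Only cold peers are candidates for a new promotion attempt.
        return set(self.cold)


def demo():
    sets = ToyPeerSets()
    sets.hot.add("peer-A")
    sets.hot_demotion_failed("peer-A")
    before = sets.promotable()   # connection still closing
    sets.connection_terminated("peer-A")
    after = sets.promotable()    # connection gone, peer is cold again
    return before, after
```

In this sketch the peer is held in an intermediate set while its connection is closing, so a promotion attempt cannot race with the old connection.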
A connection that was initiated locally and went from Cold -> Warm -> Hot -> Cold (the other side still uses us, so it is Warm) gets stuck in an un-promotable state. Subsequent attempts to promote the peer happen every 10s and fail with the error above.
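The scenario above can be sketched as a toy model in Python. This is not the project's code. `ToyConnectionManager`, `ConnectionExists`, and the method names are hypothetical, and the model assumes that an outbound request is refused while an entry for that peer is still present.

```python
class ConnectionExists(Exception):
    """Hypothetical error: an outbound request hit an existing connection."""


class ToyConnectionManager:
    def __init__(self):
        # peer -> temperature of the connection as seen by each side
        self.connections = {}

    def request_outbound(self, peer):
        # Assumption: any remaining entry for the peer blocks a new request.
        if peer in self.connections:
            raise ConnectionExists(peer)
        self.connections[peer] = {"local": "warm", "remote": "cold"}

    def release_outbound(self, peer):
        conn = self.connections[peer]
        conn["local"] = "cold"
        # The entry is dropped only if the remote side no longer uses it.
        if conn["remote"] == "cold":
            del self.connections[peer]


def reproduce(retries=3):
    cm = ToyConnectionManager()
    peer = "peer-A"
    cm.request_outbound(peer)                # Cold -> Warm (initiated locally)
    cm.connections[peer]["local"] = "hot"    # Warm -> Hot
    cm.connections[peer]["remote"] = "warm"  # the other side starts using us
    cm.release_outbound(peer)                # Hot -> Cold on the local side
    failures = 0
    for _ in range(retries):                 # stands in for the 10s retry loop
        try:
            cm.request_outbound(peer)
        except ConnectionExists:
            failures += 1
    return failures
```

Under these assumptions every retry fails, because the entry kept alive by the remote side is never cleared by a local promotion attempt.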