-
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AddPeer API #5123
base: main
Are you sure you want to change the base?
AddPeer API #5123
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What specific problem are we trying to solve? Peers automatically get added in. Is this specific to after a peer remove step?
Yes, after peer removal, if we want to join the cluster again, one method is to change the server name, but in our case, we want to add it after some minutes or hours. The solution from the code only works for peers already removed, but that was kept in the hashmap |
You could simply shutdown the server and do the maintenance needed and restart? If you need to move stream and consumer peers off that machine during the downtime you can do that separately. I will double check when the system will re-add a peer that was removed.. |
We can't do that because we want to adjust the Quorum Number. For example, we have 9 nodes, so we need 5 to reach the meta leader quorum. During our maintenance, we scale up 3 new nodes and scale down 3. Also, the need to repeat that process multiple times, and we also want to remove the added nodes when the first ones removed are recovered. |
With your 9 node cluster, you can have 4 failures in terms of the whole cluster being available (meta). What purpose is being served by scaling up to 12? |
Why do you want to adjust the quorum number? The process of swapping machines in and out works really well in a rolling fashion if you bring nodes back with set Rather than carry API bloat I'd rather want to see a better process used here for maintenance - and discover what we can help you to achieve a better process that keeps the RAFT layer stable over time. |
WIP AddPeer API.
Need to:
Signed-off-by: Ramon Berrutti [email protected]