False failure detection triggered by the "extra first heartbeat"
When several (10-20) nodes join at approximately, but not exactly, the same time, a false failure detection can occur, and it does not heal.
on 2013-10-22 12:16 *
By Patrik Nordwall
Assigned to set to Patrik Nordwall
Status changed from New to Accepted
on 2013-10-29 15:30 *
By Patrik Nordwall
Assigned to changed from Patrik Nordwall to -none-
Status changed from Fixed to New
I am re-opening this because I have seen it again. It is probably a race. I suggest that we solve it by switching to ping-pong heartbeating.
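
To make the suggestion concrete, here is a minimal sketch of what ping-pong (request/response) heartbeating could look like. It is not the actual cluster code: the class `PingPongMonitor`, its methods, and the `timeout` value are all hypothetical names chosen for illustration, and time is passed in explicitly to keep the example deterministic. The idea shown is that a node only monitors peers it has itself pinged, so a heartbeat arriving from a peer it never pinged cannot start a failure detector on its own.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class PingPongMonitor:
    """Hypothetical monitor: only peers that we have pinged are tracked."""

    timeout: float  # assumed: seconds without a reply before a peer is suspected
    last_seen: Dict[str, float] = field(default_factory=dict)

    def send_ping(self, peer: str, now: float) -> None:
        # Tracking starts when *we* send the first ping to a peer.
        # Later pings do not reset the timestamp; only replies do.
        self.last_seen.setdefault(peer, now)

    def receive_pong(self, peer: str, now: float) -> None:
        # A reply from a peer we are not monitoring is ignored,
        # so it cannot start a detector by itself.
        if peer in self.last_seen:
            self.last_seen[peer] = now

    def stop_monitoring(self, peer: str) -> None:
        # Dropping the entry also drops any pending suspicion.
        self.last_seen.pop(peer, None)

    def suspected(self, now: float) -> Set[str]:
        # Peers whose last reply (or first ping) is older than the timeout.
        return {p for p, t in self.last_seen.items() if now - t > self.timeout}


if __name__ == "__main__":
    monitor = PingPongMonitor(timeout=3.0)
    monitor.receive_pong("node-b", now=0.0)  # unsolicited, ignored
    monitor.send_ping("node-a", now=0.0)
    monitor.receive_pong("node-a", now=1.0)
    print(monitor.suspected(now=3.5))  # set()
    print(monitor.suspected(now=5.0))  # {'node-a'}
```

With this shape the side that decides a peer is unreachable is the same side that initiated the exchange, which would avoid a race between an early one-way heartbeat and a later change in who is sending to whom. Whether that is the race seen here is an assumption.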