Uploaded image for project: 'Flume (READ-ONLY)'
  1. Flume (READ-ONLY)
  2. FLUME-396

A Master peer that goes down can not always rejoin the ensemble.

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: v0.9.5
    • Component/s: Master
    • Labels:
      None

      Description

      • Scenario 1:

      Have a 3 master setup.
      Have some nodes heart beating across the nodes.
      Kill one of the masters.

      Nodes that once heartbeated with dead master move to other masters.

      Attempt to bring dead master back up.

      • Revived master eventually get into exception loop that never stops.
      • Scenario 2:

      Have a 3 master setup.
      Have some nodes heart beating across the nodes.
      Kill one of the masters.

      Nodes that once heartbeated with dead master move to other masters.

      Kill a second master.

      All nodes now heartbeat with single remaining master.

      Bring up a dead master.
      It rejoins.

      Bring up other dead master.
      It rejoins.

      (All heart beats remain with the master that initally survived)

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              jon Jonathan Hsieh
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated: