  1. Flume (READ-ONLY)
  2. FLUME-708

Flume nodes drop heartbeats when reconfiguring

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Bug
    • Affects Version/s: v0.9.5
    • Fix Version/s: None
    • Component/s: Node, Technical Debt
    • Labels: None

      Description

      When Flume receives a reconfiguration command from the master, it performs the changes in the heartbeat thread. This causes the node to drop heartbeats until either the task completes or Flume forcibly kills the existing driver thread. This isn't a show stopper, because there is a timeout associated with shutting down the existing driver, but it's easy to see this as a place where errors can occur.

      I believe this is indicative of a larger issue in the way the node handles heartbeats and (re)configuration. We should revisit this communication as part of the master re-arch, which implicitly involves the heartbeat and communication systems.
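
      To illustrate the blocking described above, here is a minimal, self-contained sketch. It is not Flume's code: the class HeartbeatSketch, the method heartbeatSentDuringReconfig, and the 200 ms SHUTDOWN_TIMEOUT_MS are all hypothetical names and values chosen for illustration. The slow reconfiguration is modelled as a task that ends when the next heartbeat goes out or when the timeout expires, whichever comes first. Run inline, the task holds the heartbeat thread until the timeout; handed to a separate executor (one possible approach, not necessarily what the project adopted), the heartbeat goes out first.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

public class HeartbeatSketch {
    // Hypothetical stand-in for the driver-shutdown timeout mentioned in the report.
    static final long SHUTDOWN_TIMEOUT_MS = 200;

    /**
     * Simulates one reconfiguration command arriving on the heartbeat thread.
     * Returns true if the next heartbeat went out before the reconfiguration
     * gave up waiting, false if the reconfiguration ran into the timeout first.
     */
    static boolean heartbeatSentDuringReconfig(boolean offload) throws Exception {
        CountDownLatch nextHeartbeat = new CountDownLatch(1);
        ExecutorService reconfigExecutor = Executors.newSingleThreadExecutor();
        try {
            // Stand-in for a slow reconfiguration: it ends when the next
            // heartbeat is sent or when the timeout expires.
            Callable<Boolean> reconfigure =
                () -> nextHeartbeat.await(SHUTDOWN_TIMEOUT_MS, TimeUnit.MILLISECONDS);

            Future<Boolean> result;
            if (offload) {
                // Hand the work to another thread; the heartbeat thread moves on.
                result = reconfigExecutor.submit(reconfigure);
            } else {
                // Run the work on the heartbeat thread itself; this call blocks
                // until the timeout because no heartbeat can be sent meanwhile.
                result = CompletableFuture.completedFuture(reconfigure.call());
            }

            // The heartbeat thread sends its next heartbeat.
            nextHeartbeat.countDown();
            return result.get();
        } finally {
            reconfigExecutor.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("inline:    " + heartbeatSentDuringReconfig(false));
        System.out.println("offloaded: " + heartbeatSentDuringReconfig(true));
    }
}
```

      In this sketch the inline call reports false (the heartbeat was delayed for the full timeout) and the offloaded call reports true. A real node would also need to serialize overlapping reconfigurations and report their outcome, which the sketch leaves out.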

            People

            • Assignee: esammer Eric Sammer
            • Reporter: esammer Eric Sammer
