Uploaded image for project: 'Flume (READ-ONLY)'
  1. Flume (READ-ONLY)
  2. FLUME-191

Nodes cannot reconnect to master when master is not shut down gracefully

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Works for Me
    • Affects Version/s: v0.9.0
    • Fix Version/s: v0.9.1, v0.9.1u1, v0.9.2
    • Component/s: Node
    • Labels:
      None
    • Environment:
      Ubuntu 10.04

      Description

      Version: 0.9.0

      When I stop the master using /etc/init.d/flume-master stop, nodes become disconnected as expected. When I start the master back up using /etc/init.d/flume-master start, nodes reconnect as expected.

      When I stop the master using kill -9, nodes become disconnected as expected. When I start the master back up using /etc/init.d/flume-master start, nodes DO NOT reconnect as expected. When this happens, the spinning nodes do not show up in the first table of the master's web interface. They only show up in the Physical/logical Node Mapping section.

      It appears that nodes cannot reconnect to the master if it was not shut down gracefully. This problem is significant because the multiple master feature does not work. It is also significant because if you are using BESink with tail(), all data is lost until you can restart the flume-node process on every node.

      -Ben Standefer

        Attachments

          Activity

            People

            • Assignee:
              jon Jonathan Hsieh
              Reporter:
              aguynamedben Ben Standefer
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: