  CDH (READ-ONLY) / DISTRO-562

BlockPoolSliceScanner falls into an infinite loop

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: CDH4.5.0
    • Fix Version/s: CDH 5.4.0
    • Component/s: HDFS
    • Environment:
      jdk1.6, centos6.4, CDH4.5.0

      Description

      Hello.

      When the Hadoop cluster starts, BlockPoolSliceScanner begins scanning the blocks in my cluster.
      Then, seemingly at random, one datanode falls into an infinite loop, as the log below shows, and eventually all datanodes do the same.
      On each datanode, verification keeps failing for a single block.
      When I look for the failing block with: hadoop fsck / -files -blocks | grep blk_1223474551535936089_4702249, no HDFS file contains that block.

      It seems that the while loop in BlockPoolSliceScanner's scan method never exits.
      BlockPoolSliceScanner, line 650:

      while (datanode.shouldRun
      && !datanode.blockScanner.blockScannerThread.isInterrupted()
      && datanode.isBPServiceAlive(blockPoolId)) { ....

      The log lines shown below are printed in the verifyBlock method (BlockPoolSliceScanner:453).
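
To illustrate the suspected failure mode, here is a minimal, hypothetical model of a scan loop. It is not the Hadoop source: the class `ScanLoopSketch`, the methods `verify`, `naiveScan` and `guardedScan`, and the block name "orphan" are all invented for this sketch (Java 8 or later). It assumes that a block which fails verification stays at the head of the scan queue, so the loop condition remains true and the same block is picked again on every pass. The guarded variant shows one possible way such a loop could be bounded.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified model of a block scan loop. Not the Hadoop code.
class ScanLoopSketch {

    /** Stand-in for block verification: the block named "orphan" always fails. */
    static boolean verify(String blockId) {
        return !blockId.equals("orphan");
    }

    /**
     * Naive loop: a failed block stays at the head of the queue, so every
     * pass picks the same block again. maxIterations exists only so that
     * this demo terminates. Returns the number of iterations executed.
     */
    static int naiveScan(Deque<String> queue, int maxIterations) {
        int iterations = 0;
        while (!queue.isEmpty() && iterations < maxIterations) {
            iterations++;
            String block = queue.peekFirst();
            if (verify(block)) {
                queue.pollFirst(); // verified: done with this block
            }
            // On failure nothing changes, so the next pass sees the same block.
        }
        return iterations;
    }

    /**
     * Guarded loop: a block is dropped from the queue after maxRetries
     * failed attempts, so the loop always makes progress.
     */
    static int guardedScan(Deque<String> queue, int maxRetries) {
        Map<String, Integer> failures = new HashMap<>();
        int iterations = 0;
        while (!queue.isEmpty()) {
            iterations++;
            String block = queue.peekFirst();
            if (verify(block)) {
                queue.pollFirst();
                continue;
            }
            int failed = failures.merge(block, 1, Integer::sum);
            if (failed >= maxRetries) {
                queue.pollFirst(); // give up on this block
            }
        }
        return iterations;
    }

    public static void main(String[] args) {
        Deque<String> naive = new ArrayDeque<>(Arrays.asList("a", "orphan", "b"));
        Deque<String> guarded = new ArrayDeque<>(Arrays.asList("a", "orphan", "b"));
        System.out.println("naive iterations:   " + naiveScan(naive, 1000));
        System.out.println("guarded iterations: " + guardedScan(guarded, 3));
    }
}
```

In this model the naive loop uses up its whole iteration budget on the one failing block and never reaches "b", while the guarded loop finishes after a handful of passes. Whether the real scanner behaves like the naive variant is only a guess based on the symptoms described here.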

      Please excuse my poor English.

      -------------------------------------------------------------------------------------------------------------------------------------------------
      LOG:
      2014-01-21 18:36:50,582 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: Verification failed for BP-1040548460-58.229.158.13-1385606058039:blk_6833233229840997944_4702634 - may be due to race with write
      2014-01-21 18:36:50,582 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: Verification failed for BP-1040548460-58.229.158.13-1385606058039:blk_6833233229840997944_4702634 - may be due to race with write
      2014-01-21 18:36:50,582 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: Verification failed for BP-1040548460-58.229.158.13-1385606058039:blk_6833233229840997944_4702634 - may be due to race with write
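
To confirm from a datanode log that one block is being retried over and over, a small counter like the following can help. This is only a sketch: the class `VerificationFailureCounter` and its method `countFailures` are hypothetical, and the regular expression assumes the "Verification failed for <pool>:<block>" message format shown in the log lines above (Java 8 or later).

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: counts "Verification failed" log lines per block ID.
class VerificationFailureCounter {

    // Captures the block ID (for example blk_123_456) that follows the
    // block pool ID. The optional minus sign allows for negative block IDs.
    private static final Pattern FAILED =
            Pattern.compile("Verification failed for \\S+:(blk_-?\\d+_\\d+)");

    /** Returns failure counts keyed by block ID, in order of first appearance. */
    static Map<String, Integer> countFailures(Iterable<String> lines) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String line : lines) {
            Matcher m = FAILED.matcher(line);
            if (m.find()) {
                counts.merge(m.group(1), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java VerificationFailureCounter <datanode-log-file>");
            return;
        }
        Map<String, Integer> counts = countFailures(Files.readAllLines(Paths.get(args[0])));
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            System.out.println(e.getKey() + " " + e.getValue());
        }
    }
}
```

A single block ID with a very large count, concentrated in a short time window, would match the behavior reported here.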

            People

            • Assignee: Unassigned
            • Reporter: ikweesung
            • Votes: 1
            • Watchers: 5
