[DISTRO-565] HDFS-5225/HDFS-5031 causing hundreds of GB of log spam - Cloudera Open Source

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: CDH4.5.0
Fix Version/s: CDH4.6.0
Component/s: HDFS
Labels:
None

Description

We are running into HDFS-5225, which is resolved as a duplicate of HDFS-5031. When the DataNode gets into this state, the logs spew out hundreds of gigs of log lines, all the same (same exact line, referencing the same block):

2014-02-02 22:56:07,543 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: BP-911678927-10.159.27.212-1363638216198:blk_2930043100041915344_33721178 is no longer in the dataset
2014-02-02 22:56:07,543 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: BP-911678927-10.159.27.212-1363638216198:blk_2930043100041915344_33721178 is no longer in the dataset
2014-02-02 22:56:07,543 INFO org.apache.hadoop.hdfs.server.datanode.BlockPoolSliceScanner: BP-911678927-10.159.27.212-1363638216198:blk_2930043100041915344_33721178 is no longer in the dataset

We can suppress this with log4j, but are not sure if it would just hide a real underlying issue. The only way to stop the log spam after it starts is to restart the datanode.

A fix or workaround would be appreciated.

Attachments

Options
- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

Attachments

threads
82 kB
04/Feb/14 4:32 AM

Activity

People

Assignee:

David Wang

Reporter:

Bryan Beaudreault

Votes:

0 Vote for this issue

Watchers:

2 Start watching this issue

Dates

Created:

03/Feb/14 4:44 PM

Updated:

10/Feb/14 9:18 PM

Resolved:

10/Feb/14 9:18 PM