CDH (READ-ONLY) / DISTRO-848

In some Hive ORC tables, a duplicate "base" directory appears inside the partition's base directory

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: CDH 5.8.3
    • Fix Version/s: None
    • Component/s: Hive
    • Labels:
      None
    • Environment:
      CDH 5.10.x

      Description

      Hi All,

      Periodically, in some Hive ORC tables, a duplicate "base" directory appears inside /table_name/partition_date=/base/. That is, all contents of /table_name/partition_date=/base/* are in /table_name/partition_date=/base/base/*. After that, the partition becomes bad: select count or any other select against it fails with an error.

      When we drop the duplicate "base" directory, the problem goes away.

      Why does this duplicate directory appear in our buckets?
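
      To find affected partitions before queries fail, a small check over a recursive directory listing may help. The following is a minimal sketch, not a confirmed diagnostic: it assumes the listing has already been reduced to plain path strings (for example, the last field of each line printed by `hdfs dfs -ls -R`). The function name `find_nested_base_dirs` and the file name `bucket_00000` are hypothetical and used only for illustration.

```python
from pathlib import PurePosixPath


def find_nested_base_dirs(paths, dir_name="base"):
    """Return sorted directories of the form .../base/base found in `paths`.

    `paths` is an iterable of path strings; how they are obtained is up to
    the caller (assumption: a recursive listing of the table directory).
    """
    nested = set()
    for raw in paths:
        parts = PurePosixPath(raw).parts
        # Look for two consecutive path components that are both `dir_name`.
        for i in range(1, len(parts)):
            if parts[i] == dir_name and parts[i - 1] == dir_name:
                nested.add(str(PurePosixPath(*parts[: i + 1])))
    return sorted(nested)


if __name__ == "__main__":
    # Hypothetical listing that mirrors the layout described in this issue.
    listing = [
        "/table_name/partition_date=/base/bucket_00000",
        "/table_name/partition_date=/base/base/bucket_00000",
    ]
    for directory in find_nested_base_dirs(listing):
        print(directory)
```

      The sketch only reports candidate directories. It does not delete anything, so the decision to drop a reported directory stays manual, as in the workaround described above.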

            People

            • Assignee:
              Unassigned
            • Reporter:
              rakhmadeev Akhmadeev Ramil