Uploaded image for project: 'Sqoop (READ-ONLY)'
  1. Sqoop (READ-ONLY)
  2. SQOOP-119

TextSplitter creates incorrect bound checks.

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0.0, 1.1.0
    • Fix Version/s: 1.2.0
    • Component/s: None
    • Labels:
      None

      Description

      The TextSplitter implementation does not advance the start point for calculating the subsequent split. For example to have three splits between a and d, the splitter should produce splits like ( col >= 'a', col < 'b'), (col >= 'b', col < 'c') and (col >= 'c', col <= 'd'). However, what it produces instead is ( col >= 'a', col < 'b'), (col >= 'a', col < 'c') and (col >= 'a', col <= 'd'). This results in intersecting splits which causes duplicate records.

        Attachments

          Activity

            People

            • Assignee:
              arvind Arvind Prabhakar
              Reporter:
              arvind Arvind Prabhakar
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: