  1. Hue
  2. HUE-8021

[indexer] Support splitting a field when indexing to Solr via REST

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.1.0
    • Fix Version/s: 4.2.0
    • Component/s: app.importer
    • Labels:
      None

      Description

      cf. the attached file: some fields can hold multiple values, e.g. multiple fix versions of a Jira issue ("5.12,5.13"), and Solr supports multivalued fields (i.e. both values are stored separately in the same field, "5.12" and "5.13", instead of as a single concatenated value "5.12,5.13").

      The goal is to support the 'split' operation of the CSV handler:
      https://lucene.apache.org/solr/guide/7_0/uploading-data-with-index-handlers.html#csv-formatted-index-updates

      The CSV handler is already the one used when the 'Index with a job' checkbox is not checked.

      1. In our case we should now show the '+ Operation' button even if the 'Index with a job' checkbox is not checked.
      2. Offer to add a split operation, cf. screenshot and syntax 'f.category.split=true&f.category.separator=%2C'

      The code that passes the CSV handler parameters to Solr is just 'kwargs' in https://github.com/cloudera/hue/blob/5f4efa6c62aa1616a10276efdfaad991010a29c2/desktop/libs/indexer/src/indexer/api3.py#L250
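A minimal sketch of how a split operation could be turned into CSV handler parameters and merged into such a 'kwargs' dict. The helper name `split_operation_params` is hypothetical and is not Hue's actual code; only the `f.<field>.split` and `f.<field>.separator` parameter names come from the Solr CSV handler documentation linked above. Python 3 is assumed.

```python
from urllib.parse import urlencode


def split_operation_params(field, separator=","):
    """Return Solr CSV handler parameters that split `field` on `separator`.

    Hypothetical helper for illustration only, not part of Hue.
    """
    return {
        "f.%s.split" % field: "true",
        "f.%s.separator" % field: separator,
    }


# Merge the split parameters into the keyword arguments sent to the handler.
kwargs = {}
kwargs.update(split_operation_params("category"))

# urlencode percent-encodes the comma separator as %2C.
print(urlencode(kwargs))  # f.category.split=true&f.category.separator=%2C
```

Keeping each operation as a small function that returns a dict would leave room for the 'trim' and 'map' operations to be added the same way.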

      (Keep in mind that we might add 'trim' and 'map' operations later.)

            People

            • Assignee:
              jgauthier Jean Francois Desjeans Gauthier
              Reporter:
              romain Romain Rigaux