Uploaded image for project: 'CDH (READ-ONLY)'
  1. CDH (READ-ONLY)
  2. DISTRO-542

Beeline reads garbage on "select *" from Parquet files created with Pig or MapReduce

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: CDH4.5.0
    • Fix Version/s: None
    • Component/s: Parquet
    • Labels:
      None

      Description

      Create a table with Hive or Impala, then populate it with Pig or MapReduce.

      With beeline (JDBC client to HiveServer2), "select *" from the table returns empty rows. Selecting all columns explicitly works as expected, and any aggregations as well. This means it is only failing when reading directly from HDFS without launching a MapReduce job.

      The Hive CLI, on the other hand, works correctly for all cases.

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              robw Rob Weltman
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: