HIVE-7189: Hive does not store column names in ORC

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.12.0, 0.13.0
    • Fix Version/s: None
    • Component/s: File Formats, ORC
    • Labels:
      None
    • Tags:
      orc orcstorage hcatstorer

      Description

      We uncovered the following discrepancy between writing ORC files through Pig and Hive:

      The ORCFile header contains the column names. When data is stored through Pig (ORCStorage or HCatStorer), the column names are stored correctly. When data is stored through Hive, however, the columns are written as _col0, _col1, ..., _col99, and Hive uses the partition schema to map them to the real column names. Reading the same file through Pig is then problematic, because the user has to map the columns manually.
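
      The manual mapping mentioned above could look roughly like the sketch below. It assumes that a placeholder _colN corresponds to the N-th column (counting from zero) of the table schema known to Hive; the function name map_placeholder_columns and the sample column names are hypothetical and are not part of Hive or Pig.

```python
import re

# Matches the placeholder names described in this issue: _col0, _col1, ...
_PLACEHOLDER = re.compile(r"^_col(\d+)$")


def map_placeholder_columns(file_columns, table_columns):
    """Map column names found in an ORC file to table schema names.

    Assumption (not confirmed by the issue): _colN is the N-th column,
    zero-based, of the table schema. Names that are not placeholders,
    such as those written through Pig, are kept unchanged.
    """
    mapping = {}
    for name in file_columns:
        match = _PLACEHOLDER.match(name)
        if match is None:
            # Real column name already stored in the file.
            mapping[name] = name
            continue
        index = int(match.group(1))
        if index >= len(table_columns):
            raise ValueError(f"no table column at position {index} for {name!r}")
        mapping[name] = table_columns[index]
    return mapping


if __name__ == "__main__":
    # Hypothetical schema, for illustration only.
    print(map_placeholder_columns(["_col0", "_col1"], ["user_id", "user_name"]))
```

      A file written through Pig passes through this mapping unchanged, so the same step could be applied regardless of which tool produced the file.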

            People

            • Assignee: Daniel Dai (daijy)
            • Reporter: Chris Drome (cdrome)
            • Votes: 2
            • Watchers: 16