Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-7189

Hive does not store column names in ORC

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.12.0, 0.13.0
    • None
    • File Formats, ORC
    • None
    • orc orcstorage hcatstorer

    Description

      We uncovered the following discrepancy between writing ORC files through Pig and Hive:

      ORCFile header contains the name of the columns. Storing through Pig (ORCStorage or HCatStorer), the column names are stored fine. But when stored through hive they are stored as _col0, _col1,....,_col99 and hive uses the partition schema to map the column names. Reading the same file through Pig then has problems as user will have to manually map columns.

      Attachments

        Activity

          People

            daijy Daniel Dai
            cdrome Chris Drome
            Votes:
            2 Vote for this issue
            Watchers:
            16 Start watching this issue

            Dates

              Created:
              Updated: