Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-11117

Hive external table - skip header and trailer property issue

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • None
    • None
    • None
    • Production

    Description

      I am using an external hive table pointing to a HDFS location. The external table is partitioned on year/mm/dd folders. When there are more than one partition folder (ex: /2015/01/02/file.txt & /2015/01/03/file2.txt), the select on external table, skips the DATA RECORD instead of skipping the header/trailer record from one of the file).

      tblproperties ("skip.header.line.count"="1");

      Resolution: On enabling hive.input format instead of text input format and execution using TEZ engine instead of MapReduce resovled the issue.

      How to resolve the problem without setting these parameters ? I don't want to run the hive query using TEZ.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            jana_chander Janarthanan

            Dates

              Created:
              Updated:

              Slack

                Issue deployment