Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-14354

Cleanup ORC reader interfaces and redundant metadata objects

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.2.0
    • None
    • ORC
    • None

    Description

      ORC reader interface has to separated out to a new interface that only returns metadata.
      OrcTail should implement the new interface that only provides metadata information about the file.
      OrcFileMetadata object contains many redundant information that can be obtained from OrcTail. This class should be cleaned up and OrcTail should be reused.
      Remove all references to List<OrcProto.Types> and replace them with TypeDescription.
      SchemaEvolution objects are created in multiple places. But the information required for creating SchemaEvolution is already present within the Reader (file types, projection and schema on read).
      Schema on read can be specified only to RecordReader options. Reader options should also accept schema on read type description.
      Consolidate Reader options and RecordReader options.

      Attachments

        Activity

          People

            prasanth_j Prasanth Jayachandran
            prasanth_j Prasanth Jayachandran
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: