Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-13291

ORC BI Split strategy should consider block size instead of file size

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.1.0
    • 1.3.0, 2.1.0
    • ORC
    • None

    Description

      When we force split strategy to use "BI" (using hive.exec.orc.split.strategy), entire file is considered as single split. This might be inefficient when the files are large. Instead, BI should consider splitting at block boundary.

      Attachments

        1. HIVE-13291.1.patch
          11 kB
          Prasanth Jayachandran
        2. HIVE-13291.2.patch
          9 kB
          Prasanth Jayachandran
        3. HIVE-13291.3.patch
          9 kB
          Prasanth Jayachandran
        4. HIVE-13291-branch-1.patch
          8 kB
          Prasanth Jayachandran

        Issue Links

          Activity

            People

              prasanth_j Prasanth Jayachandran
              gopalv Gopal Vijayaraghavan
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: