Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-17852

remove support for list bucketing "stored as directories" in 3.0

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      From the email thread:

      1) LB, when stored as directories, adds a lot of low-level complexity to Hive tables that has to be accounted for in many places in the code where the files are written or modified - from FSOP to ACID/replication/export.
      2) While working on some FSOP code I noticed that some of that logic is broken - e.g. the duplicate file removal from tasks, a pretty fundamental correctness feature in Hive, may be broken. LB also doesn’t appear to be compatible with e.g. regular bucketing.
      3) The feature hasn’t seen development activity in a while; it also doesn’t appear to be used a lot.

      Keeping with the theme of cleaning up “legacy” code for 3.0, I was proposing we remove it.

      (2) also suggested that, if needed, it might be easier to implement similar functionality by adding some flexibility to partitions (which LB directories look like anyway); that would also keep the logic on a higher level of abstraction (split generation, partition pruning) as opposed to many low-level places like FSOP, etc.

      Attachments

        1. HIVE-17852.21.patch
          264 kB
          László Bodor
        2. HIVE-17852.20.patch
          263 kB
          László Bodor
        3. HIVE-17852.19.patch
          263 kB
          László Bodor
        4. HIVE-17852.18.patch
          261 kB
          László Bodor
        5. HIVE-17852.17.patch
          290 kB
          László Bodor
        6. HIVE-17852.16.patch
          339 kB
          László Bodor
        7. HIVE-17852.15.patch
          355 kB
          László Bodor
        8. HIVE-17852.14.patch
          299 kB
          László Bodor
        9. HIVE-17852.13.patch
          301 kB
          László Bodor
        10. HIVE-17852.12.patch
          300 kB
          László Bodor
        11. HIVE-17852.11.patch
          300 kB
          László Bodor
        12. HIVE-17852.10.patch
          300 kB
          László Bodor
        13. HIVE-17852.09.patch
          300 kB
          László Bodor
        14. HIVE-17852.08.patch
          294 kB
          László Bodor
        15. HIVE-17852.07.patch
          294 kB
          László Bodor
        16. HIVE-17852.06.patch
          293 kB
          László Bodor
        17. HIVE-17852.05.patch
          295 kB
          László Bodor
        18. HIVE-17852.04.patch
          508 kB
          László Bodor
        19. HIVE-17852.03.patch
          258 kB
          László Bodor
        20. HIVE-17852.02.patch
          258 kB
          László Bodor
        21. HIVE-17852.01.patch
          265 kB
          László Bodor

        Issue Links

          Activity

            People

              abstractdog László Bodor
              sershe Sergey Shelukhin
              Votes:
              2 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated: