Hive
  1. Hive
  2. HIVE-2814

Can we have a feature to disable creating empty buckets on a larger number of buckets creates?

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Minor Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Tags:
      hive bucketing

      Description

      When we create buckets on a larger datasets, its not often that all the partitions have same number of buckets so we choose the largest possible number to capture the buckets mostly.

      It results into creating lot of empty buckets, which might be an overhead of hadoop as well as for hive queries.
      Also it takes a lot of time to just create empty buckets.

      Is there a way where I can say do not create empty buckets?

        Activity

        Nitin Pawar created issue -

          People

          • Assignee:
            Unassigned
            Reporter:
            Nitin Pawar
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:

              Development