Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2814

Can we have a feature to disable creating empty buckets on a larger number of buckets creates?

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • hive bucketing

    Description

      When we create buckets on a larger datasets, its not often that all the partitions have same number of buckets so we choose the largest possible number to capture the buckets mostly.

      It results into creating lot of empty buckets, which might be an overhead of hadoop as well as for hive queries.
      Also it takes a lot of time to just create empty buckets.

      Is there a way where I can say do not create empty buckets?

      Attachments

        Activity

          People

            rmsmani@gmail.com Mani M
            nitinpawar432 Nitin Pawar
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: