Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-7148

Use murmur hash to create bucketed tables

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      HIVE-7121 introduced murmur hashing for queries that don't insert into bucketed tables. This was done to achieve better distribution of the data. The same should be done for bucketed tables as well, but this involves making sure we don't break backwards compat. This probably means that we have to store the partitioning function used in the metadata and use that to determine if SMB and bucketed map-join optimizations apply.

        Issue Links

          Activity

          Hide
          Downchuck Charles Pritchard added a comment -

          I could really use custom bucketing functions, as I want to use buckets instead of partitions based on a derived value.

          Show
          Downchuck Charles Pritchard added a comment - I could really use custom bucketing functions, as I want to use buckets instead of partitions based on a derived value.

            People

            • Assignee:
              Unassigned
              Reporter:
              hagleitn Gunther Hagleitner
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:

                Development