Hive
  1. Hive
  2. HIVE-6455 Scalable dynamic partitioning and bucketing optimization
  3. HIVE-6761

Hashcode computation does not use maximum parallelism for scalable dynamic partitioning

    Details

    • Type: Sub-task Sub-task
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.13.0, 0.14.0
    • Fix Version/s: 0.13.0, 0.14.0
    • Component/s: Query Processor
    • Labels:
      None

      Description

      Hashcode computation for HIVE-6455 should consider all the partitioning columns and bucket number to distribute the rows. The following code

      for (int i = 0; i < partitionEval.length - 1; i++) {
      

      ignores the last partition column thereby generating lesser hashcodes.

      1. HIVE-6761.1.patch
        2 kB
        Prasanth Jayachandran

        Activity

        Thejas M Nair made changes -
        Status Resolved [ 5 ] Closed [ 6 ]
        Gunther Hagleitner made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]
        Prasanth Jayachandran made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Prasanth Jayachandran made changes -
        Attachment HIVE-6761.1.patch [ 12637047 ]
        Prasanth Jayachandran made changes -
        Component/s Query Processor [ 12312586 ]
        Prasanth Jayachandran made changes -
        Field Original Value New Value
        Fix Version/s 0.13.0 [ 12324986 ]
        Fix Version/s 0.14.0 [ 12326450 ]
        Prasanth Jayachandran created issue -

          People

          • Assignee:
            Prasanth Jayachandran
            Reporter:
            Prasanth Jayachandran
          • Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development