Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-10215

Large IN() clauses: deep hashCode performance during optimizer pass

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.2.0
    • Fix Version/s: 1.2.0
    • Component/s: Logical Optimizer
    • Labels:
      None
    • Release Note:
      Use object identity to prevent recursion instead of equality in the optimizer visitor pattern

      Description

      The logical optimizer uses several maps and sets, which are exceeding expensive for large IN() clauses due to the fact that several part of the queries walk over the lists without short-circuiting during hashCode(), while equals() is faster due to short-circuiting via less expensive operators.

        Attachments

        1. HIVE-10215.1.patch
          0.8 kB
          Gopal Vijayaraghavan

          Activity

            People

            • Assignee:
              gopalv Gopal Vijayaraghavan
              Reporter:
              gopalv Gopal Vijayaraghavan
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: