Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-23934

Refactor TezCompiler#markSemiJoinForDPP to avoid redundant operations in nested while

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Most of the code inside the nested while loop can be extracted and computed only once in the external loop. Moreover there are catch clauses for NPE which seem rather predictable and could possibly be avoided by proper checks.

      In addition the code should be adapted to handle the case of multi column semijoin reducers introduced by HIVE-21196.

      The goal of this issue is to refactor TezCompiler#markSemiJoinForDPP method to avoid redundant operations, improve code readability, and handle multicolumn semijoin reducers. As a side effect of this refactoring the method will be slightly more efficient although unlikely to have observable difference in practice.

      Attachments

        Activity

          People

            zabetak Stamatis Zampetakis
            zabetak Stamatis Zampetakis
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: