Hive / HIVE-4160 Vectorized Query Execution in Hive / HIVE-4472

OR, NOT Filter logic can lose an array, and always takes time O(VectorizedRowBatch.DEFAULT_SIZE)


    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: vectorization-branch, 0.13.0
    • Component/s: None
    • Labels: None

      Description

      The issue is in the files FilterExprOrExpr.java and FilterNotExpr.java.

      I posted a review for you at

      https://reviews.apache.org/r/10752/

      I think there is a bug related to the sharing of an array of integers. Also, one step of the algorithm always takes O(VectorizedRowBatch.DEFAULT_SIZE) time. If n << VectorizedRowBatch.DEFAULT_SIZE (n being the number of rows in the batch), this is a performance issue.
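
      To make the two concerns concrete, here is a minimal sketch of one way an OR and a NOT filter could avoid them. It is not the code from the patches or the review: the Batch class, the filter helper, the scratch arrays, and the value of DEFAULT_SIZE are all hypothetical stand-ins, and the sketch assumes the selection vector is kept in ascending order. The scratch arrays are copies and are never aliased to the batch's own array, and every loop runs over the n live rows rather than the full batch capacity.

```java
import java.util.Arrays;
import java.util.function.IntPredicate;

class OrNotFilterSketch {
    // Stand-in for VectorizedRowBatch.DEFAULT_SIZE; the value here is an assumption.
    static final int DEFAULT_SIZE = 1024;

    /** Hypothetical stripped-down batch that holds only the selection vector. */
    static final class Batch {
        int size;               // n: number of live rows
        boolean selectedInUse;  // false means rows 0..size-1 are all live
        int[] selected = new int[DEFAULT_SIZE];
    }

    // Scratch arrays owned by this expression and never aliased to batch.selected.
    private final int[] initial = new int[DEFAULT_SIZE];
    private final int[] passed = new int[DEFAULT_SIZE];
    private final int[] rest = new int[DEFAULT_SIZE];

    /** Stand-in for a child filter expression: compacts b.selected in place. */
    static void filter(Batch b, IntPredicate keep) {
        int out = 0;
        for (int j = 0; j < b.size; j++) {
            int row = b.selectedInUse ? b.selected[j] : j;
            if (keep.test(row)) {
                b.selected[out++] = row;
            }
        }
        b.size = out;
        b.selectedInUse = true;
    }

    /** Copies the n live row indices out of the batch so a child cannot overwrite them. */
    private int snapshot(Batch b) {
        for (int j = 0; j < b.size; j++) {
            initial[j] = b.selectedInUse ? b.selected[j] : j;
        }
        return b.size;
    }

    /** Writes all[0..n) minus sub[0..m) into out; both inputs ascending. Runs in O(n). */
    static int difference(int[] all, int n, int[] sub, int m, int[] out) {
        int k = 0, r = 0;
        for (int j = 0; j < n; j++) {
            if (k < m && sub[k] == all[j]) {
                k++;
            } else {
                out[r++] = all[j];
            }
        }
        return r;
    }

    /** NOT: keep the rows the child rejected. */
    void not(Batch b, IntPredicate child) {
        int n = snapshot(b);
        filter(b, child);
        int r = difference(initial, n, b.selected, b.size, rest);
        System.arraycopy(rest, 0, b.selected, 0, r);
        b.size = r;
        b.selectedInUse = true;
    }

    /** OR: rows passing left, plus rows failing left that pass right. */
    void or(Batch b, IntPredicate left, IntPredicate right) {
        int n = snapshot(b);
        filter(b, left);
        int n1 = b.size;
        System.arraycopy(b.selected, 0, passed, 0, n1);

        // Evaluate the right child only on rows the left child rejected.
        int r = difference(initial, n, passed, n1, rest);
        System.arraycopy(rest, 0, b.selected, 0, r);
        b.size = r;
        b.selectedInUse = true;
        filter(b, right);
        int n2 = b.size;
        System.arraycopy(b.selected, 0, rest, 0, n2);

        // Merge the two ascending result lists back into the batch.
        int i = 0, j = 0, out = 0;
        while (i < n1 && j < n2) {
            b.selected[out++] = passed[i] < rest[j] ? passed[i++] : rest[j++];
        }
        while (i < n1) b.selected[out++] = passed[i++];
        while (j < n2) b.selected[out++] = rest[j++];
        b.size = out;
    }

    public static void main(String[] args) {
        Batch b = new Batch();
        b.size = 6; // rows 0..5, selectedInUse == false
        new OrNotFilterSketch().or(b, row -> row % 2 == 0, row -> row == 3);
        System.out.println(Arrays.toString(Arrays.copyOf(b.selected, b.size))); // [0, 2, 3, 4]
    }
}
```

      In this sketch the cost of each call is proportional to n, so a batch with a handful of live rows does not pay for the full DEFAULT_SIZE capacity. The merge step depends on the ascending-order assumption; without it, a different way of combining the two result sets would be needed.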

        Attachments

        1. HIVE-4472.5.patch
          29 kB
          Jitendra Nath Pandey
        2. HIVE-4472.4.patch
          30 kB
          Jitendra Nath Pandey
        3. HIVE-4472.3.patch
          15 kB
          Jitendra Nath Pandey
        4. HIVE-4472.2.patch
          15 kB
          Jitendra Nath Pandey
        5. HIVE-4472.1.patch
          6 kB
          Jitendra Nath Pandey


            People

            • Assignee:
              jnp Jitendra Nath Pandey
              Reporter:
              ehans Eric N. Hanson
            • Votes:
              0
              Watchers:
              3
