Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-4160 Vectorized Query Execution in Hive
  3. HIVE-4472

OR, NOT Filter logic can lose an array, and always takes time O(VectorizedRowBatch.DEFAULT_SIZE)

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • vectorization-branch, 0.13.0
    • None
    • None

    Description

      The issue is in file FilterExprOrExpr.java and FilterNotExpr.java.

      I posted a review for you at

      https://reviews.apache.org/r/10752/

      I think there is a bug related to sharing of an array of integers. Also, one algorithm step takes O(DEFAULT_BATCH_SIZE) time always. If n<<DEFAULT_BATCH_SIZE then this is a performance issue.

      Attachments

        1. HIVE-4472.1.patch
          6 kB
          Jitendra Nath Pandey
        2. HIVE-4472.2.patch
          15 kB
          Jitendra Nath Pandey
        3. HIVE-4472.3.patch
          15 kB
          Jitendra Nath Pandey
        4. HIVE-4472.4.patch
          30 kB
          Jitendra Nath Pandey
        5. HIVE-4472.5.patch
          29 kB
          Jitendra Nath Pandey

        Activity

          People

            jnp Jitendra Nath Pandey
            ehans Eric N. Hanson
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: