Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-11185

Reuse orc::ColumnVectorBatch in the scanner life-cycle

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • Impala 4.1.0
    • Backend
    • None

    Description

      In HdfsOrcScanner::AssembleRows(), we always re-create a orc::ColumnVectorBatch. The ideal pattern is reusing the batch and only destroyed it when the scanner is closed.

      In the flame graph of TPC-H Q1 collected by drorke , the createRowBatch and destructors occupies almost half of the scanner time.

      Attachments

        1. tpch-q1-scanner-flame-graph.jpg
          720 kB
          Quanlong Huang

        Activity

          People

            stigahuang Quanlong Huang
            stigahuang Quanlong Huang
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: