IMPALA-12142

Default fetch_size of 10240 is suboptimal

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Impala 4.3.0
    • Component/s: Clients

    Description

      While working on IMPALA-12138, it turned out that the default settings of batch_size=1024 and fetch_size=10240 are not ideal for the coordinator, as measured by RowMaterializationTimer. My guess at the cause is that HS2 result vectors are rounded up to power-of-2 sizes, which leads to extra allocations and copying.

      query: select * from tpch_parquet.lineitem
      RowMaterializationTimer (before and after IMPALA-12138)
      fetch_size=10240: 3.6s -> 3.2s
      fetch_size=8192: 2.8s -> 2.6s
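
      The rounding guess above can be illustrated with a little arithmetic. The sketch below is not Impala code: it only assumes, as guessed in the description, that a result vector's capacity is rounded up to the next power of 2, and the helper names next_power_of_two and rounding_slack are hypothetical. Under that assumption, fetch_size=10240 needs a capacity of 16384 with 37.5% of it unused, while fetch_size=8192 fits exactly.

```python
def next_power_of_two(n: int) -> int:
    """Return the smallest power of two that is >= n (n must be >= 1)."""
    if n < 1:
        raise ValueError("n must be positive")
    return 1 << (n - 1).bit_length()


def rounding_slack(fetch_size: int):
    """Return (capacity, unused_fraction) for a vector holding fetch_size rows.

    Assumption: capacity is rounded up to a power of two. This mirrors the
    guess in the issue description, not a confirmed allocation strategy.
    """
    capacity = next_power_of_two(fetch_size)
    return capacity, (capacity - fetch_size) / capacity


BATCH_SIZE = 1024  # default batch_size quoted in the description

for fetch_size in (10240, 8192):
    capacity, unused = rounding_slack(fetch_size)
    batches = fetch_size // BATCH_SIZE
    print(
        f"fetch_size={fetch_size}: {batches} batches, "
        f"capacity={capacity}, unused={unused:.1%}"
    )
```

      Both values are whole multiples of batch_size=1024 (10 and 8 batches), so under this assumption the difference comes only from the power-of-2 rounding, not from partially filled batches.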

            People

              Assignee: Unassigned
              Reporter: Csaba Ringhofer (csringhofer)