Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2885

Scanners store per-split objects in per-query object pool

    XMLWordPrintableJSON

Details

    Description

      Various scanners store control structures in RuntimeState::object_pool_ to be cleaned up at the end of the query. Since some of these control structures are allocated for every input split, a small amount of memory is wasted on control structures that are no longer needed. If a large number of scan ranges and columns are processed in a query, this can add megabytes or 10s of megabytes to the query's memory consumption.

      I added some logging and saw that for a largish scan there was 10000+ objects in the object pool.

      Attachments

        Issue Links

          Activity

            People

              kwho Michael Ho
              tarmstrong Tim Armstrong
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: