Apache Arrow / ARROW-12030

[C++] Change dataset readahead to be based on available RAM/CPU instead of fixed constants/options


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: C++
    • Labels: None

    Description

      Right now the dataset scanner adds readahead in a few places. At each one we have to pick a fixed maximum for how far to read ahead. Instead of trying to guess a good maximum, it might be nicer to base it on the available RAM.

      On the other hand, there may be a set of constants that simply works well everywhere, so this can probably wait until we better understand the memory usage of dataset scanning.
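As a rough illustration of the idea above, a readahead limit could be derived from a memory budget rather than hard-coded. The sketch below is hypothetical and is not part of the Arrow API: `ReadaheadBudget`, `ReadaheadDepth`, the `fraction` knob, and the default bounds are all assumptions made for the example. The caller is assumed to obtain the amount of free RAM by some platform-specific means.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper (not an Arrow API): inputs for sizing readahead.
struct ReadaheadBudget {
  std::uint64_t available_bytes;  // free RAM, measured by the caller
  double fraction;                // share of free RAM readahead may use (0..1)
  std::uint64_t bytes_per_batch;  // estimated in-memory size of one batch
};

// Returns how many batches to keep in flight, bounded to
// [min_depth, max_depth]. Assumes min_depth <= max_depth.
inline std::uint64_t ReadaheadDepth(const ReadaheadBudget& b,
                                    std::uint64_t min_depth = 1,
                                    std::uint64_t max_depth = 256) {
  // Without a size estimate or a positive share, fall back to the minimum.
  if (b.bytes_per_batch == 0 || !(b.fraction > 0.0)) return min_depth;

  const double share = std::min(b.fraction, 1.0);
  const double budget = static_cast<double>(b.available_bytes) * share;
  const double raw = budget / static_cast<double>(b.bytes_per_batch);

  // Clamp while still in floating point so the final cast cannot overflow.
  const double bounded = std::clamp(raw, static_cast<double>(min_depth),
                                    static_cast<double>(max_depth));
  return static_cast<std::uint64_t>(bounded);
}
```

With 8 GiB free, a 25% share, and 64 MiB batches, this gives a depth of 32. The fixed bounds remain as a safety net, which reflects the point in the description that a good set of constants may turn out to be enough.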


            People

              Assignee: Weston Pace (westonpace)
              Reporter: Weston Pace (westonpace)
              Votes: 0
              Watchers: 4
