Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-10545

Tune data_cache_write_concurrency based on the type of IO device

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Impala 4.0.0
    • Impala 4.2.0
    • Backend
    • ghx-label-14

    Description

      The data cache limits concurrency writes to the cache to avoid overwhelming the underlying IO device. This is controlled by the data_cache_write_concurrency flags and defaults to 1. For SSDs, we should be able to increase this to allow more concurrent writes to the data cache. This would allow the data cache to warm up faster and stay more up to date.

      One option is to detect the underlying IO device (similar to how we do this for other parts of Disk IO Mgr) and tune this parameter higher for SSDs.

      Attachments

        1. test8.out
          75 kB
          Michael Smith
        2. test1.out
          75 kB
          Michael Smith
        3. test.sh
          0.8 kB
          Michael Smith

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            MikaelSmith Michael Smith
            joemcdonnell Joe McDonnell
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment