Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-1

Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls

    XMLWordPrintableJSON

Details

    Description

      Currently, Hudi writers repeatedly list partitions to create file-system views in executors. This task addresses the reductions in listStatus name-node calls in Hudi 2.0 writers by taking advantage of MVCC view of HUDI and caching file-system view and reusing them.

      An embedded file-system view server on driver will be preloaded with the view. It will act as a cache and service File-system view calls from executors.

       

      https://github.com/uber/hudi/issues/433

      https://github.com/uber/hudi/issues/269

       

       

      Attachments

        Issue Links

          Activity

            People

              vbalaji Balaji Varadarajan
              vbalaji Balaji Varadarajan
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 1,008h
                  1,008h
                  Remaining:
                  Remaining Estimate - 1,008h
                  1,008h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified