Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-7432

Make Manifest Committer the default for abfs and gcs

    XMLWordPrintableJSON

Details

    • By default, the mapreduce manifest committer is used for jobs working with abfs and gcs.. Hadoop mapreduce jobs will pick this up automatically; for Spark it is a bit complicated: read the docs to see the steps required.

    Description

      Switch to the manifest committer as default for abfs and gcs

      • abfs: needed for performance, scale and resilience under some failure modes
      • gcs: provides correctness through atomic task commit and better job commit performance

      Attachments

        Issue Links

          Activity

            People

              stevel@apache.org Steve Loughran
              stevel@apache.org Steve Loughran
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: