Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1473

Sqoop should allow users to control export parallelism

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • None
    • None

    Description

      Sqoop uses MapReduce jobs to export files back to a table in the database. The degree of parallelism is controlled by the number of splits; i.e., the number of input files used. The bottleneck in the system, though, is likely to be the database itself.

      Users should have the ability to tune the number of parallel exporters being used to a degree appropriate to their database deployment.

      Attachments

        1. MAPREDUCE-1473.2.patch
          15 kB
          Aaron Kimball
        2. MAPREDUCE-1473.patch
          16 kB
          Aaron Kimball

        Issue Links

          Activity

            People

              kimballa Aaron Kimball
              kimballa Aaron Kimball
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: