Uploaded image for project: 'Sqoop'
  1. Sqoop
  2. SQOOP-1138

incremental lastmodified should re-use output directory

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.4.4
    • Fix Version/s: 1.4.5
    • Component/s: None
    • Labels:
      None

      Description

      When using --incremental lastmodified option in sqoop command line "the second job will take both the old and new data and will merge them together into the final output, preserving only the last updated value for each row."

      However, when I try to run incremental lastmodified twice, the second time I get "FileAlreadyExistsException: Output directory already exists".

      The incremental lastmodified should read from this directory to get the data from the last run and should not throw an exception.

        Attachments

        1. SQOOP-1138.0.patch
          4 kB
          Abraham Elmahrek
        2. SQOOP-1138.1.patch
          20 kB
          Abraham Elmahrek
        3. SQOOP-1138.2.patch
          19 kB
          Abraham Elmahrek
        4. SQOOP-1138.4.patch
          19 kB
          Abraham Elmahrek
        5. SQOOP-1138.5.patch
          20 kB
          Abraham Elmahrek
        6. SQOOP-1138.6.patch
          20 kB
          Abraham Elmahrek

          Issue Links

            Activity

              People

              • Assignee:
                abec Abraham Elmahrek
                Reporter:
                jmspaggi Jean-Marc Spaggiari
              • Votes:
                5 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: