Uploaded image for project: 'Oozie'
  1. Oozie
  2. OOZIE-751

Sqoop jobs through oozie hangs if I try to load 3 or more table in parallel


    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 3.2.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Environment:

      CentOs 5.0, hadoop-0.20.2, sqoop-1.3.0, oozie-2.3.2


      I want to load data from SQL Server to HDFS and am using the sqoop action of Oozie as defined on page http://archive.cloudera.com/cdh/3/oozie-2.3.2-cdh3u3/DG_SqoopActionExtension.html.

      It works when I try to copy 1 table but when I try to copy 3 or more tables in parallel then the job just hangs. I don't see any errors anywhere in the logs.

      • I have confirmed that there are no deadlocks on the database side.
      • I have confirmed that if I try to load multiple table in parallel using sqoop command line then it works fine

      It looks like there is something in oozie sqoop action.

      One more thing that I noticed is that there are 3 oozie jobs running in the oozie console but 6 jobs are shown in Jobtracker UI (please see screenshot attached). Not sure why that is.

      The workflow.xml file, tasktracker logs for the task and how oozie directory looks on HDFS is attached.


        1. job_201202161642_33931_taskdetailshistory.jsp.htm
          9 kB
          Aman Preet Singh
        2. this_is_how_oozie_directory_structure_looks_in_hdfs.txt
          3 kB
          Aman Preet Singh
        3. how_3_oozie_jobs_look_in_jobtracker_ui.png
          111 kB
          Aman Preet Singh
        4. workflow.xml
          2 kB
          Aman Preet Singh
        5. tasklog.htm
          81 kB
          Aman Preet Singh



            • Assignee:
              apsingh Aman Preet Singh
            • Votes:
              0 Vote for this issue
              2 Start watching this issue


              • Created: