1. Oozie
  2. OOZIE-751

Sqoop jobs through oozie hangs if I try to load 3 or more table in parallel


    • Type: Bug Bug
    • Status: Resolved
    • Priority: Major Major
    • Resolution: Not A Problem
    • Affects Version/s: 3.2.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Environment:

      CentOs 5.0, hadoop-0.20.2, sqoop-1.3.0, oozie-2.3.2


      I want to load data from SQL Server to HDFS and am using the sqoop action of Oozie as defined on page http://archive.cloudera.com/cdh/3/oozie-2.3.2-cdh3u3/DG_SqoopActionExtension.html.

      It works when I try to copy 1 table but when I try to copy 3 or more tables in parallel then the job just hangs. I don't see any errors anywhere in the logs.

      • I have confirmed that there are no deadlocks on the database side.
      • I have confirmed that if I try to load multiple table in parallel using sqoop command line then it works fine

      It looks like there is something in oozie sqoop action.

      One more thing that I noticed is that there are 3 oozie jobs running in the oozie console but 6 jobs are shown in Jobtracker UI (please see screenshot attached). Not sure why that is.

      The workflow.xml file, tasktracker logs for the task and how oozie directory looks on HDFS is attached.

      1. tasklog.htm
        81 kB
        Aman Preet Singh
      2. workflow.xml
        2 kB
        Aman Preet Singh
      3. how_3_oozie_jobs_look_in_jobtracker_ui.png
        111 kB
        Aman Preet Singh
      4. this_is_how_oozie_directory_structure_looks_in_hdfs.txt
        3 kB
        Aman Preet Singh
      5. job_201202161642_33931_taskdetailshistory.jsp.htm
        9 kB
        Aman Preet Singh


        No work has yet been logged on this issue.


          • Assignee:
            Aman Preet Singh
          • Votes:
            0 Vote for this issue
            2 Start watching this issue


            • Created: