Apache Gobblin / GOBBLIN-711

stopping the master in cluster mode fails the job

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Component/s: gobblin-cluster

    Description

      Stopping the master process in cluster mode has the following issues:

      1. Graceful shutdown takes more than 40 seconds when a job is in progress (no worker node is actually executing; the master process is only tracking the job status).
      2. It wrongly reports errors (org.apache.gobblin.runtime.AbstractJobLauncher  - Failed to launch and run job job_PullFromWikipedia_1553650384618) and (org.apache.gobblin.cluster.HelixRetriggeringJobCallable  - Failed to run job PullFromWikipedia).
      3. It wrongly reports that the job is done (org.apache.gobblin.cluster.GobblinHelixJobLauncher  - Job job_PullFromWikipedia_1553650384618 is done, remove from running map.).

      The attached log (master-stop.log) shows the full output of stopping and starting the master; screenshots are attached as well.
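
      To make the expected behaviour concrete, here is a minimal sketch of a status tracker that returns promptly on shutdown and keeps "master stopped" separate from "job failed" and "job done". It is not Gobblin code: JobStatusTracker, Outcome, and every method name are hypothetical and exist only for illustration.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;

/** Hypothetical sketch; none of these names come from Gobblin or Helix. */
class JobStatusTracker {
    enum Outcome { COMPLETED, FAILED, DETACHED_ON_SHUTDOWN }

    // Completed by whatever observes the real job (assumption: some external watcher).
    private final CompletableFuture<Void> jobResult = new CompletableFuture<>();
    // Completed when the master process is asked to stop.
    private final CompletableFuture<Void> shutdownSignal = new CompletableFuture<>();

    void markJobCompleted() { jobResult.complete(null); }

    void markJobFailed(Throwable cause) { jobResult.completeExceptionally(cause); }

    void requestShutdown() { shutdownSignal.complete(null); }

    /**
     * Blocks until the job finishes or shutdown is requested, whichever comes first.
     * A shutdown while the job is still running yields DETACHED_ON_SHUTDOWN,
     * so the caller has no reason to log a failure or a completion.
     */
    Outcome awaitOutcome() {
        try {
            CompletableFuture.anyOf(jobResult, shutdownSignal).get();
        } catch (ExecutionException e) {
            // Only jobResult can complete exceptionally; classified below.
        } catch (InterruptedException e) {
            // Treat interruption like a shutdown request and keep the interrupt flag set.
            Thread.currentThread().interrupt();
            return Outcome.DETACHED_ON_SHUTDOWN;
        }
        if (jobResult.isCompletedExceptionally()) {
            return Outcome.FAILED;
        }
        if (jobResult.isDone()) {
            return Outcome.COMPLETED;
        }
        return Outcome.DETACHED_ON_SHUTDOWN;
    }
}
```

      With a tracker like this, calling requestShutdown() while the job is still running makes awaitOutcome() return DETACHED_ON_SHUTDOWN immediately, and the launcher would only log a failure or remove the job from its running map for FAILED or COMPLETED.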

          People

            Assignee: hutran Hung Tran
            Reporter: jaysen Jay Sen
            Votes: 0
            Watchers: 1
