Uploaded image for project: 'Ambari'
  1. Ambari
  2. AMBARI-11558

Ooozie start takes too long

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.1.0
    • None
    • None

    Description

      Currently Oozie Start task takes ~3 minutes. Which includes some time-
      expensive actions like:
      1\. Extracting a big tar archive
      2\. Execute prepare-war fom it
      3\. Checking for hdfs directory via hadoop binary.

      1 and 2 can be avoid on the non-first starts, by saving checksum of archive
      which was extracted, and not re-extracting it unless the checksum changed, or
      the unextracted folder is gone.

      3 can benefit from using fast WebHDFS calls.

      *As a result for me for non-first Oozie start this reduced time of start from 180 seconds to 13-30 seconds.*

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            aonishuk Andrew Onischuk
            aonishuk Andrew Onischuk
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment