SPARK-48217

Spark stdout and stderr logs are removed at the end of a Spark job triggered from a Cloudera Hue workflow

Details

    • Type: Improvement
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: 1.6.0
    • Fix Version/s: None
    • Component/s: Spark Submit
    • Labels: None

    Description

      Hello,

      We are running a Spark job that is triggered from a Cloudera Hue workflow. While the job runs, Spark prints its stdout and stderr logs as expected.

      However, the stdout and stderr logs are cleaned up once the workflow finishes with status Succeeded.

       

      The workflow triggers the following spark-submit command:

      /usr/bin/spark-submit \
        --master yarn-client \
        --driver-memory 4g \
        --executor-memory 16g \
        --executor-cores 4 \
        --class tst \
        --files `ls -m *.conf | tr -d '\n '` \
        --conf "spark.dynamicAllocation.maxExecutors=4" \
        --conf "spark.kryoserializer.buffer.max=1024" \
        tst.jar $*

       

       

      Is anyone familiar with this Spark job behavior, or can anyone suggest ideas to fix it?

       

      Thanks in advance
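
      One possible workaround, independent of whatever removes the logs, is to have the workflow's script keep its own copy of the output. The following is only a minimal sketch under stated assumptions: it assumes the workflow runs the command through a POSIX-style shell and that the chosen log location is writable and not cleaned up by the workflow. The helper name `run_with_log` and the log path in the usage comment are hypothetical and not part of Spark or Hue.

```shell
# run_with_log LOG_FILE CMD [ARGS...]   (hypothetical helper, not a Spark/Hue feature)
# Runs CMD, streams its combined stdout/stderr to the console as usual,
# appends a copy to LOG_FILE, and returns CMD's own exit status.
run_with_log() {
  rwl_log=$1
  shift
  # Create the log directory if it does not exist yet.
  mkdir -p "$(dirname "$rwl_log")" || return 1
  # Temporary file used to carry the command's exit status out of the pipeline.
  rwl_status_file=$(mktemp) || return 1
  # Merge stderr into stdout, then duplicate the stream into the log file.
  { "$@" 2>&1; echo "$?" > "$rwl_status_file"; } | tee -a "$rwl_log"
  rwl_status=$(cat "$rwl_status_file")
  rm -f "$rwl_status_file"
  return "$rwl_status"
}

# Example call (assumed log path, options shortened here):
#   run_with_log /path/to/persistent/tst.log /usr/bin/spark-submit --master yarn-client --class tst tst.jar "$@"
```

      The exit status is passed through so the workflow still sees a failed job as failed. Separately, if YARN log aggregation is enabled on the cluster, container logs are normally moved off the nodes once the application finishes and may still be retrievable with `yarn logs -applicationId <application id>`; whether that is what happens in this setup is not confirmed here.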

          People

            Assignee: Unassigned
            Reporter: Noam Shemesh (noamsh88)
