Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-28080

There is a problem to download and watch offline the history of an application with multiple attempts due to UI inconsistency

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete CommentsDelete


    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Incomplete
    • 2.3.1
    • None
    • Web UI



      If you are looking to watch locally a spark application attempt history, trying to see the history of the first attempt (or any attempt but the last one) would fail, because some UI inconsistently.

      The inconsistency is that in the spark history UI, the "app_id" column is clickable and will always take you to this application last attempt, but if you tried to download only the first attempt, you will get an error of application not found.


      How to reproduce:

      1. open spark any spark history server (if using Azure HDinsight the address would be https://<cluster name>.azurehdinsight.net/sparkhistory/)
      2. look for an application that have multiple attempts (ie - attempt ID > 1)
      3. look for the first attempt in this application and download it using the "download" button in the event column. save it in your local spark history folder (default: /tmp/spark-events)
      4. Start a local spark history server (typically: using the start-history-server.sh script)
      5. browse to the local history server and look for the application for which you downloaded the history.
      6. click the application name in the "App ID" column, and you would get the following error:
        "Application <your application ID> not found."

      Why ?

      because on the remote history server it is assumed that all Attempts history files are preset, so the "App ID" column points to the latest attempt of this app, while the "Attempt  ID" column points to the specific attempt.

      But if we have an application with two attempts, and we only want to research the first one, we download it locally, opening with the local history server, and intuitively clicking the link in the "app id" column, the link actually points to the second attempt, which we haven't even downloaded. 



          This comment will be Viewable by All Users Viewable by All Users


            Unassigned Unassigned Assign to me
            weiss_gal Gal Weiss
            1 Vote for this issue
            4 Start watching this issue




                Issue deployment