Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-8024

Luigi triggering resolved Blockmanager bug

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 1.3.1
    • None
    • Block Manager, Spark Core
    • None

    Description

      We are using Luigi with Spark to manage our jobs

      However we run into a unique rare case with the following conditions that trigger the resolved Block Manger Bug:

      • Dataset is relatively large ~ 1.5TB
      • Spark job is ran with Luigi
      • save to local HDFS

      The spark job would process data and mappings just fine, until the very end when it proceeds to save the files to local hdfs this is when it triggers this bug.

      However, the job saves and complete data successfully if it was saved to s3:// location.

      wondering what might cause this resolved bug to trigger when ran with luigi saving to local hdfs but not trigger when saved to s3 with luigi or ran without luigi?

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              cqnguyen Cory Nguyen
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: