Spark / SPARK-30190

HistoryServerDiskManager will fail on appStoreDir in s3

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.1.0
    • Fix Version/s: None
    • Component/s: Spark Core
    • Labels: None

    Description

      Hi

      While setting spark.eventLog.dir to s3a://..., I realized that the destination directory must already exist on S3.

      I think this is explained by the appStoreDir check in HistoryServerDiskManager, which tests whether the directory exists or can be created:

      if (!appStoreDir.isDirectory() && !appStoreDir.mkdir()) {
        throw new IllegalArgumentException(s"Failed to create app directory ($appStoreDir).")
      }

      But in S3, a directory neither exists nor can be created on its own: directories are only materialized by the existence of objects under them.
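
      To illustrate the point about directories, here is a toy in-memory model of an object store. It is only a sketch: PrefixModel and the key "spark-events/.keep" are hypothetical names, and this is not how S3, S3A or Spark are actually implemented. It shows that a "directory" is visible only while at least one object key starts with its prefix.

```scala
// Toy model: a bucket is just a set of object keys; directories are never stored.
object PrefixModel {
  // A "directory" is considered to exist only if some key starts with "<dir>/".
  def dirExists(keys: Set[String], dir: String): Boolean = {
    val prefix = if (dir.endsWith("/")) dir else dir + "/"
    keys.exists(_.startsWith(prefix))
  }

  def main(args: Array[String]): Unit = {
    val emptyBucket = Set.empty[String]
    // Hypothetical empty placeholder object under the prefix.
    val withMarker = emptyBucket + "spark-events/.keep"
    println(dirExists(emptyBucket, "spark-events")) // false
    println(dirExists(withMarker, "spark-events"))  // true
  }
}
```

      In this model, adding any object under the prefix, even an empty one, is enough to make the "directory" appear.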

      Before proposing a patch, I wanted to know which option is preferred:
      should we add a Spark option to skip the appStoreDir test, skip it only when a particular scheme is set, provide a custom implementation of HistoryServerDiskManager, ...?
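
      As a rough sketch of the second option (skipping the test only for particular schemes), assuming the location is available as a URI string: StoreDirCheck, objectStoreSchemes and shouldCheckDir are hypothetical names, not existing Spark APIs, and the list of schemes is only an assumption.

```scala
import java.net.URI

object StoreDirCheck {
  // Hypothetical list of schemes backed by object stores without real directories.
  val objectStoreSchemes: Set[String] = Set("s3", "s3a", "s3n")

  // Returns true when the directory existence/creation test should still run.
  // Locations without a scheme (plain local paths) are always checked.
  def shouldCheckDir(location: String): Boolean = {
    val scheme = Option(new URI(location).getScheme).map(_.toLowerCase)
    !scheme.exists(objectStoreSchemes.contains)
  }
}
```

      With this helper, a location such as "s3a://example-bucket/spark-events" (a made-up bucket) would skip the test, while "/tmp/spark-events" or a "file://" URI would still be checked.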

      Note for people facing "IllegalArgumentException: Failed to create app directory": you just have to put an empty file in the destination 'path' of the bucket.

          People

            Assignee: Unassigned
            Reporter: thierry accart (taccart)
            Votes: 0
            Watchers: 2