Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3979

Yarn backend's default file replication should match HDFS's default one

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 1.2.0
    • YARN
    • None

    Description

      This code in ClientBase.scala sets the replication used for files uploaded to HDFS:

          val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
      

      Instead of a hardcoded "3" (which is the default value for HDFS), it should be using the default value from the HDFS conf ("dfs.replication").

      Attachments

        Activity

          People

            vanzin Marcelo Masiero Vanzin
            vanzin Marcelo Masiero Vanzin
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: