Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23958

HadoopRdd filters empty files to avoid generating empty tasks that affect the performance of the Spark computing performance.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Duplicate
    • 2.4.0
    • None
    • Spark Core
    • None

    Description

      HadoopRdd filter empty files to avoid generating empty tasks that affect the performance of the Spark computing performance.

      Empty file's length is zero.

      Attachments

        Activity

          People

            Unassigned Unassigned
            guoxiaolongzte guoxiaolong
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: