Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-20255

FileIndex hierarchy inconsistency

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.1.0
    • 2.2.0
    • Spark Core
    • None

    Description

      Trying to get a grip on the FileIndex hierarchy, I was confused by the following inconsistency:

      On the one hand, PartitioningAwareFileIndex defines leafFiles and leafDirToChildrenFiles as abstract, but on the other it fully implements listLeafFiles which does all the listing of files. However, the latter is only used by InMemoryFileIndex.

      I'm hereby proposing to move this method (and all its dependencies) to the implementation class that actually uses it, and thus unclutter the PartitioningAwareFileIndex interface.

      Attachments

        Issue Links

          Activity

            People

              a.ionescu Adrian Ionescu
              a.ionescu Adrian Ionescu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: