Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-14908

Distributing CacheFiles through DFS does not work

    XMLWordPrintableJSON

Details

    Description

      User reported that distributing cache files through DFS does not work anymore: https://stackoverflow.com/questions/58978476/flink-1-9-wont-run-program-when-i-use-distributed-cache-why

      I think the problematic part is in RestClusterClient#submitJob:

      for (Map.Entry<String, DistributedCache.DistributedCacheEntry> artifacts : jobGraph.getUserArtifacts().entrySet()) {
      	artifactFileNames.add(new JobSubmitRequestBody.DistributedCacheFile(artifacts.getKey(), new Path(artifacts.getValue().filePath).getName()));
      	filesToUpload.add(new FileUpload(Paths.get(artifacts.getValue().filePath), RestConstants.CONTENT_TYPE_BINARY));
      }
      

      The code does not check if a file is in DFS, but just assumes it is in local FS and tries to add it to the rest request, which fails. The code on the receiver side in JobSubmitHandler still can support files distributed via DFS, but need to get proper paths to files in DFS.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dwysakowicz Dawid Wysakowicz
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: