Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-4063

Flink runner supports cluster-wide artifact deployments through the Distributed Cache

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: runner-flink
    • Labels:
      None

      Description

      As of now, Flink effectively has a dependency on an external storage system for artifact management. This is because the Flink Distributed Cache does not actually distribute and cache blobs itself, but rather expects that each node in a running cluster has access to a well-known artifact resource.

      We should get this for free whenever https://github.com/apache/flink/pull/5580 is merged (likely in 1.5). For now, we will have to defer to external storage systems like GCS or HDFS.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                bsidhom Ben Sidhom
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated: