Beam / BEAM-6765

Beam 2.10.0 for Python requires pyarrow 0.11.1, which is not installable in Google Cloud Dataflow

Details

    • Type: Bug
    • Status: Resolved
    • Priority: P2
    • Resolution: Fixed
    • Affects Version/s: 2.10.0
    • Fix Version/s: 2.10.0
    • Component/s: sdk-py-core
    • Labels: None

    Description

      When trying to run a Beam 2.10.0 job in Google Cloud Dataflow, I get the following error:

      Collecting pyarrow==0.11.1 (from -r requirements.txt (line 51))
      Could not find a version that satisfies the requirement pyarrow==0.11.1 (from -r requirements.txt (line 51)) (from versions: 0.9.0, 0.10.0, 0.11.0, 0.12.1)
      No matching distribution found for pyarrow==0.11.1 (from -r requirements.txt (line 51))
      

      This version exists, but it cannot be installed in Google Cloud Dataflow: it is published on PyPI only as a wheel, and Dataflow does not allow installing binary packages, only source packages.
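
One way to check ahead of time whether a pinned release ships a source package is to inspect the release's file list from the PyPI JSON API. The sketch below is only an illustration and is not part of the original report: the helper names `has_sdist` and `fetch_release_files` are hypothetical, the sample entries are hand-written rather than real PyPI data, and it assumes each entry in the API's `urls` list carries a `packagetype` field such as `sdist` or `bdist_wheel`.

```python
import json
from urllib.request import urlopen


def has_sdist(release_files):
    """Return True if any file in a release's file list is a source distribution."""
    return any(f.get("packagetype") == "sdist" for f in release_files)


def fetch_release_files(name, version):
    """Fetch the file list of one release from the PyPI JSON API (needs network access)."""
    url = f"https://pypi.org/pypi/{name}/{version}/json"
    with urlopen(url) as resp:
        return json.load(resp)["urls"]


# Hypothetical sample data shaped like the "urls" field; not real PyPI output.
wheel_only = [
    {"filename": "example-1.0-cp27-cp27mu-manylinux1_x86_64.whl",
     "packagetype": "bdist_wheel"},
]
with_source = wheel_only + [
    {"filename": "example-1.0.tar.gz", "packagetype": "sdist"},
]
```

With network access, `has_sdist(fetch_release_files("pyarrow", "0.11.1"))` would apply the same check to the release named in this issue. A release for which the check returns False is a candidate for the failure described above.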

          People

            Assignee: Unassigned
            Reporter: Barry Hart (barrywhart)
            Votes: 0
            Watchers: 4