Details
-
Bug
-
Status: Resolved
-
P2
-
Resolution: Fixed
-
None
-
None
Description
HadoopIO currently uses 2.7.0 and runners-spark uses 2.2.0 for hadoop-client, hadoop-common.
From amitsela
"Spark can be built against different hadoop versions, but the release in maven central is a 2.2.0 build (latest). ''
For HadoopIO, I don't know why 2.7.0 is picked at the beginning. I can check if it will work with 2.2.0.
I am creating this issue, since I think it there is a general question.
In principle, HadoopIO and other sdks Sources should work with any runners. But, when one set of runners require version A, but the other set of runners require version B, we will need a general solution for it.