Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
0.11.0, 0.12.0, 0.13.0, 0.13.1
-
None
-
None
Description
The call to set replication is called prior to uploading the archive file to hdfs, which does not throw an error, but the replication never gets set.
This has a significant impact on large jobs (especially hash joins) due to too many tasks hitting the data nodes.