Description
Map tasks do not explicitly set the permissions of their output files for shuffle. In a secure cluster the shuffle service is running as a different user than the map task, so the output files require group readability in order to serve up the data during the shuffle phase. If the user's UNIX umask is too restrictive (e.g.: 077) then the map task's file.out and file.out.index permissions can be too restrictive to allow the shuffle handler to access them.
Attachments
Attachments
Issue Links
- is related to
-
MAPREDUCE-7159 FrameworkUploader: ensure proper permissions of generated framework tar.gz if restrictive umask is used
-
- Resolved
-
-
TEZ-3894 Tez intermediate outputs implicitly rely on permissive umask for shuffle
-
- Closed
-
- relates to
-
HADOOP-11347 RawLocalFileSystem#mkdir and create should honor umask
-
- Resolved
-