Details
Description
FileInputFormat#listStatus is too slow file system cache is disabled.
TokenCache#obtainTokensForNamenodesInternal for every filesystem instance Master.getMasterPrincipal(conf) is caled which reloads YarnConfiguration .
For fileInput with 1k file will reload YarnConfiguration 1k times.
Master.getMasterPrincipal(conf) can be passed for obtainTokensForNamenodesInternal per filesystem call.