Details
-
Bug
-
Status: Patch Available
-
Major
-
Resolution: Unresolved
-
2.4.0
-
None
-
None
Description
WebHdfsFileSystem.open and HftpFileSystem.open incorrectly handles non-existing paths.
- 'open', does not really open anything, i.e., it does not contact the server, and therefore cannot discover FileNotFound, it's deferred until next read. It's counterintuitive and not how local FS or HDFS work. In POSIX you get ENOENT on open. LzoInputFormat.getSplits is an example of the code that's broken because of this.
- On the server side, FileDataServlet incorrectly sends SC_BAD_REQUEST instead of SC_NOT_FOUND for non-exitsing paths
Attachments
Attachments
Issue Links
- relates to
-
HADOOP-9361 Strictly define the expected behavior of filesystem APIs and write tests to verify compliance
- Closed
- requires
-
HDFS-6143 WebHdfsFileSystem open should throw FileNotFoundException for non-existing paths
- Closed
linking to
HADOOP-9361and FS semantics.Failing on the open if a file is not found is a core expectation of filesystems.
We could optimise any of the web filesystems by not doing that open (e,g, S3, s3n, swift) and waiting for the first seek. But we don't because things expect missing files to not be there.
Interesting that FileSystemContractBaseTest doesn't catch this