Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.3.0
-
None
-
None
Description
Currently, the short-circuit local read pathway allows HDFS clients to access files directly without going through the DataNode. However, all of these reads involve a copy at the operating system level, since they rely on the read() / pread() / etc family of kernel interfaces.
We would like to enable HDFS to read local files via mmap. This would enable truly zero-copy reads.
In the initial implementation, zero-copy reads will only be performed when checksums were disabled. Later, we can use the DataNode's cache awareness to only perform zero-copy reads when we know that checksum has already been verified.
Attachments
Attachments
Issue Links
- blocks
-
HDFS-5101 ZCR should work with blocks bigger than 2 GB
- Open
- is duplicated by
-
HDFS-3053 Support for true zero-copy (mmap-based) reads
- Resolved
- is part of
-
HDFS-5260 Merge zero-copy memory-mapped HDFS client reads to trunk and branch-2.
- Closed
- is related to
-
HDFS-5191 revisit zero-copy API in FSDataInputStream to make it more intuitive
- Resolved
-
HDFS-5109 Document new zero-copy short-circuit local reads feature
- Open
- relates to
-
HDFS-4949 Centralized cache management in HDFS
- Closed
-
HDFS-6007 Update documentation about short-circuit local reads
- Closed