Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.0.0-alpha-1, 2.4.6
-
None
-
Reviewed
-
Description
After the change in HBASE-17917 that use PREAD (ReadType.DEFAULT) for all user scan, the behavior of TableSnapshotInputFormat changed from STREAM to PREAD.
TableSnapshotInputFormat is supposed to be use with a YARN/MR or other batch engine that should read the entire HFile in the container/executor, with default always to PREAD, we executing a lot more DFSInputStream#seek calls to simply read through the datablock section of the HFile.
The goal of this change is to make any downstream using TableSnapshotInputFormat with STREAM scan.
Attachments
Issue Links
- fixes
-
HBASE-26630 TableSnapshotInputFormat terminates in the middle
- Resolved
- is related to
-
HBASE-26274 Create an option to reintroduce BlockCache to mapreduce job
- Resolved
- links to