Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Description
Because ORC stripes are large, a stripe that straddles an HDFS block boundary has suboptimal read locality: part of the stripe may have to be read from a remote node. It would be good to add padding so that stripes do not straddle HDFS blocks.
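To make the proposal concrete, here is a minimal sketch of the padding arithmetic, not the actual Hive writer code. The function name `padding_before_stripe` and its parameters are hypothetical, and it assumes the writer knows the stripe size before deciding where the stripe starts.

```python
def padding_before_stripe(offset: int, stripe_size: int, block_size: int) -> int:
    """Return the number of pad bytes to write before a stripe.

    Hypothetical helper for illustration only. A stripe that would cross
    the next HDFS block boundary is pushed to the start of the next block.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    if offset < 0 or stripe_size < 0:
        raise ValueError("offset and stripe_size must be non-negative")

    # Bytes left in the block that contains the current write offset.
    remaining = block_size - (offset % block_size)

    # The stripe fits in the current block: no padding needed.
    if stripe_size <= remaining:
        return 0

    # A stripe larger than a whole block straddles no matter where it
    # starts, so padding would only waste space.
    if stripe_size > block_size:
        return 0

    # Otherwise pad out the rest of the block so the stripe starts on
    # the next block boundary.
    return remaining


MB = 1024 * 1024
# A 64 MB stripe at offset 200 MB in a file with 256 MB blocks
# would cross the boundary, so 56 MB of padding is written first.
pad = padding_before_stripe(200 * MB, 64 * MB, 256 * MB)
```

The trade-off in this sketch is wasted space: in the example above, 56 MB of the block is left unused in exchange for keeping the stripe on a single block.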
Attachments
Issue Links
- contains: HIVE-5024 Ensure that the requested blocksize for ORC files is a multiple of 512 bytes. (Resolved)
- is related to: HIVE-6326 Split generation in ORC may generate wrong split boundaries because of unaccounted padded bytes (Resolved)
- relates to: HIVE-4123 The RLE encoding for ORC can be improved (Closed)