Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
1.9.0
-
None
Description
ParquetFileReader opens a SeekableInputStream to read a footer. In the process, it opens a new FSDataInputStream and wraps it. However, H2SeekableInputStream does not override the close method. Therefore, when ParquetFileReader closes it, the underlying FSDataInputStream is not closed. As a result, these stale connections can exhaust a clusters' data nodes' connection resources and lead to mysterious HDFS read failures in HDFS clients, e.g.
org.apache.hadoop.hdfs.BlockMissingException: Could not obtain block: BP-905337612-172.16.70.103-1444328960665:blk_1720536852_646811517
Attachments
Issue Links
- blocks
-
PARQUET-1027 release Parquet-mr 1.9.1
- Open
- links to