Details
Description
Range comparisons don't work right for columns which start with Double.NaN as the first row (min == max == NaN).
1 < NaN is false.
1 > NaN is false.
File Version: 0.12 with ORC_135 Rows: 3 Compression: ZLIB Compression size: 32768 Type: struct<operation:int,originalTransaction:bigint,bucket:int,rowId:bigint,currentTransaction:bigint,row:struct<c:double>> Stripe Statistics: Stripe 1: Column 0: count: 3 hasNull: false Column 1: count: 3 hasNull: false bytesOnDisk: 5 min: 0 max: 0 sum: 0 Column 2: count: 3 hasNull: false bytesOnDisk: 5 min: 1 max: 1 sum: 3 Column 3: count: 3 hasNull: false bytesOnDisk: 8 min: 536870912 max: 536870912 sum: 1610612736 Column 4: count: 3 hasNull: false bytesOnDisk: 7 min: 0 max: 2 sum: 3 Column 5: count: 3 hasNull: false bytesOnDisk: 5 min: 1 max: 1 sum: 3 Column 6: count: 3 hasNull: false Column 7: count: 3 hasNull: false bytesOnDisk: 19 min: NaN max: NaN sum: NaN
Attachments
Issue Links
- is a parent of
-
ORC-683 PPD: Make Floating point NaN check more strict
- Closed
- is related to
-
PARQUET-1246 Ignore float/double statistics in case of NaN
- Resolved
-
ORC-636 [C++] PPD Floating point stats with NaN should be ignored
- Closed
- links to