The sink process() keep tracks of the buckets opened during the transaction. At the end of transaction, we need to flush all the buckets that has pending data. This is required in order to ensure that the data removed from channel should be safely in HDFS during commit.
Currently the files are tracked only when they are created and also getting closed during the cleanup instead of flush.
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Fix Version/s||v1.2.0 [ 12320243 ]|
|Resolution||Fixed [ 1 ]|
|Field||Original Value||New Value|
|Summary||HDFS file roller creates causes the first file to roll too soon||HDFS rolls the first file incorrectly|