Details
Sub-task
Status: Closed
Major
Resolution: Fixed
0.13.0
None
Description
Currently RecordUpdater.update and delete take the row ID and original transaction as parameters. These values are already present in the row as part of the new ROW__ID virtual column added in HIVE-7513, so the writer can read them from there. The writer already has to skip ROW__ID when writing, so it needs to be aware of that column anyway.
We could instead read the values from ROW__ID in FileSinkOperator and then remove the column from the object inspector there, but this would be hard in the vectorization case, where rows are being dealt with 10k at a time.
For these reasons it makes more sense to do this work in the writer.
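A minimal sketch of the proposed shape, for illustration only. Every type here (RowIdSketch, RowId, Row, SketchUpdater) is a hypothetical stand-in and not Hive's actual RecordUpdater, object inspector, or ROW__ID struct. It assumes ROW__ID carries the original transaction and the row ID, as described above. The point is that update and delete take only the current transaction and the row, and the writer pulls the identifiers out of the row itself while leaving ROW__ID out of the written user columns.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical, simplified stand-ins; none of these are Hive's real classes.
class RowIdSketch {

    /** Stand-in for the value of the ROW__ID virtual column. */
    static final class RowId {
        final long originalTransaction;
        final long rowId;

        RowId(long originalTransaction, long rowId) {
            this.originalTransaction = originalTransaction;
            this.rowId = rowId;
        }
    }

    /** A row that carries ROW__ID alongside its user columns. */
    static final class Row {
        final RowId rowIdColumn;
        final Object[] userColumns;

        Row(RowId rowIdColumn, Object[] userColumns) {
            this.rowIdColumn = rowIdColumn;
            this.userColumns = userColumns;
        }
    }

    /** Writer-side updater: identifiers come from the row, not from parameters. */
    static final class SketchUpdater {
        // Records what would be written, so the behavior can be inspected.
        final List<String> log = new ArrayList<>();

        void update(long currentTransaction, Row row) {
            RowId id = row.rowIdColumn; // read from the row itself
            // Only the user columns are written; ROW__ID is skipped.
            log.add("UPDATE cur=" + currentTransaction
                    + " orig=" + id.originalTransaction
                    + " rowId=" + id.rowId
                    + " cols=" + Arrays.toString(row.userColumns));
        }

        void delete(long currentTransaction, Row row) {
            RowId id = row.rowIdColumn;
            // A delete needs only the identifiers, not the user columns.
            log.add("DELETE cur=" + currentTransaction
                    + " orig=" + id.originalTransaction
                    + " rowId=" + id.rowId);
        }
    }

    public static void main(String[] args) {
        SketchUpdater updater = new SketchUpdater();
        Row row = new Row(new RowId(3L, 7L), new Object[] {"a", 1});
        updater.update(5L, row);
        updater.delete(6L, row);
        for (String entry : updater.log) {
            System.out.println(entry);
        }
    }
}
```

Because the caller no longer extracts the identifiers, a vectorized caller could hand rows to the writer without first unpacking ROW__ID for each row in the batch.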