Looks like we are doing file listing several times when creating one instance of VectorizedOrcAcidRowBatchReader
AcidUtils.parseBaseOrDeltaBucketFilename() does full file listing (when there are files with bucket_* prefix) just to get a single file out of a path to figure out if it has ACID schema (as part of
There is full file listing where we populate
3) Twice in computeOffsetAndBucket()
Attaching profiles which Gopal Vijayaraghavan took while debugging.