[ARROW-5705] [Java] Optimize BaseValueVector#computeCombinedBufferSize logic - ASF JIRA

XML

Word

Printable

JSON

Now in BaseValueVector#computeCombinedBufferSize, it computes validity buffer size as follow:

roundUp8(getValidityBufferSizeFromCount(valueCount))

which can be be expanded to

(((valueCount + 7) >> 3 + 7) / 8) * 8

Seems there's no need to compute bufferSize first and expression above could be replaced with:

(valueCount + 63) / 64 * 8

In this way, performance of computeCombinedBufferSize would be improved. Performance test:

Before:
BaseValueVectorBenchmarks.testC_omputeCombinedBufferSize_ avgt 5 4083.180 ± 180.363 ns/op

After:

BaseValueVectorBenchmarks.testC_omputeCombinedBufferSize_ avgt 5 3808.635 ± 162.347 ns/op

links to

GitHub Pull Request #4671

Estimated:

Not Specified

Remaining:

Logged:

4h 20m