Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2594

[Java] Vector reallocation does not properly clear reused buffers

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10.0
    • Component/s: Java

      Description

      When reallocating a vector buffer, it assumes that the first half of the new buffer was clean or populated from the previous and only zeros out the second half.  This is not the case if the vector has released the buffer and the current capacity is 0 (empty).  If the new buffer has values set, then they will cause bogus values when used in the vector.

      I came across this when looking into SPARK-23030, due to the comment here https://github.com/apache/spark/pull/21312#issuecomment-389035697

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                bryanc Bryan Cutler
                Reporter:
                bryanc Bryan Cutler
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 2h 10m
                  2h 10m