Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3693

[R] Invalid buffer for empty characters with null data

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.12.0
    • R

    Description

      I'm hitting this from data coming from Spark while retrieving this data frame:

       

       

      <database>   <name>    <temp>
      default      a         TRUE
                   b         TRUE
      

       

      Error:

      error: Failed to fetch data: invalid data in buffer 2
      

      The problem is that is possible to have a NULL character array with offsets set to 0, notice that in the example above Spark returns two batches, so effectively, we are trying to parse:

       

      <database>   <name>    <temp>
                   b         TRUE
      

       

      Where the data array is NULL and the offsets 0s for the <database> columns.

      Here is a snapshot while debugging StringArray_to_Vector:

      Attachments

        Activity

          People

            romainfrancois Romain Francois
            javierluraschi Javier Luraschi
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 4.5h
                4.5h

                Slack

                  Issue deployment