Apache Arrow / ARROW-1996

[Python] pyarrow.read_serialized cannot read concatenated records

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8.0
    • Fix Version/s: 0.11.0
    • Component/s: Python
    • Environment:
      Linux

      Description

      The following code

      import pyarrow as pa

      f = pa.OSFile('arrow_test', 'w')
      pa.serialize_to(12, f)
      pa.serialize_to(23, f)
      f.close()

      f = pa.OSFile('arrow_test', 'r')
      print('First: {}'.format(pa.read_serialized(f).deserialize()))
      print('Second: {}'.format(pa.read_serialized(f).deserialize()))
      f.close()

      gives the following result:

      $ python pyarrow_test.py
      First: 12
      Traceback (most recent call last):
        File "pyarrow_test.py", line 10, in <module>
          print('Second: {}'.format(pa.read_serialized(f).deserialize()))
        File "pyarrow/serialization.pxi", line 347, in pyarrow.lib.read_serialized (/arrow/python/build/temp.linux-x86_64-2.7/lib.cxx:79159)
        File "pyarrow/error.pxi", line 77, in pyarrow.lib.check_status (/arrow/python/build/temp.linux-x86_64-2.7/lib.cxx:8270)
      pyarrow.lib.ArrowInvalid: Expected schema message in stream, was null or length 0

      I would have expected the second call to read_serialized to successfully read the second value (23).
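
      The expected behavior can be illustrated without pyarrow: each read should consume exactly one record and leave the stream positioned at the start of the next. The sketch below uses only the standard library, with pickle standing in for the serializer and a simple length prefix marking record boundaries. The helpers write_record and read_record are hypothetical names for this illustration; this is not pyarrow's wire format, nor how the fix for this issue was implemented.

```python
import io
import pickle
import struct

# Hypothetical helpers for illustration only; not part of pyarrow.
_HEADER = struct.Struct('<Q')  # 8-byte little-endian payload length


def write_record(value, stream):
    """Append one length-prefixed record to a binary stream."""
    payload = pickle.dumps(value)
    stream.write(_HEADER.pack(len(payload)))
    stream.write(payload)


def read_record(stream):
    """Read exactly one record, leaving the stream at the next one."""
    header = stream.read(_HEADER.size)
    if len(header) < _HEADER.size:
        raise EOFError('no more records in stream')
    (length,) = _HEADER.unpack(header)
    payload = stream.read(length)
    if len(payload) < length:
        raise EOFError('truncated record')
    return pickle.loads(payload)


# Write two records back to back, then read them in order.
buf = io.BytesIO()
write_record(12, buf)
write_record(23, buf)
buf.seek(0)

first = read_record(buf)
second = read_record(buf)
print('First: {}'.format(first))
print('Second: {}'.format(second))
```

      Because every record carries its own length, a reader never has to guess where one record ends and the next begins, which is the property the reproduction above relies on.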

              People

              • Assignee: Wes McKinney (wesmckinn)
              • Reporter: Richard Shin (rshin)
              • Votes: 0
              • Watchers: 4

                  Time Tracking

                  • Original Estimate: Not Specified
                  • Remaining Estimate: 0h
                  • Time Spent: 50m