Apache Arrow / ARROW-2459

pyarrow: Segfault with pyarrow.deserialize_pandas


    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10.0
    • Component/s: Python
    • Labels: None
    • Environment: OS X, Linux

      Description

      This follows up on https://github.com/apache/arrow/issues/1884, where I found that calling deserialize_pandas in the app.py script from the repo linked below causes the app.py process to segfault.

      I initially observed this on OS X, but have since confirmed that the behavior exists on Linux as well.

      Repo containing example: https://github.com/travisbrady/sanic-arrow 

      More generally, what is the right way to get a Java-based HTTP microservice to talk to a Python-based HTTP microservice using Arrow as the serialization format? The two services exchange DataFrame-like objects (pandas.DataFrame instances on the Python side) for real-time scoring with a few xgboost models implemented in Python.

            People

            • Assignee: Licht Takeuchi (Licht-T)
            • Reporter: Travis Brady (travisbrady)
            • Votes: 1
            • Watchers: 5
