Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2459

pyarrow: Segfault with pyarrow.deserialize_pandas

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.10.0
    • Python
    • None
    • OS X, Linux

    Description

      Following up from https://github.com/apache/arrow/issues/1884 wherein I found that calling deserialize_pandas in the linked app.py script in the repo linked below causes the app.py process to segfault.

      I initially observed this on OS X, but have since confirmed that the behavior exists on Linux as well.

      Repo containing example: https://github.com/travisbrady/sanic-arrow 

      And more generally: what is the right way to get a Java-based HTTP microservice to talk to a Python-based HTTP microservice using Arrow as the serialization format? I'm exchanging DataFrame type objects (they are pandas.DataFrame's on the Python side) between the two services for real-time scoring in a few xgboost models implemented in Python.

      Attachments

        Activity

          People

            Licht-T Licht Takeuchi
            travisbrady Travis Brady
            Votes:
            1 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: