Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-14686

[Python] Current endian decision for numpy does not work for built-in data type.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 7.0.0
    • 7.0.0
    • Python

    Description

      Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have native | byte order.

      This issue causes test failure as follows:

      FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
      FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
      FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) 

       

      Attachments

        Activity

          People

            kiszk Kazuaki Ishizaki
            kiszk Kazuaki Ishizaki
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 3h
                3h