Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2264

[Python] Efficiently serialize numpy arrays with dtype of unicode fixed length string

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.8.0
    • 0.10.0
    • Python
    • None

    Description

      Looking at the numpy array serialization code it seems that if I have a dtype like "<U3" this will go through custom ndarray serializer and not through an efficient one.

      Example:>>> np.array(['aaa', 'bbb'])
      array(['aaa', 'bbb'], dtype='<U3')

      This should be able to work, no? It has fixed offsets and memory layout.

      Attachments

        Activity

          People

            robertnishihara Robert Nishihara
            mitar Mitar
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: