  1. Apache Arrow
  2. ARROW-15054

Attribute nbytes of slice will return the value corresponding to the whole structure


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • None
    • None
    • Component/s: Python
    • None

    Description

      For both tables and arrays, slicing and then calling `nbytes` returns the value for the whole structure rather than for the slice. I don't know whether this is a bug or the intended behaviour. If it is intended, I would argue that `nbytes` should return a result relative to the slice. Either way, is there any way to get the memory size of my slice?

       

      ```
      import pyarrow

      pyarrow.__version__
      # 6.0.0

      # One string column with three rows
      table = pyarrow.Table.from_arrays([pyarrow.array(['foo', 'bar', 'qux'], type=pyarrow.string())], names=["a"])

      table.num_rows
      # 3

      table.nbytes
      # 25

      table.slice(0, 2).num_rows
      # 2

      # Same value as the full table, although the slice has only 2 rows
      table.slice(0, 2).nbytes
      # 25
      ```
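
For reference, the reported 25 bytes is consistent with the Arrow layout of a null-free string column: an int32 offsets buffer with one more entry than there are rows (4 × 4 = 16 bytes) plus the UTF-8 data (9 bytes). The sketch below is plain Python and does not call pyarrow. `string_array_nbytes` is a hypothetical helper, and it assumes int32 offsets, no validity bitmap, and no buffer padding. It only estimates what a slice-relative value could look like.

```python
# Sketch only: estimates sizes from the Arrow string layout without calling
# pyarrow. `string_array_nbytes` is a hypothetical helper, not a pyarrow API.

def string_array_nbytes(values, offset=0, length=None):
    """Bytes that a slice of a null-free string array would need on its own.

    Assumptions: int32 offsets (4 bytes each, one more than the number of
    rows), UTF-8 data bytes, no validity bitmap, no buffer padding.
    """
    if length is None:
        length = len(values) - offset
    window = values[offset:offset + length]
    offsets_bytes = 4 * (len(window) + 1)
    data_bytes = sum(len(v.encode("utf-8")) for v in window)
    return offsets_bytes + data_bytes


values = ["foo", "bar", "qux"]
full_size = string_array_nbytes(values)         # 16 offset bytes + 9 data bytes
slice_size = string_array_nbytes(values, 0, 2)  # 12 offset bytes + 6 data bytes
print(full_size, slice_size)
```

Under these assumptions the full column comes to 25 bytes and the two-row slice to 18. A slice shares its buffers with the parent, so the memory actually held stays at the full size. The smaller figure is what an independent copy of the slice would need.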

          People

            Assignee: Unassigned
            Reporter: Adrien Hoarau (Adrien_)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved: