Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-15808

[Python] take function doesn't work when table has large row counts

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • None
    • Python
    • None
    • pyarrow 7.0.0, pyarrow 5.0

    Description

      Steps to reproduce the error:

       

      import pyarrow as pa
      a = pa.array(["123456789"]*260000000)
      tb = pa.table([a], names=["a"])
      tb.take([1])
      

       

      ERROR message:
      ArrowInvalid: offset overflow while concatenating arrays

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rewreu Zhe Wu
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: