Apache Arrow / ARROW-6281

[Python] Produce chunked arrays for nested types in pyarrow.array


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.0.0
    • Component/s: Python
    • Labels: None

    Description

      As a follow-up to ARROW-5028 and other issues, consider a case like

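      import pyarrow as pa
      # (2 << 20) + 1 one-element lists of a 1024-byte string: just over 2 GiB of string data
      vals = [['x' * 1024]] * ((2 << 20) + 1)
      arr = pa.array(vals)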

      The child array of the ListArray cannot hold all of the string data, because a string array uses 32-bit offsets and so can address at most 2^31 - 1 bytes of character data. After the patch for ARROW-5028, an exception is raised rather than a malformed array being returned. We could (with some effort) instead produce a chunked array of list<string> type.
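
      The chunking idea can be sketched in plain Python. This is only an illustration under stated assumptions, not pyarrow's implementation: the helper name chunk_boundaries and the INT32_MAX constant are hypothetical, and the sketch only computes where row ranges would have to be split so that each chunk's string data stays within the 32-bit offset limit.

```python
# Hypothetical sketch, not pyarrow's implementation: find row ranges whose
# child string data fits within the 32-bit offset limit of a string array.
INT32_MAX = 2**31 - 1  # largest byte offset a 32-bit offsets buffer can store


def chunk_boundaries(sizes, limit=INT32_MAX):
    """Return (start, stop) row ranges so each range's total bytes <= limit.

    `sizes` is an iterable giving the number of string bytes in each row.
    """
    boundaries = []
    start = 0      # first row of the chunk being built
    running = 0    # string bytes accumulated in the current chunk
    n = 0          # number of rows seen so far
    for i, size in enumerate(sizes):
        if size > limit:
            raise ValueError(f"row {i} alone exceeds the limit")
        if running + size > limit:
            # Close the current chunk before this row and start a new one.
            boundaries.append((start, i))
            start, running = i, 0
        running += size
        n = i + 1
    if n > start:
        boundaries.append((start, n))
    return boundaries


# The case from the description: (2 << 20) + 1 rows of 1024 bytes each.
n_rows = (2 << 20) + 1
total_bytes = n_rows * 1024
print(total_bytes > INT32_MAX)  # the data does not fit in one string array
print(chunk_boundaries(1024 for _ in range(n_rows)))
```

      With pyarrow, each resulting row range could presumably be converted separately with pa.array and the pieces combined with pa.chunked_array; whether the actual fix works this way is not stated in this issue.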


            People

              Assignee: Krisztian Szucs (kszucs)
              Reporter: Wes McKinney (wesm)

              Dates

                Created:
                Updated:
                Resolved: