Apache Arrow / ARROW-10347

[Python][Dataset] Test behaviour in case of duplicate partition field / data column

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Python
    • Labels: None

    Description

      See the mailing list thread at https://www.mail-archive.com/user@arrow.apache.org/msg00680.html and my answer to it (the experiments are in https://nbviewer.jupyter.org/gist/jorisvandenbossche/9382de2eb96db5db2ef801f63a359082).
      It seems we already support the partition field also being present in the actual data files, but we should add explicit tests to pin down the expected behaviour.

          People

            Assignee: Unassigned
            Reporter: Joris Van den Bossche (jorisvandenbossche)