Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-11762

[C++][Dataset] Refactor Partitioning to explicitly treat null and absent fields identically

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0
    • Fix Version/s: 5.0.0
    • Component/s: C++
    • Labels:
      None

      Description

      ARROW-10438 adds support for partition expressions with explicit absence of a partition key by including an is_null(field_ref("absent key field name")) in the conjunction. Whenever possible, this should be preferred to an equivalent conjunction which simply omits an equality expression for the missing field.

      Additionally since an absent partition key and a null partition key is semantically equivalent to a null valued partition key, we should ensure there is no difference in behavior. Currently, equal(field_ref("a"), literal(0)) and and_(equal(field_ref("a"), literal(0)), is_null("b")) are formatted differently

        Attachments

          Activity

            People

            • Assignee:
              westonpace Weston Pace
              Reporter:
              bkietz Ben Kietzman
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: