Apache Arrow / ARROW-10208

[C++] String split kernels do not propagate nulls correctly on sliced input


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: C++

    Description

      I am not sure whether this is a problem with the test itself or valid behavior, but it came up while writing a test in https://github.com/apache/arrow/pull/8271.

      The following test fails:

      this->CheckUnary("split_pattern", R"(["foo bar", "foo", null])", list(this->type()),  //
                       R"([["foo", "bar"], ["foo"], null])", &options);
      

      with the following output

      Failed:
      Got: 
        [
          [
            [
              "foo",
              "bar"
            ]
          ],
          [
            [
              "foo"
            ],
            null
          ]
        ]
      Expected: 
        [
          [
            [
              "foo",
              "bar"
            ]
          ],
          [
            [
              "foo"
            ],
            null
          ]
        ]
      

      Although the printed outputs are identical, the arrays compare as unequal.
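The issue title points at null propagation on sliced input. As an illustration only, and not Arrow's actual implementation or a confirmed diagnosis of this failure, the following self-contained C++ sketch models how a kernel can lose a null when it reads a validity bitmap without applying the slice offset. Every name here (`SlicedValidity`, `GetBit`, `ValidityIgnoringOffset`, `ValidityWithOffset`) is hypothetical, and the bitmap layout (one bit per slot, least-significant bit first) is an assumption of the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical model of a sliced array's validity: a bitmap shared with the
// parent array, plus the slice's offset and length. Not Arrow's real types.
struct SlicedValidity {
  const std::vector<uint8_t>* bitmap;  // one bit per slot, least-significant bit first
  std::size_t offset;                  // first slot of the slice within the parent
  std::size_t length;                  // number of slots in the slice
};

// Returns true if slot i is valid (non-null) in the given bitmap.
inline bool GetBit(const std::vector<uint8_t>& bits, std::size_t i) {
  assert(i / 8 < bits.size());
  return (bits[i / 8] >> (i % 8)) & 1;
}

// Buggy variant: ignores the slice offset, so it reads the parent's first slots.
inline std::vector<bool> ValidityIgnoringOffset(const SlicedValidity& v) {
  std::vector<bool> out(v.length);
  for (std::size_t i = 0; i < v.length; ++i) out[i] = GetBit(*v.bitmap, i);
  return out;
}

// Correct variant: shifts every read by the slice offset.
inline std::vector<bool> ValidityWithOffset(const SlicedValidity& v) {
  std::vector<bool> out(v.length);
  for (std::size_t i = 0; i < v.length; ++i) {
    out[i] = GetBit(*v.bitmap, v.offset + i);
  }
  return out;
}
```

For the input `["foo bar", "foo", null]` the bitmap would be `0x03` in this model (slots 0 and 1 valid, slot 2 null). Slicing at offset 1 with length 2 gives `["foo", null]`: the offset-aware reader reports valid then null, while the offset-ignoring reader reports both slots as valid and so drops the null.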


            People

              Assignee: Antoine Pitrou (apitrou)
              Reporter: Maarten Breddels (maartenbreddels)


                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 40m