Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-46837 String function support (parent)
  3. SPARK-49207

Fix SplitPart one-to-many case mapping (UTF8_LCASE)

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 4.0.0
    • 4.0.0
    • SQL

    Description

      Fix the following string expressions to handle one-to-many case mapping properly:

      • SplitPart
      • StringSplitSQL

       

      Examples of incorrect results (under UTF8_LCASE collation):

      SplitPart("Ai\u0307B", "İ", 2) // returns: "\u0307B" (incorrect), instead of: "B" (correct)
      SplitPart("AİB", "i\u0307", 1) // returns: "AİB", instead of: "A", "B" (correct)

       

      StringSplitSQL("Ai\u0307B", "İ") // returns: ["A", "\u0307B"] (incorrect), instead of: ["A", "B"] (correct)
      StringSplitSQL("AİB", "i\u0307") // returns: ["AİB"] (incorrect), instead of: ["A", "B"] (correct)

      Attachments

        Issue Links

          Activity

            People

              uros-db Uroš Bojanić
              uros-db Uroš Bojanić
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: