Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-1410

regex functions no longer accept shorthand character classes

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Impala 2.0
    • Fix Version/s: Impala 2.0.1
    • Component/s: None
    • Labels:
      None

      Description

      From user list:

      We recently upgraded to CDH 5.2 which includes Impala 2.0 and it appears that regexp_extract and regexp_replace functions no longer work when using shorthand character classes. Posix bracket character classes still work.

      For example:

      [slave-4.hadoop.lan:21000] > select regexp_extract('abcdef123ghi456jkl','.*(\\d+)',0);
      Query: select regexp_extract('abcdef123ghi456jkl','.*(\\d+)',0)
      WARNINGS: Could not compile regexp pattern: .*(\d+)
      Error: invalid escape sequence: \d
      
      Could not compile regexp pattern: .*(\d+)
      Error: invalid escape sequence: \d
      

        Attachments

          Activity

            People

            • Assignee:
              dhecht Dan Hecht
              Reporter:
              skye Skye Wanderman-Milne
            • Votes:
              1 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: