Apache MADlib / MADLIB-984

Path - refactor regex pattern match into cpp

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: v1.9.1
    • Component/s: Module: Utilities
    • Labels: None

    Description

      The UDF to be implemented will obtain the matching indices in sym_str, which are in turn used to filter {match_to_row_id} for matched rows.

      This implementation pipelines operations directly in {matched_partitions} and removes the need to create and join two extra tables/views, {length_of_matches} and {length_of_between_matches}.

      Boost.Regex can be helpful for implementing the proposed UDF:
      http://www.boost.org/doc/libs/1_60_0/libs/regex/doc/html/boost_regex/ref/regex_match.html
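
      As a rough illustration of "obtaining the matching indices", the sketch below collects the start offset and length of every non-overlapping match in a symbol string. It is not the MADlib implementation: the function name find_match_spans and its return type are hypothetical, and it uses std::regex from the C++ standard library in place of Boost.Regex so that it compiles without extra dependencies.

```cpp
#include <cstddef>
#include <regex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper, not MADlib's actual UDF.
// Returns one (start offset, length) pair per non-overlapping match of
// `pattern` in `sym_str`, scanning left to right.
// Assumption: std::regex (default ECMAScript grammar) stands in for Boost.Regex.
std::vector<std::pair<std::size_t, std::size_t>>
find_match_spans(const std::string& sym_str, const std::string& pattern) {
    const std::regex re(pattern);
    std::vector<std::pair<std::size_t, std::size_t>> spans;

    // A default-constructed sregex_iterator acts as the end-of-sequence marker.
    const std::sregex_iterator end;
    for (std::sregex_iterator it(sym_str.begin(), sym_str.end(), re);
         it != end; ++it) {
        spans.emplace_back(static_cast<std::size_t>(it->position(0)),
                           static_cast<std::size_t>(it->length(0)));
    }
    return spans;
}
```

      With spans in hand, the length of each match is the second element of a pair, and the gap between two consecutive matches is the next start minus the previous start plus length. That is one way the quantities suggested by the names {length_of_matches} and {length_of_between_matches} could be derived in a single pass rather than from separate tables.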

          People

            xctang Xiaocheng Tang
            fmcquillan Frank McQuillan