Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-2386

Change regex used for splitting words

Details

    Description

      Regex used in splitting words ([A-Za-z\']+) only works on latin input, change it to make it work on non-latin inputs.

      For example, see Java version:
      https://github.com/apache/beam/blob/367fcb28d544934797d25cb34d54136b2d7d6e99/examples/java/src/main/java/org/apache/beam/examples/common/ExampleUtils.java#L75

      Attachments

        Activity

          People

            Unassigned Unassigned
            altay Ahmet Altay
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: