Spark / SPARK-8829 Improve expression performance / SPARK-8830

levenshtein directly on top of UTF8String

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5.0
    • Component/s: SQL
    • Labels: None

    Description

      We currently rely on the Levenshtein implementation in commons-lang. Ideally, we should have our own implementation in order to:

      1. Reduce the external dependency.
      2. Work directly against UTF8String, so we don't need to convert to and from java.lang.String.
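
      To illustrate the second point, here is a minimal sketch of a Levenshtein distance computed directly over UTF-8 encoded bytes, without building a java.lang.String. It is not the implementation that was merged for this issue. The class name Utf8Levenshtein and its methods are hypothetical, a plain byte[] stands in for the bytes backing a UTF8String, and the input is assumed to be well-formed UTF-8.

```java
// Hypothetical sketch: edit distance over UTF-8 bytes, counted in code points.
final class Utf8Levenshtein {
    private Utf8Levenshtein() {}

    // Length in bytes of the UTF-8 sequence whose lead byte is b
    // (assumes well-formed input; no validation is performed).
    private static int seqLen(byte b) {
        int v = b & 0xFF;
        if (v < 0x80) return 1;
        if (v < 0xE0) return 2;
        if (v < 0xF0) return 3;
        return 4;
    }

    // Start offset of every code point, plus a final entry equal to bytes.length.
    private static int[] offsets(byte[] bytes) {
        int n = 0;
        for (int i = 0; i < bytes.length; i += seqLen(bytes[i])) n++;
        int[] off = new int[n + 1];
        int k = 0;
        for (int i = 0; i < bytes.length; i += seqLen(bytes[i])) off[k++] = i;
        off[n] = bytes.length;
        return off;
    }

    // Two code points are equal iff their encoded byte ranges are identical.
    private static boolean sameCodePoint(byte[] a, int aStart, int aEnd,
                                         byte[] b, int bStart, int bEnd) {
        int len = aEnd - aStart;
        if (len != bEnd - bStart) return false;
        for (int i = 0; i < len; i++) {
            if (a[aStart + i] != b[bStart + i]) return false;
        }
        return true;
    }

    // Classic two-row dynamic-programming Levenshtein distance.
    static int distance(byte[] s, byte[] t) {
        int[] so = offsets(s);
        int[] to = offsets(t);
        int n = so.length - 1; // code points in s
        int m = to.length - 1; // code points in t
        if (n == 0) return m;
        if (m == 0) return n;

        int[] prev = new int[m + 1];
        int[] curr = new int[m + 1];
        for (int j = 0; j <= m; j++) prev[j] = j;

        for (int i = 1; i <= n; i++) {
            curr[0] = i;
            for (int j = 1; j <= m; j++) {
                int cost = sameCodePoint(s, so[i - 1], so[i], t, to[j - 1], to[j]) ? 0 : 1;
                int best = Math.min(curr[j - 1] + 1, prev[j] + 1); // insert or delete
                curr[j] = Math.min(best, prev[j - 1] + cost);      // substitute or match
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[m];
    }
}
```

      The sketch compares whole code points rather than individual bytes, so a substitution involving a multi-byte character counts as one edit. The only extra memory is the two offset arrays and the two DP rows; no String is created.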

          People

            Assignee: Tarek Auel
            Reporter: Reynold Xin