SPARK-11215: Add multiple columns support to StringIndexer

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.4.0
    • Fix Version/s: 3.0.0
    • Component/s: ML
    • Labels:
      None
    • Docs Text:
      When frequencyDesc or frequencyAsc is specified as the stringOrderType param in StringIndexer, the order of strings with equal frequency was previously undefined. Since Spark 3.0, strings with equal frequency are further sorted lexicographically.
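The tie-breaking rule in the Docs Text can be illustrated with a small plain-Python sketch. This is not Spark's implementation: the function names `frequency_desc_labels` and `frequency_asc_labels` are hypothetical, and the sketch assumes that ties are broken in ascending lexicographic order for both order types and that Python's string comparison is an acceptable stand-in for Spark's.

```python
from collections import Counter


def frequency_desc_labels(values):
    """Distinct strings, most frequent first; ties broken lexicographically (assumed ascending)."""
    counts = Counter(values)
    # Sort key: negated count puts frequent strings first, then the string itself breaks ties.
    return sorted(counts, key=lambda s: (-counts[s], s))


def frequency_asc_labels(values):
    """Distinct strings, least frequent first; ties broken lexicographically (assumed ascending)."""
    counts = Counter(values)
    return sorted(counts, key=lambda s: (counts[s], s))


# "a" and "b" both occur twice; "c" and "d" both occur once.
data = ["b", "a", "c", "a", "b", "d"]
desc_labels = frequency_desc_labels(data)
asc_labels = frequency_asc_labels(data)
print(desc_labels)
print(asc_labels)
```

With a deterministic tie-break, the position of a string in the label list (and therefore its index) no longer depends on the order in which rows happen to be processed.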

      Description

      Add multiple columns support to StringIndexer so that users can transform multiple input columns to multiple output columns simultaneously. See the discussion in SPARK-8418.
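To make the intended behavior concrete, here is a minimal plain-Python sketch of indexing several string columns at once. It only models the semantics (one independent label list per input column, one output column per input column) and is not Spark code: `fit_string_indexers`, `transform`, and the sample column names are hypothetical, and the ordering assumes descending frequency with lexicographic tie-breaking.

```python
from collections import Counter


def fit_string_indexers(rows, input_cols):
    """Build one label list per input column; each column is indexed independently."""
    labels = {}
    for col in input_cols:
        counts = Counter(row[col] for row in rows)
        # Most frequent string gets index 0; ties are broken lexicographically.
        labels[col] = sorted(counts, key=lambda s: (-counts[s], s))
    return labels


def transform(rows, labels, input_cols, output_cols):
    """Add one index column per input column, pairing input_cols[i] with output_cols[i]."""
    lookup = {
        col: {s: float(i) for i, s in enumerate(labels[col])}
        for col in input_cols
    }
    out = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left untouched
        for in_col, out_col in zip(input_cols, output_cols):
            new_row[out_col] = lookup[in_col][row[in_col]]
        out.append(new_row)
    return out


rows = [
    {"color": "red", "size": "L"},
    {"color": "blue", "size": "S"},
    {"color": "red", "size": "S"},
]
labels = fit_string_indexers(rows, ["color", "size"])
indexed = transform(rows, labels, ["color", "size"], ["color_idx", "size_idx"])
for row in indexed:
    print(row)
```

In Spark 3.0 the corresponding StringIndexer params are inputCols and outputCols; the sketch above only mirrors the pairing of input and output columns, not the distributed execution.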

              People

              • Assignee: L. C. Hsieh (viirya)
              • Reporter: Yanbo Liang (yanboliang)
              • Votes: 6
              • Watchers: 14

                Dates

                • Created:
                  Updated:
                  Resolved: