  1. Spark
  2. SPARK-22001

ImputerModel can do withColumn for all input columns at one pass

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.2.0
    • Fix Version/s: 2.3.0
    • Component/s: SQL
    • Labels:
      None

      Description

      SPARK-21690 made Imputer one-pass by computing the surrogates for all input columns together. However, when we transform a dataset with ImputerModel, we still call withColumn on each input column sequentially. We can apply the replacement to all input columns in a single pass as well.
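
      The difference between the two approaches can be sketched with the public DataFrame API. This is a minimal illustration, not the actual patch: the object name, the helpers `imputed`, `sequential` and `onePass`, the column names, and the surrogate values are all hypothetical, and the one-pass variant assumes the output column names do not collide with existing columns.

```scala
import org.apache.spark.sql.{Column, DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, lit, when}

// Hypothetical sketch: names and values below are illustrative only.
object OnePassImputeSketch {

  // Replace nulls and the missing-value marker in column `c` with `surrogate`,
  // then cast back to the column's original type.
  def imputed(df: DataFrame, c: String, surrogate: Double, missing: Double): Column = {
    val ic = col(c)
    when(ic.isNull, lit(surrogate))
      .when(ic === lit(missing), lit(surrogate))
      .otherwise(ic)
      .cast(df.schema(c).dataType)
  }

  // Sequential: one withColumn call, and so one more projection, per input column.
  def sequential(df: DataFrame, cols: Seq[(String, String, Double)], missing: Double): DataFrame =
    cols.foldLeft(df) { case (acc, (in, out, s)) =>
      acc.withColumn(out, imputed(acc, in, s, missing))
    }

  // One pass: build every output column first, then add them in a single select.
  // Assumes the output names are new; select does not replace existing columns.
  def onePass(df: DataFrame, cols: Seq[(String, String, Double)], missing: Double): DataFrame = {
    val newCols = cols.map { case (in, out, s) => imputed(df, in, s, missing).as(out) }
    df.select(col("*") +: newCols: _*)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().master("local[1]").appName("sketch").getOrCreate()
    import spark.implicits._

    // None becomes a SQL null; -1.0 plays the role of the missing-value marker.
    val df = Seq[(Option[Double], Option[Double])](
      (Some(1.0), Some(-1.0)),
      (None, Some(4.0)),
      (Some(3.0), None)
    ).toDF("a", "b")

    // (input column, output column, surrogate); surrogates are made up here.
    val cols = Seq(("a", "a_out", 2.0), ("b", "b_out", 4.0))

    sequential(df, cols, -1.0).show()
    onePass(df, cols, -1.0).show()

    spark.stop()
  }
}
```

      Both variants should produce the same rows. The one-pass form adds a single projection to the plan regardless of how many input columns there are, which is the point of this improvement.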

            People

            • Assignee: Liang-Chi Hsieh (viirya)
            • Reporter: Liang-Chi Hsieh (viirya)
            • Votes: 0
            • Watchers: 2
