Spark / SPARK-23661

Implement treeAggregate on Dataset API

Details

    • Type: New Feature
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0
    • Fix Version/s: None
    • Component/s: SQL
    • Labels: None

    Description

      Many algorithms in MLlib have not yet migrated their internal computation from RDD to DataFrame. treeAggregate is one of the obstacles we need to address in order to complete the migration.

      This ticket proposes providing treeAggregate on the Dataset API. For now, this should be a private API used by the ML component.
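
      For readers unfamiliar with the operation, the following is a minimal, single-process Python sketch of the multi-level aggregation pattern that treeAggregate refers to. It is not Spark's implementation and not the API proposed in this ticket: the function name tree_aggregate, its parameters, and the fan-in heuristic are assumptions made purely for illustration. Partitions are modeled as plain lists.

```python
import math
from functools import reduce
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
U = TypeVar("U")


def tree_aggregate(
    partitions: Sequence[Sequence[T]],
    zero: U,
    seq_op: Callable[[U, T], U],
    comb_op: Callable[[U, U], U],
    depth: int = 2,
) -> U:
    """Toy model of a multi-level aggregation (hypothetical helper).

    `zero` is shared across partitions, so `seq_op` must not mutate it.
    """
    if depth < 1:
        raise ValueError("depth must be at least 1")

    # Step 1: fold every partition locally into one partial result.
    partials: List[U] = [reduce(seq_op, part, zero) for part in partitions]
    if not partials:
        return zero

    # Assumed heuristic: pick a fan-in so that roughly `depth` merge
    # rounds are enough to reach a single value.
    scale = max(math.ceil(len(partials) ** (1.0 / depth)), 2)

    # Step 2: merge partial results in groups of `scale`, level by level,
    # instead of sending all of them to one place at once.
    while len(partials) > 1:
        partials = [
            reduce(comb_op, partials[i:i + scale])
            for i in range(0, len(partials), scale)
        ]
    return partials[0]


# Usage: compute (sum, count) over four "partitions".
partitions = [[1, 2], [3, 4], [5], [6, 7, 8]]
seq_op = lambda acc, x: (acc[0] + x, acc[1] + 1)
comb_op = lambda a, b: (a[0] + b[0], a[1] + b[1])
result = tree_aggregate(partitions, (0, 0), seq_op, comb_op, depth=2)
print(result)  # (36, 8)
```

      The point of the intermediate merge levels is that no single step has to combine every partial result at once, which is the property ML workloads with large aggregation buffers tend to rely on.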

          People

            Assignee: Unassigned
            Reporter: L. C. Hsieh (viirya)
            Votes: 0
            Watchers: 5
