[SPARK-31169] Random Forest in SparkML 2.3.3 vs 2.4.x - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Question
Status: Resolved
Priority: Major
Resolution: Invalid
Affects Version/s: 2.3.3, 2.4.0, 2.4.3
Fix Version/s: None
Component/s: ML
Labels:

Description

Hi all,

When I trained the model with the Random Forest algorithm, I got different results in different versions of spark, the same input, label ratio, hyperparameter for all training. Detailed training results in the attached file. Model training results with spark 2.3.3 are much better, so I want to ask if there have been any changes to the random forest (or other algorithms) in mllib?

Many thanks.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

spark233.jpg
17/Mar/20 01:25
47 kB
Nguyen Nhanduc
spark240.jpg
17/Mar/20 01:25
48 kB
Nguyen Nhanduc
spark243.jpg
17/Mar/20 01:28
48 kB
Nguyen Nhanduc

Activity

People

Assignee:: Unassigned

Reporter:: Nguyen Nhanduc

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 17/Mar/20 01:24

Updated:: 12/Dec/22 18:10

Resolved:: 23/Mar/20 05:08