Description
Create a helper method for computing the error at each iteration of boosting. This should be used post-hoc to compute the error efficiently on a new dataset.
E.g.:
def evaluateEachIteration(data: RDD[LabeledPoint], evaluator): Array[Double]
Notes:
- It should run in the same big-O time as predict() by keeping a running total (residual); see the sketch after this list.
- A different method name could be good.
- It could take an evaluator and/or could evaluate using the training metric by default.
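To make the running-total note concrete, here is a minimal sketch, not a final API: it assumes the ensemble is exposed as trees/treeWeights (as on GradientBoostedTreesModel) and takes a hypothetical per-point evaluator of type (prediction, label) => Double, returning the mean error after each iteration. Names and the caching strategy are illustrative assumptions.

{code}
import org.apache.spark.rdd.RDD
import org.apache.spark.mllib.regression.LabeledPoint
import org.apache.spark.mllib.tree.model.DecisionTreeModel

// Sketch only: computes the error after each boosting iteration by maintaining
// a running prediction per point, so iteration i costs one tree's predict()
// instead of re-evaluating the first i trees from scratch.
def evaluateEachIteration(
    data: RDD[LabeledPoint],
    trees: Array[DecisionTreeModel],
    treeWeights: Array[Double],
    evaluator: (Double, Double) => Double): Array[Double] = {
  val numIterations = trees.length
  val errors = new Array[Double](numIterations)
  // (point, running prediction); starts at 0.0 before any tree is applied.
  var running: RDD[(LabeledPoint, Double)] = data.map(lp => (lp, 0.0))
  var i = 0
  while (i < numIterations) {
    val tree = trees(i)
    val weight = treeWeights(i)
    val prev = running
    // Add only the i-th weighted tree to the running prediction.
    running = prev.map { case (lp, pred) =>
      (lp, pred + weight * tree.predict(lp.features))
    }.cache()
    // Report the mean per-point error at this iteration.
    errors(i) = running.map { case (lp, pred) => evaluator(pred, lp.label) }.mean()
    prev.unpersist()
    i += 1
  }
  errors
}
{code}

Caching each iteration's running predictions (and unpersisting the previous ones) keeps the lineage from being recomputed; whether to cache, checkpoint, or fold the loop into a single pass is an implementation choice left open here.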
Issue Links
- is related to SPARK-5972 Cache residuals for GradientBoostedTrees during training (Resolved)