Details
-
Documentation
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
1.5.2, 1.6.3, 2.0.2, 2.1.0
Description
The OneHotEncoder._init_ doc string in PySpark has an input keyword listed as includeFirst, whereas the code actually uses dropLast.
This especially confusing because the _init_ function accepts only keywords, and following the documentation on the web (https://spark.apache.org/docs/2.0.1/api/python/pyspark.ml.html#pyspark.ml.feature.OneHotEncoder) or of help in Python will result in the error:
TypeError: _init_() got an unexpected keyword argument 'includeFirst'
The error is immediately viewable in the source code:
@keyword_only def __init__(self, dropLast=True, inputCol=None, outputCol=None): """ __init__(self, includeFirst=True, inputCol=None, outputCol=None) """