[SPARK-1109] wrong API docs for pyspark map function - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 0.9.0
Fix Version/s: 0.9.1, 1.0.0
Component/s: PySpark
Labels:
None

Description

The source code/API docs for the pyspark RDD map function says:

def map(self, f, preservesPartitioning=False):
"""
Return a new RDD containing the distinct elements in this RDD.
"""
def func(split, iterator): return imap(f, iterator)
return PipelinedRDD(self, func, preservesPartitioning)

I think that was incorrectly cut-and-pasted from the distinct() function, and should actually say "Return a new RDD by applying a function to each element of this RDD."

Attachments

Issue Links

links to

[Github] Pull Request #73 (ScrapCodes)

Activity

People

Assignee:: Prashant Sharma

Reporter:: Diana Carroll

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 19/Feb/14 09:48

Updated:: 07/Feb/20 17:26

Resolved:: 04/Mar/14 15:34