Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Won't Fix
    • Affects Version/s: 2.2.1
    • Fix Version/s: None
    • Component/s: PySpark
    • Labels:

      Description

      A new function proposed for RDDs: apply

      >>> def foo(rdd):
      ...     return rdd.map(lambda x: x.split('|')).filter(lambda x: x[0] == 'ERROR')
      >>> rdd = sc.parallelize(['ERROR|10', 'ERROR|12', 'WARNING|10', 'INFO|2'])
      >>> result = rdd.apply(foo)
      >>> result.collect()
      [['ERROR', '10'], ['ERROR', '12']]
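
      The proposal boils down to rdd.apply(f) returning f(rdd), so a user-defined
      multi-step transformation can be slotted into a method chain. The sketch below
      illustrates that semantics without PySpark: MiniRDD is a hypothetical list-backed
      stand-in, not the real RDD class. Note that since str.split returns lists,
      collect() yields lists of fields rather than tuples.

      ```python
      # Hypothetical stand-in for an RDD, backed by a plain list,
      # used only to illustrate the proposed apply() semantics.
      class MiniRDD:
          def __init__(self, data):
              self.data = list(data)

          def map(self, f):
              return MiniRDD(f(x) for x in self.data)

          def filter(self, pred):
              return MiniRDD(x for x in self.data if pred(x))

          def apply(self, f):
              # The proposed method: hand the whole RDD to a user function
              # that returns a new RDD, enabling fluent chaining.
              return f(self)

          def collect(self):
              return self.data


      def foo(rdd):
          return rdd.map(lambda x: x.split('|')).filter(lambda x: x[0] == 'ERROR')


      rdd = MiniRDD(['ERROR|10', 'ERROR|12', 'WARNING|10', 'INFO|2'])
      result = rdd.apply(foo)
      print(result.collect())  # [['ERROR', '10'], ['ERROR', '12']]
      ```

      The same effect is already achievable today by writing foo(rdd) directly; the
      method form only changes where the call sits in a chain of transformations.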


    People

    • Assignee: Unassigned
    • Reporter: Gianmarco Donetti
    • Votes: 0
    • Watchers: 4


    Time Tracking

    • Estimated: 1h
    • Remaining: 1h
    • Logged: Not Specified