[SPARK-24946] PySpark - Allow np.Arrays and pd.Series in df.approxQuantile - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Incomplete
Affects Version/s: 2.3.1
Fix Version/s: None
Component/s: PySpark
Labels:

Description

As Python user it is convenient to pass a numpy array or pandas series `approxQuantile(col, probabilities, relativeError)` for the probabilities parameter.

Especially for creating cumulative plots (say in 1% steps) it is handy to use `approxQuantile(col, np.arange(0, 1.0, 0.01), relativeError)`.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Paul Westenthanner

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 27/Jul/18 14:06

Updated:: 12/Dec/22 18:11

Resolved:: 08/Oct/19 05:44