[ARROW-13224] [Python][Doc] Documentation missing for pyarrow.dataset.write_dataset - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 5.0.0
Component/s: Documentation, Python
Labels:
- pull-request-available

External issue URL:
https://github.com/apache/arrow/issues/28911

Description

I don't believe this is meant to be internal. pyarrow.parquet.write_to_dataset uses this (if use_legacy_dataset=False) but the parquet API doesn't expose the same features. A new example should also probably be added to the Tabular Datasets section of the docs explaining why write_dataset can take in a scanner (e.g. memory preserving, ability to write a dataset from flight or any record batch source, etc.)

Attachments

Issue Links

is duplicated by

ARROW-13207 [Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach

Closed

links to

GitHub Pull Request #10693

Activity

People

Assignee:: Weston Pace

Reporter:: Weston Pace

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 30/Jun/21 18:24

Updated:: 11/Jan/23 08:31

Resolved:: 20/Jul/21 13:38

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

4h 10m