[ARROW-13034] [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page - ASF JIRA

The chapter "Writing to Partitioned Datasets" still presents a solution based on "hdfs.connect". Since that function is documented as deprecated, the example should no longer recommend it.
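As a possible starting point for a replacement example, here is a minimal sketch that passes a "pyarrow.fs" filesystem object to "pq.write_to_dataset" instead of calling "hdfs.connect". The helper names (write_partitioned, connect_hdfs), the host, the port, and the paths are assumptions for illustration only and are not taken from the current doc page.

```python
def connect_hdfs(host, port=8020):
    """Return a pyarrow.fs.HadoopFileSystem for the given namenode.

    Hypothetical helper: host and port are placeholders, and a working
    Hadoop client installation (libhdfs) is assumed on the machine.
    """
    # Imported here so this sketch can be loaded without pyarrow installed.
    from pyarrow import fs

    return fs.HadoopFileSystem(host, port)


def write_partitioned(table, root_path, partition_cols, filesystem=None):
    """Write `table` as a partitioned Parquet dataset on `filesystem`.

    `filesystem` can be any pyarrow.fs.FileSystem, for example the object
    returned by connect_hdfs() or a pyarrow.fs.LocalFileSystem.
    """
    import pyarrow.parquet as pq

    pq.write_to_dataset(
        table,
        root_path=root_path,
        partition_cols=partition_cols,
        filesystem=filesystem,
    )
```

Usage would look roughly like write_partitioned(table, "/user/me/dataset", ["year"], filesystem=connect_hdfs("namenode", 8020)), where the namenode address and target path are placeholders.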
The chapter "Reading a Parquet File from Azure Blob storage" is based on an old version of the package "azure.storage.blob"; the current "azure-sdk-for-python" no longer has methods like get_blob_to_stream(). Would it be possible to update this part to use the current blob storage API, and also to add a section covering the same concept with Delta Lake (the principle is similar, but there are differences ...)?
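For the Azure part, an updated example might look like the sketch below. It assumes the azure-storage-blob v12 client, where a BlobClient exposes download_blob() and the returned downloader exposes readinto(). The helper names, the connection string, and the container and blob names are hypothetical placeholders.

```python
import io


def download_blob_to_buffer(blob_client):
    """Download a blob into an in-memory buffer.

    `blob_client` is expected to behave like azure.storage.blob.BlobClient
    (v12): download_blob() returns an object with a readinto(stream) method.
    """
    buffer = io.BytesIO()
    blob_client.download_blob().readinto(buffer)
    buffer.seek(0)  # rewind so a reader starts at the first byte
    return buffer


def read_parquet_blob(connection_string, container_name, blob_name):
    """Read a Parquet blob into a pyarrow Table (illustrative sketch)."""
    # Imported here so download_blob_to_buffer() stays usable on its own.
    from azure.storage.blob import BlobServiceClient
    import pyarrow.parquet as pq

    service = BlobServiceClient.from_connection_string(connection_string)
    blob_client = service.get_blob_client(
        container=container_name, blob=blob_name
    )
    return pq.read_table(download_blob_to_buffer(blob_client))
```

Buffering the whole blob in memory keeps the sketch short; for large files a filesystem-style interface would probably be a better fit.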

links to

GitHub Pull Request #10548

Estimated: Not Specified
Remaining:
Logged: 0.5h