Details
- Bug
- Status: Resolved
- Major
- Resolution: Fixed
- 8.0.0
Description
Currently, when write_parquet() is given a path with a .gz extension, arrow silently writes the file through a CompressedOutputStream instead of passing the compression option to the Parquet writer itself. The internal detect_compression() function infers gzip from the file extension, and that inference is what routes the write incorrectly. The write itself succeeds without any warning; the failure only surfaces at the read_parquet() stage, which can be confusing.
library(arrow, warn.conflicts = FALSE)
tf <- tempfile(fileext = ".parquet.gz")
write_parquet(data.frame(x = 1:5), tf, compression = "gzip", compression_level = 5)
read_parquet(tf)
#> Error: file must be a "RandomAccessFile"
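One way to confirm which container actually ended up on disk is to inspect the file's leading magic bytes. Parquet files begin with the ASCII bytes PAR1, while gzip streams begin with 0x1f 0x8b. The sketch below is a diagnostic under those assumptions only: sniff_container() is a hypothetical helper, not part of arrow, and the demonstration files are placeholders built with base R rather than real Parquet data.

```r
# Hypothetical helper (not part of arrow): classify a file by its leading bytes.
sniff_container <- function(path) {
  con <- file(path, open = "rb")   # binary mode, so bytes are read as stored
  on.exit(close(con))
  magic <- readBin(con, what = "raw", n = 4L)

  if (length(magic) >= 4L && identical(magic, charToRaw("PAR1"))) {
    "parquet"                      # Parquet magic number at the start of the file
  } else if (length(magic) >= 2L &&
             identical(magic[1:2], as.raw(c(0x1f, 0x8b)))) {
    "gzip"                         # gzip stream header
  } else {
    "unknown"
  }
}

# Placeholder payload: only the first four bytes matter for this check.
payload <- charToRaw("PAR1 placeholder payload")

# A file whose whole byte stream is wrapped in gzip.
gz_path <- tempfile(fileext = ".parquet.gz")
gz_con <- gzfile(gz_path, open = "wb")
writeBin(payload, gz_con)
close(gz_con)

# A file that starts directly with the Parquet magic number.
plain_path <- tempfile(fileext = ".parquet")
writeBin(payload, plain_path)

print(sniff_container(gz_path))
print(sniff_container(plain_path))
```

Applied to tf from the reproduction above on an affected version, a result of "gzip" rather than "parquet" would be consistent with the description: the whole file was wrapped by the compressed output stream, so the Parquet reader cannot open it as a random-access file.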
Issue Links
- fixes: ARROW-16619 [R] Support compression + R connection (URL with .gz file) (Resolved)