Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Duplicate
- Affects Version/s: 9.0.0
- None
- None
- Environment: Ubuntu 22.04
Description
When I copy a large Feather file, open a dataset from the copy, and immediately write that dataset to a new location, I get the following error:
```
Error: Invalid: Expected to read 144 metadata bytes but got 0
```
I have made a reproducible example below:
``` r
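# Build a large single-column data frame: 100 million random 0/1 values.
df <- data.frame(replicate(1, sample(0:1, 100e6, rep = TRUE)))

# NOTE: the body of each `if` below is missing from the report; dir.create()
# is assumed. As given, all three paths resolve to the same directory.
savePath <- file.path(tempdir(), 'arrowTest')
if (!dir.exists(savePath)) dir.create(savePath)
arrow::write_feather(df, file.path(savePath, 'part-0.feather'))

copyPath <- file.path(tempdir(), 'arrowTest')
if (!dir.exists(copyPath)) dir.create(copyPath)

writePath <- file.path(tempdir(), 'arrowTest')
if (!dir.exists(writePath)) dir.create(writePath)

# Copy the file, open the copy as a dataset, and immediately write it out.
arrow::copy_files(savePath, copyPath)
dataset <- arrow::open_dataset(copyPath, format = 'feather')
arrow::write_dataset(dataset = dataset, path = writePath, format = 'feather')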
```
Issue Links
- is duplicated by ARROW-17855 [R] Simultaneous read-write operations causing file corruption. (Open)