Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Done
-
1.13.1
-
None
-
FileSink now supports Azure Data Lake Storage Gen2 APIs (`abfs://` and `abfss://`).
Description
Currently the HadoopRecoverableWriter assumes that the underlying FS is Hadoop and so it checks for DistributedFileSystem. It also tries to do a truncate and ensure the lease is recovered before the 'rename' operation is done.
In the Azure Data lake gen 2 world, the driver does not support truncate and lease recovery API. We should be able to get the last committed size and if it matches go for the rename. Will be back with more details here.
Attachments
Issue Links
- causes
-
FLINK-35531 Avoid calling hsync in flush method in BaseHadoopFsRecoverableFsDataOutputStream
- Closed
- links to