Details
-
New Feature
-
Status: Resolved
-
Critical
-
Resolution: Duplicate
-
None
-
None
-
None
Description
Currently Flink provides the BucketingSink as an exactly once method for writing out to a file system. It provides these guarantees by moving files through several stages and deleting or truncating files that get into a bad state. While this is a powerful abstraction, it causes issues with eventually consistent file systems such as Amazon's S3 where most operations (ie rename, delete, truncate) are not guaranteed to become consistent within a reasonable amount of time. Flink should provide a sink that provides exactly once writes to a file system where only PUT operations are considered consistent.
Attachments
Attachments
Issue Links
- duplicates
-
FLINK-9752 Add an S3 RecoverableWriter
- Closed
- is duplicated by
-
FLINK-9238 There are some questions about BuckeingSink when sinking data to S3.
- Closed
- links to