Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Currently Oozie scheduling works on periodic datasets. It does not have any mechanism to handle aperiodic datasets, which doesn’t follow a fixed schedule/frequency.
Use cases
When incoming dataset arrives with no fixed schedule.
Need to trigger the job based all data available since last run with a possible cap on the max size to process in one run.
Try to avoid creating so many instances when you know input instances will be very few.
Attachments
Attachments
Issue Links
- is blocked by
-
OOZIE-2179 Use HDFS INotify to track HDFS data dependencies instead of polling
- Open