[GOBBLIN-2006] Retention Job should be more robust to OOM failures - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: None
Component/s: misc
Labels:
None

Description

Currently, while cleaning the log files, the Retention job goes into OOM and fails when the no of log files is too many.
Retention job while fetching all the dataset versions, loads the file status all at once into the memory, resulting in this issue.
Thus, the Retention job should avoid loading all data into memory, and use an iterator-based approach. This will load only limited file status into memory and making the retention job pipeline more robust to OOM errors

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Arpit Varshney

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 26/Feb/24 03:57

Updated:: 26/Mar/24 10:16

Resolved:: 26/Mar/24 10:16

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

10m