Falcon / FALCON-1644

Retention : Some feed instances are never deleted by retention jobs.

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.9
    • Component/s: retention
    • Labels: None

    Description

      Here is a sample feed XML.

      <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
      <feed name="rawEmailFeed" description="Raw customer email feed" xmlns="uri:falcon:feed:0.1">
          <tags>externalSystem=USWestEmailServers</tags>
          <groups>churnAnalysisDataPipeline</groups>
          <frequency>hours(1)</frequency>
          <timezone>UTC</timezone>
          <late-arrival cut-off="hours(1)"/>
          <clusters>
              <cluster name="primaryCluster" type="source">
                  <validity start="2015-10-30T01:00Z" end="2015-10-30T10:00Z"/>
                  <retention limit="hours(10)" action="delete"/>
              </cluster>
          </clusters>
          <locations>
              <location type="data" path="/user/ambari-qa/falcon/demo/primary/input/enron/${YEAR}-${MONTH}-${DAY}-${HOUR}"/>
              <location type="stats" path="/"/>
              <location type="meta" path="/"/>
          </locations>
          <ACL owner="ambari-qa" group="users" permission="0x755"/>
          <schema location="/none" provider="/none"/>
      </feed>
      

      In the above example, the validity time is "the time interval when the feed is valid on this cluster". After the validity time ends, Falcon is not expected to perform any operations on the feed. The retention job for this feed therefore runs only from the validity start time to the validity end time, deleting feed instances older than 10 hours on each run. As a result, some feed instances are never deleted: the last run happens no later than 2015-10-30T10:00Z and removes only instances older than 2015-10-30T00:00Z, so feed instances between 2015-10-30T00:00Z and 2015-10-30T10:00Z will never be deleted.
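
      The arithmetic can be checked with a small simulation. This is only a sketch, not Falcon code: it assumes the retention job runs once per hour from validity start through validity end inclusive, and that a run deletes every instance strictly older than "run time minus retention limit". The function name and the sample instance list are hypothetical.

```python
from datetime import datetime, timedelta, timezone

UTC = timezone.utc
VALIDITY_START = datetime(2015, 10, 30, 1, 0, tzinfo=UTC)
VALIDITY_END = datetime(2015, 10, 30, 10, 0, tzinfo=UTC)
RETENTION_LIMIT = timedelta(hours=10)
FREQUENCY = timedelta(hours=1)  # assumed retention run interval


def surviving_instances(instances, first_run, last_run, step, limit):
    """Return the instances left after retention has run at every step in [first_run, last_run]."""
    remaining = set(instances)
    run = first_run
    while run <= last_run:
        cutoff = run - limit
        # Assumed eviction rule: drop instances strictly older than the cutoff.
        remaining = {i for i in remaining if i >= cutoff}
        run += step
    return sorted(remaining)


# Hypothetical hourly instances from 2015-10-29T20:00Z through 2015-10-30T09:00Z.
instances = [datetime(2015, 10, 29, 20, 0, tzinfo=UTC) + n * FREQUENCY for n in range(14)]

left = surviving_instances(instances, VALIDITY_START, VALIDITY_END, FREQUENCY, RETENTION_LIMIT)
for instance in left:
    print(instance.isoformat())
```

      Under these assumptions, every instance from 2015-10-30T00:00Z onward is still present after the final run, which matches the range described above.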

      Ideally, the retention coordinator job should run from "validity start time" up to "validity end time + retention age limit" to ensure all instances are handled.
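
      As a rough illustration of the proposed end time (not necessarily what the attached patches implement), the sketch below computes "validity end time + retention age limit" for the sample feed. The helper name `retention_job_end` is hypothetical, and the 09:00Z instance is assumed to be the last one an hourly feed produces before validity end.

```python
from datetime import datetime, timedelta, timezone


def retention_job_end(validity_end, retention_limit):
    """Proposed end of the retention coordinator: validity end + retention age limit."""
    return validity_end + retention_limit


validity_end = datetime(2015, 10, 30, 10, 0, tzinfo=timezone.utc)
limit = timedelta(hours=10)
job_end = retention_job_end(validity_end, limit)

# Assumed newest instance of the hourly feed, produced just before validity end.
last_instance = datetime(2015, 10, 30, 9, 0, tzinfo=timezone.utc)

print(job_end.isoformat())
# True when the newest instance is older than the limit by the time the job ends.
print(job_end - last_instance > limit)
```

      For the sample feed this gives an end time of 2015-10-30T20:00Z, by which point even the newest instance is older than the 10 hour limit and is eligible for deletion.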

      Attachments

        1. FALCON-1644-v4.patch
          14 kB
          Ajay Yadav
        2. FALCON-1644-v3.patch
          13 kB
          Balu Vellanki
        3. FALCON-1644-v2.patch
          9 kB
          Balu Vellanki
        4. FALCON-1644-v1.patch
          8 kB
          Balu Vellanki
        5. FALCON-1644.patch
          4 kB
          Balu Vellanki

            People

              Assignee: Balu Vellanki (bvellanki)
              Reporter: Balu Vellanki (bvellanki)
              Votes: 0
              Watchers: 5