Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-28

When RSS connector fails to fetch a feed due to network difficulties, its retry strategy is flawed

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • RSS connector
    • None

    Description

      For a continuous RSS crawl, when the RSS connector cannot fetch a feed due to a socket connection problem (e.g. error -103), it does not schedule retry of the feed fetch in a feed-specific way, but rather uses the default document refetch interval. So, if someone has configured their job to never refetch documents, the feed effectively is never tried again.

      The behavior should be for the fetch to be retried for some period of time before giving up. Since the connector does not know what type of document it is fetching in advance, all document fetches must be treated in this way in this situation.

      Attachments

        Activity

          People

            kwright@metacarta.com Karl Wright
            kwright@metacarta.com Karl Wright
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: