Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-1620

Accept Sitemaps with content type application/xml

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • ManifoldCF 2.14
    • Web connector
    • None

    Description

      Given an Output Connection, that does not accepts the MIME type application/xml for ingestion, it is currently not possible to crawl a sitemap.xml, when the webserver returns application/xml as content type for the sitemap.

      The sitemap is discarded before the links are extracted, because the mime type application/xml is not listed in the interestingMimeTypeArray.

      Attachments

        Issue Links

          Activity

            People

              schuch Markus Schuch
              schuch Markus Schuch
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: