Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2612

Support for sitemap processing by hostname

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.14
    • 1.16
    • sitemap
    • None

    Description

      Add support to sitemap processor for processing just hostnames. Similar to the mapper eating sitemap URL's, but then with BaseRobotRules finding the sitemap URL's itself.

      Will upload patch soon.

      Attachments

        1. NUTCH-2612.patch
          8 kB
          Markus Jelsma
        2. NUTCH-2612.patch
          5 kB
          Markus Jelsma
        3. NUTCH-2612.patch
          5 kB
          Markus Jelsma

        Activity

          People

            markus17 Markus Jelsma
            markus17 Markus Jelsma
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: