Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-966

Behavior of NOINDEX,FOLLOW is not intuitive

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Auto Closed
    • 1.2
    • 2.5
    • indexer, parser
    • None

    Description

      If a page has NOINDEX,FOLLOW for the ROBOTS metatag, Nutch will still create a document that can be found in the index via metatag or URL matching. Instead, Nutch should rely on doc or parse metadata but nothing should be stored by the html parser. (thanks to Julien Nioche for helping me to understand the issue).

      Attachments

        Activity

          People

            Unassigned Unassigned
            jpavel Josh Pavel
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: