  1. Nutch
  2. NUTCH-1337

WebGraph to follow redirects

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 1.21
    • Component/s: scoring, webgraph
    • Labels: None

    Description

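      With the current WebGraph, URL shortening services "steal" inlinks from the actual target pages: a link to a shortened URL is counted as an inlink of the redirecting URL, not of the page it redirects to. The WebGraph OutlinkDb mapper should use the redirect target URL instead, if there is one.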
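
      The proposed behavior could look roughly like the sketch below. It is not taken from Nutch: the class `RedirectAwareOutlinks`, its methods, the `MAX_HOPS` limit, and the idea of a ready-made redirect map (source URL to target URL) are all assumptions made for illustration. How the mapper would actually obtain redirect information is left open by this issue.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical helper (not part of Nutch): rewrites outlink targets through known redirects. */
final class RedirectAwareOutlinks {

    // Assumed cap on redirect hops, chosen for illustration only.
    private static final int MAX_HOPS = 5;

    private RedirectAwareOutlinks() {}

    /**
     * Follows url through the redirects map (source URL -> target URL) and
     * returns the last URL reached. Stops when no redirect is known, when a
     * URL is visited twice (cycle), or after MAX_HOPS hops.
     */
    static String resolve(String url, Map<String, String> redirects) {
        Set<String> seen = new HashSet<>();
        String current = url;
        for (int hop = 0; hop < MAX_HOPS; hop++) {
            String next = redirects.get(current);
            if (next == null || !seen.add(current)) {
                break; // no redirect known, or this URL was already visited
            }
            current = next;
        }
        return current;
    }

    /** Returns a new list in which every outlink target is replaced by its resolved URL. */
    static List<String> rewriteTargets(List<String> outlinks, Map<String, String> redirects) {
        List<String> resolved = new ArrayList<>(outlinks.size());
        for (String outlink : outlinks) {
            resolved.add(resolve(outlink, redirects));
        }
        return resolved;
    }
}
```

      With a map containing `http://sho.rt/x -> http://example.org/page`, an outlink to `http://sho.rt/x` would be recorded as an outlink to `http://example.org/page`, so the inlink is credited to the real page. URLs without a known redirect pass through unchanged.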

          People

            Assignee: markus17 Markus Jelsma
            Reporter: markus17 Markus Jelsma
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated: