[NUTCH-1090] LinkDb (invertlinks) should inform the user when it ignores internal links - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Trivial
Resolution: Fixed
Affects Version/s: 1.3
Fix Version/s: 1.5
Component/s: linkdb
Labels:

Patch Info:

Patch Available

Description

I used nutch to crawl sites on a single domain. After the crawl was complete I tried to build a LinkDb. The LinkDb was empty.
It comes up that this happens because the invertlinks command ignores internal links to the same domain by default.

Unfortunately the LinkDb class doesn't tell anything about that. So it was hard to find out why the LinkDb was empty.

I suggest to add an information for the user when the invertlinks command is ignoring internal links.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LinkDb.patch
24/Aug/11 15:14
2 kB
Marek Bachmann

Activity

People

Assignee:: Markus Jelsma

Reporter:: Marek Bachmann

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 24/Aug/11 14:08

Updated:: 22/May/13 03:54

Resolved:: 15/Nov/11 11:56