
| Key: | NUTCH-324 |
| Type: | Improvement |
| Status: | Closed |
| Resolution: | Fixed |
| Priority: | Critical |
| Assignee: | Unassigned |
| Reporter: | Stefan Groschupf |
| Votes: | 0 |
| Watchers: | 0 |
|
|
|
|
|
File Attachments:
|
|
|
Issue Links:
|
Duplicate
|
|
|
|
This issue is duplicated by:
|
|
|
|
|
|
|
|
| Resolution Date: | 24/Jul/06 03:26 PM |
|
|
|
Description
|
The configuration properties db.score.link.external and db.score.link.internal are ignored.
For message-board pages, or for pages that repeat a large navigation menu on every page, giving internal links a lower impact on scoring makes a lot of sense.
This is also a serious problem with regard to web spam: spammers can currently set up a single domain with dynamically generated pages and thereby heavily manipulate the Nutch scores.
I therefore also suggest giving db.score.link.internal a default value of something like 0.25. |
|
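To illustrate the behaviour requested in the description, here is a minimal sketch of how separate internal and external link factors could weight the score a page passes to its outlinks. It is not Nutch's actual code: the class `LinkScoreSketch`, the method names, the equal split of the page score across outlinks, and the "same host means internal" rule are all assumptions made for this example. The constants stand in for db.score.link.internal (using the 0.25 suggested above) and db.score.link.external (assumed to be 1.0 here).

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class LinkScoreSketch {
    // Hypothetical stand-ins for db.score.link.internal and db.score.link.external.
    static final float INTERNAL_FACTOR = 0.25f;
    static final float EXTERNAL_FACTOR = 1.0f;

    // Assumption: a link counts as "internal" when both URLs share the same host.
    static boolean sameHost(String fromUrl, String toUrl) {
        String fromHost = URI.create(fromUrl).getHost();
        String toHost = URI.create(toUrl).getHost();
        return fromHost != null && fromHost.equalsIgnoreCase(toHost);
    }

    // Splits the page score evenly across outlinks, then scales each share
    // by the internal or external factor.
    static Map<String, Float> distribute(String fromUrl, float pageScore, List<String> outlinks) {
        Map<String, Float> contributions = new LinkedHashMap<>();
        if (outlinks.isEmpty()) {
            return contributions;
        }
        float share = pageScore / outlinks.size();
        for (String toUrl : outlinks) {
            float factor = sameHost(fromUrl, toUrl) ? INTERNAL_FACTOR : EXTERNAL_FACTOR;
            // Repeated links to the same target accumulate their contributions.
            contributions.merge(toUrl, share * factor, Float::sum);
        }
        return contributions;
    }

    public static void main(String[] args) {
        Map<String, Float> result = distribute(
            "http://forum.example.com/thread/1",
            1.0f,
            List.of("http://forum.example.com/thread/2", "http://other.example.org/"));
        System.out.println(result);
    }
}
```

With an internal factor below 1.0, a site that links heavily to its own generated pages passes on less score per link than it would through links to other hosts, which is the effect the description asks for.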