Details
- Bug
- Status: Closed
- Minor
- Resolution: Fixed
- 1.5.1
- None
- None
Description
Hi,
I have been able to parse metatags in an HTML page by following http://wiki.apache.org/nutch/IndexMetatags. However, it does not work well when a page contains two metatags with the same name but different content.
Has anyone else encountered this issue?
Are there any changes that need to be made to the config files to make it work?
When there are two tags with the same name and different content, only the value of the later tag is saved, instead of a multiValued field being created that holds both values.
Edit: I have attached a patch for the file. It was provided by DLA (Digital Library and Archives, http://scholar.lib.vt.edu/) of Virginia Tech.
Many Thanks,
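For readers who want to see the reported behavior in isolation, here is a minimal sketch of the difference between "last value wins" and a multi-valued field. It is only an illustration: it uses Python's standard `html.parser` rather than Nutch's Java plugin code, it is not the attached patch, and the names `MetaTagCollector` and `collect_metatags` are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Hypothetical helper: gathers <meta name=... content=...> pairs two ways."""

    def __init__(self):
        super().__init__()
        # Single-valued storage: a later tag with the same name overwrites the earlier one.
        self.last_value = {}
        # Multi-valued storage: every content value for a name is kept, in document order.
        self.multi_value = defaultdict(list)

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name is None or content is None:
            return  # skip tags such as <meta charset="utf-8">
        name = name.lower()
        self.last_value[name] = content
        self.multi_value[name].append(content)


def collect_metatags(html):
    """Return (single-valued dict, multi-valued dict) for the given HTML string."""
    collector = MetaTagCollector()
    collector.feed(html)
    collector.close()
    return collector.last_value, dict(collector.multi_value)


page = (
    '<html><head>'
    '<meta name="keywords" content="nutch">'
    '<meta name="keywords" content="solr"/>'
    '</head></html>'
)
last, multi = collect_metatags(page)
print(last)   # {'keywords': 'solr'}
print(multi)  # {'keywords': ['nutch', 'solr']}
```

The first dictionary mirrors the behavior described in the report, where only the later tag's content survives. The second shows the outcome the reporter expected, with both values retained under the same field name.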
Attachments
Issue Links
- is depended upon by: NUTCH-1583 Headings does not support multiValued headings (Closed)