Affects Version/s: 1.5.1
Fix Version/s: 1.9
I have been able to parse metatags in an HTML page by following http://wiki.apache.org/nutch/IndexMetatags. However, it does not work correctly when a page contains two metatags with the same name but different contents.
Has anyone else encountered this issue?
Do any changes need to be made to the config files to make it work?
When there are two tags with the same name and different content, only the value of the later tag is saved, instead of a multi-valued field being created.
Edit: I have attached a patch for the file. It was provided by DLA (Digital Library and Archives), http://scholar.lib.vt.edu/, of Virginia Tech.
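To make the reported behaviour concrete, here is a minimal standalone Java sketch of the difference between "last value wins" and collecting every value for a repeated metatag name. It is an illustration only: it is not the attached patch and does not use any Nutch classes. The class name `MultiValueMetaTags`, both method names, and the simplified regular expression (which assumes `name` comes before `content` and both are double-quoted) are assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Nutch.
class MultiValueMetaTags {

    // Simplified pattern: assumes name="..." precedes content="..."
    // and that both attributes use double quotes.
    private static final Pattern META = Pattern.compile(
        "<meta\\s+name=\"([^\"]*)\"\\s+content=\"([^\"]*)\"[^>]*>",
        Pattern.CASE_INSENSITIVE);

    // Mirrors the reported behaviour: a later tag overwrites an earlier
    // one with the same name, so only one value survives.
    public static Map<String, String> lastValueWins(String html) {
        Map<String, String> out = new LinkedHashMap<>();
        Matcher m = META.matcher(html);
        while (m.find()) {
            out.put(m.group(1).toLowerCase(Locale.ROOT), m.group(2));
        }
        return out;
    }

    // Expected behaviour: every value is kept, in document order,
    // so the name maps to a multi-valued list.
    public static Map<String, List<String>> allValues(String html) {
        Map<String, List<String>> out = new LinkedHashMap<>();
        Matcher m = META.matcher(html);
        while (m.find()) {
            out.computeIfAbsent(m.group(1).toLowerCase(Locale.ROOT),
                                k -> new ArrayList<>())
               .add(m.group(2));
        }
        return out;
    }

    public static void main(String[] args) {
        String html = "<html><head>"
            + "<meta name=\"keywords\" content=\"nutch\">"
            + "<meta name=\"keywords\" content=\"solr\">"
            + "</head></html>";
        System.out.println(lastValueWins(html)); // prints {keywords=solr}
        System.out.println(allValues(html));     // prints {keywords=[nutch, solr]}
    }
}
```

With the two `keywords` tags above, the first method keeps only `solr`, which matches what I see in the index, while the second keeps both values, which is what a multi-valued field would need.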