Description
NUTCH-1733 added support for HTML5 charset definitions.
In some cases a web site declares multiple meta elements with different charsets:
<meta charset="utf-8">
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
(e.g. http://www.edga.fr/)
In this case the charset from the second element (iso-8859-1) is the one detected.
How about giving HTML5 charset definitions priority?
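
To illustrate the proposed ordering, here is a minimal sketch in Java. It is not the existing Nutch code: the class name MetaCharsetSniffer, the method sniff, and both regular expressions are hypothetical and exist only for this example. It assumes the HTML5 form has charset as the first attribute of the meta element, and it checks that form before falling back to the http-equiv form, regardless of the order in which the two appear in the page.

```java
import java.util.Locale;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only (not part of Nutch).
final class MetaCharsetSniffer {

    // HTML5 form: <meta charset="utf-8">
    // Assumption: charset is the first attribute of the element.
    private static final Pattern HTML5_CHARSET = Pattern.compile(
        "<meta\\s+charset\\s*=\\s*[\"']?([a-z][_\\-0-9a-z]*)",
        Pattern.CASE_INSENSITIVE);

    // Legacy form: <meta http-equiv="Content-Type" content="text/html; charset=...">
    private static final Pattern LEGACY_CHARSET = Pattern.compile(
        "<meta\\s+[^>]*http-equiv\\s*=\\s*[\"']?content-type[\"']?"
            + "[^>]*charset\\s*=\\s*([a-z][_\\-0-9a-z]*)",
        Pattern.CASE_INSENSITIVE);

    private MetaCharsetSniffer() {
    }

    // Returns the declared charset in lower case, preferring the HTML5
    // declaration over the http-equiv one when both are present.
    static Optional<String> sniff(String html) {
        Matcher html5 = HTML5_CHARSET.matcher(html);
        if (html5.find()) {
            return Optional.of(html5.group(1).toLowerCase(Locale.ROOT));
        }
        Matcher legacy = LEGACY_CHARSET.matcher(html);
        if (legacy.find()) {
            return Optional.of(legacy.group(1).toLowerCase(Locale.ROOT));
        }
        return Optional.empty();
    }
}
```

With the two meta elements from the example above, sniff would return utf-8 rather than iso-8859-1. A real implementation would presumably also limit the search to the first bytes of the content and validate the name with java.nio.charset.Charset before using it.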