Description
DOMSerializerImpl from xerces is deprecated in our current artifact. It is replaced by the (still ancient but slightly newer org.apache.xml.serializer.dom3.LSSerializerImpl in [0]).
Upon closer inspection it seems that find . | xargs grep "org.apache.xml" * only pulled up DOMBuilder, XMLChatacterRecognizer and DOMContentUtilsTest as the places where such classes are used.
I am confused as to why they are included as primary dependencies within Nutch. Either these XML specific dependencies should be restricted dependencies to parse-html or else they should be removed and replaced by the new artifact [0].
[0] http://search.maven.org/#artifactdetails|xalan|serializer|2.7.1|jar