Description
[~wastl-nagel] pointed out on the user list that there can be thread contention when creating SAXParsers. Jukka Zitting recommended pooling SAXParsers.
I noticed a modest improvement in speed when parsing docx files with our SAX based parser, and I suspect that [~wastl-nagel] will find an even greater improvement during detection because we're currently creating a new SAXParser on every call to detect an XML root.
Attachments
Attachments
Issue Links
- is related to
-
NUTCH-2578 Avoid lock by MimeUtil in constructor of protocol.Content
-
- Closed
-