Description
wastl-nagel pointed out on the user list that there can be thread contention when creating SAXParsers. jukkaz recommended pooling SAXParsers.
I noticed a modest improvement in speed when parsing docx files with our SAX based parser, and I suspect that wastl-nagel will find an even greater improvement during detection because we're currently creating a new SAXParser on every call to detect an XML root.
Attachments
Issue Links
- is related to
-
NUTCH-2578 Avoid lock by MimeUtil in constructor of protocol.Content
- Closed