Details
Description
While working on upgrading the indexer-elastic plugin from 2.x to 5.x, I ran into several nasty runtime dependency issues (both local and on Hadoop). After seeking help on the mailing list, I still was unable to resolve these issues and after digging further, decided to try a different plugin classloader strategy.
The normal classloader delegates class loading requests to it's parent classloader. This can cause all sorts of nasty runtime dependency version conflicts (jar hell, version conflicts), since the plugin's own classloader gets queried last. The child-first classloader approach tries to load a class from the plugin's dependencies first and when unavailable, delegates to it's parent classloader. This fixed the issues I had.
The new approach can give runtime LinkageErrors, but these are easily resolvable (see the patch for a few examples)
I've tested the new loader a bit and am curious about others' findings.
Attachments
Attachments
Issue Links
- contains
-
NUTCH-2071 A parser failure on a single document may fail crawling job if parser.timeout=-1
- Closed
-
NUTCH-2316 Library conflict with Parser-Tika Plugin and Lib Folder
- Closed
- is required by
-
NUTCH-2380 indexer-elastic version upgrade to 5.3.0
- Closed