Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Incomplete
-
None
-
None
-
None
-
None
Description
These patches should be first applied in a sandbox. They integrate some changes proposed earlier in jira.
List of the changes:
- TaskQueue repleaced by java.util.Queue
- Handling process reviewed.
- Extractors inherit from Handler => no need to parse the document twice
- Entity renamed in Identifier
- ContentEntity in Resource
- Crawler moved to droids-crawler
- Parser moved to droids-parser
- Walker moved to droids-walker
- The walker also use an Extractor
- ...
- and much more that should be reviewed before integration .