Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.7.0
-
None
-
None
Description
I've been running Any23 on a big web crawler dump. I found for certain documents with a lot of Microdata relations the method MicrodataParser.getItemProps() becomes very slow. As a result, processing one document can take several minutes. An example of a problematic page can be seen here: http://dreamtime.fftunes.com/
I'll attach a patch for the method that greatly improves the performance of this method. I was wondering if someone could have a look at it and include it in the next release if possible.
Attachments
Attachments
Issue Links
- duplicates
-
ANY23-77 Facing a infinite loop problem in version 0.6.1 - Verify
- Closed