Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
2.3
-
None
Description
Since the NTriplesExtractorFactory includes a content type of "text/plain", this causes every plain text file to be processed by the NTriplesExtractor, which in turn causes huge numbers of completely unnecessary fatal issues being sent to the extraction report.
In my crawls, this mostly occurs for all the "humans.txt" files encountered.
While this isn't a hugely serious bug, it is quite irritating as it does really clutter up my logs.
Note: the NQuadsExtractorFactory (which can parse all the same documents as NTriples) does not include a content type of "text/plain".
Attachments
Issue Links
- links to