Details
- Type: Improvement
- Status: Closed
- Priority: Minor
- Resolution: Fixed
Description
PDFDebugger isn't needed in Tika, but it is pulled in as a dependency of pdfbox-tools (because pdfbox-tools contains the command line interface, which calls the PDFBox command line tools).
Thus I suggest that the Tika parser pom be changed like this:
  <dependency>
    <groupId>org.apache.pdfbox</groupId>
    <artifactId>pdfbox-tools</artifactId>
    <version>${pdfbox.version}</version>
    <exclusions>
      <exclusion>
        <groupId>commons-logging</groupId>
        <artifactId>commons-logging</artifactId>
      </exclusion>
+     <exclusion>
+       <groupId>org.apache.pdfbox</groupId>
+       <artifactId>pdfbox-debugger</artifactId>
+     </exclusion>
    </exclusions>
This saves you 200KB in tika-app. That's not much, but every bit of weight loss counts.
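For reference, here is a sketch of how the whole dependency element might read once the proposed exclusion is applied. The coordinates are taken from the snippet above; the closing `</dependency>` tag is an assumption (the snippet stops at `</exclusions>`), and `${pdfbox.version}` is assumed to be a property defined elsewhere in the pom.

```xml
<dependency>
  <groupId>org.apache.pdfbox</groupId>
  <artifactId>pdfbox-tools</artifactId>
  <!-- pdfbox.version is assumed to be a property defined elsewhere in the pom -->
  <version>${pdfbox.version}</version>
  <exclusions>
    <!-- existing exclusion -->
    <exclusion>
      <groupId>commons-logging</groupId>
      <artifactId>commons-logging</artifactId>
    </exclusion>
    <!-- proposed: keep PDFDebugger out of the Tika build -->
    <exclusion>
      <groupId>org.apache.pdfbox</groupId>
      <artifactId>pdfbox-debugger</artifactId>
    </exclusion>
  </exclusions>
</dependency> <!-- closing tag assumed; the snippet above stops at </exclusions> -->
```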
It should also be possible to get it removed from tika-bundle, but I don't know how to remove it properly. Just removing it from "Embed-Dependency" isn't enough.
Issue Links
- is related to TIKA-3818 Remove pdfdebugger from tika (2) (Resolved)