Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Sometimes, for example, you use tika to parse an XLS file that isn't really that big, maybe 60 MB. and suddenly the JVM heap size taken is >800Mb which causes an OOM in my case.
Can we make an "abort threshold" where the tika parse will halt if parse output bytes exceeds this value?
Or it is possible for users to already do this themselves by watching the input stream as it grows somehow?