Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Version: 1.18
Description
When extracting text from some relatively large Excel files (around 9,000 rows), I found that an extra string, "&A PAGE &P", is appended to the end of the resulting text when Tika.parseToString is called. Is this a known issue? Is there any configuration that would let me opt out of emitting these extra characters?
I did not find a good answer on Google.
The input Excel spreadsheet is attached.
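For context, "&A" and "&P" are Excel header/footer field codes (sheet name and page number), so the extra string appears to be the workbook's header or footer text rather than cell content. Until a parser-level option is confirmed, one possible workaround is to strip the string after extraction. The sketch below is only that: `FooterStripper` and `stripTrailingFooter` are hypothetical names, not part of the Tika API, and it assumes the footer is exactly the literal reported above and appears only at the very end of the text.

```java
import java.util.regex.Pattern;

// Hypothetical post-processing helper; NOT part of the Tika API.
final class FooterStripper {

    // The literal string reported in this issue, with any surrounding
    // whitespace, anchored to the very end of the input (\z).
    private static final Pattern TRAILING_FOOTER =
            Pattern.compile("\\s*" + Pattern.quote("&A PAGE &P") + "\\s*\\z");

    // Returns the text without the trailing footer string.
    // Text that does not end with the footer is returned unchanged.
    static String stripTrailingFooter(String extracted) {
        return TRAILING_FOOTER.matcher(extracted).replaceFirst("");
    }

    public static void main(String[] args) {
        // Stand-in for the result of Tika.parseToString(...) on the spreadsheet.
        String extracted = "row 1\nrow 2\n&A PAGE &P\n";
        System.out.println(stripTrailingFooter(extracted));
    }
}
```

Because the pattern is anchored to the end of the text, an occurrence of the same string inside cell content is left alone. Note that whitespace around the removed footer is dropped as well.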