Details
-
Bug
-
Status: Open
-
Critical
-
Resolution: Unresolved
-
1.23
-
None
-
None
Description
Attached is a RAR file containing a PPT file ("test.ppt") with one line in it - "Here the PPT content starts".
However, the extracted text from tika is not separating the file name and its content as follows:
"test.pptHere the PPT content starts"