Description
RTF files (such as those produced by WordPad) often encode consecutive linebreaks as consecutive \par commands. However, org.apache.tika.parser.rtf.TextExtractor ignores the second \par. Solution is simple, see attached patch.
Attachments
Attachments
Issue Links
- is duplicated by
-
TIKA-1544 empty lines are not preserved
- Resolved