Description
RTF files (such as those produced by WordPad) often encode consecutive line breaks as consecutive \par commands. However, org.apache.tika.parser.rtf.TextExtractor ignores the second \par, so the empty line between the paragraphs is lost. The fix is simple; see the attached patch.
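To illustrate the expected behaviour, here is a minimal, self-contained sketch. It is not Tika's TextExtractor and not the attached patch; the class ParSketch and the method parsToNewlines are hypothetical names used only for this illustration. It assumes a simplified RTF body with no groups, and shows that every \par, including a second consecutive one, should yield its own newline.

```java
public class ParSketch {

    // Hypothetical helper (not part of Tika): converts a simplified RTF body
    // to plain text, emitting one newline for every \par control word.
    static String parsToNewlines(String rtfBody) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < rtfBody.length()) {
            char c = rtfBody.charAt(i);
            if (c != '\\') {
                out.append(c); // ordinary text is copied through
                i++;
                continue;
            }
            // Read the letters of the control word that follows the backslash.
            int j = i + 1;
            while (j < rtfBody.length() && Character.isLetter(rtfBody.charAt(j))) {
                j++;
            }
            String word = rtfBody.substring(i + 1, j);
            // Skip an optional numeric parameter such as the 24 in \fs24.
            if (j < rtfBody.length() && rtfBody.charAt(j) == '-') {
                j++;
            }
            while (j < rtfBody.length() && Character.isDigit(rtfBody.charAt(j))) {
                j++;
            }
            // A single space after a control word is a delimiter, not text.
            if (j < rtfBody.length() && rtfBody.charAt(j) == ' ') {
                j++;
            }
            if (word.equals("par")) {
                out.append('\n'); // each \par counts, consecutive or not
            }
            i = j;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Two consecutive \par commands, as WordPad writes for an empty line.
        String body = "first\\par\\par second";
        String text = parsToNewlines(body);
        // Show the newlines explicitly so the empty line is visible.
        System.out.println(text.replace("\n", "\\n"));
    }
}
```

With the behaviour described in this issue, the second \par would be dropped and the two paragraphs would be separated by a single line break instead of two.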
Attachments
Issue Links
- is duplicated by TIKA-1544 empty lines are not preserved (Resolved)
Fixed on duplicate TIKA-1544 in r1658947. Thank you for the patch, almson!