Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2150

RTF TextExtractor omits some content

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.13
    • 1.22
    • parser
    • None

    Description

      The TextExtractor class seems to handle the first two content words (TO FROM) in the provided file as if they would belong to the header. They are missing in the text output .

      Attachments

        1. bi16tabe.000
          0.2 kB
          T. Schmidt

        Issue Links

          Activity

            People

              tallison Tim Allison
              tins T. Schmidt
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: