Tika / TIKA-290

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16

Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.4
    • Fix Version/s: 0.5
    • Component/s: parser
    • Labels: None
    • Environment: Windows XP / jdk1.6.0_15

    Description

      This is just for information (I am testing Tika).

      I am using tika-app-0.4.jar out of the box.
      I get the following run-time error:
      org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16

      when parsing an ANSI-encoded text file containing:
      azerty

      123456789012345 6789012345678901 2345678901234567890123456789 0123456789012345678901234567890123 456789012345678901234567890123456 789012345678901234567890123456789012345678901 2345678901234567890123456789 012345678901234567890123456 7890123456789012345 678901234567890123456789012345 6789012345678901234567890

      1234567890123456789012 345678901234567890123456789012345 6789012345678901234567890123456789012345678901234 567890123456789012345678901234567890123456789012345678901234 56789012345678901234567890123456789012345678901234567890123456789012345 78901234567890123456789012345678901234 56789012345678901234567890TOOLONGTOKEN

      qwerty.

      Parsing works if the file is saved as UTF-8, or if I delete some lines from the ANSI file. I don't know why.

      Best regards
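
A note for anyone trying to reproduce this: the sample text, as displayed above, contains only ASCII characters, so an "ANSI" copy and a BOM-less UTF-8 copy should be byte-for-byte identical. The sketch below checks that using only the JDK. It assumes "ANSI" means Windows-1252 and that the UTF-8 copy may have been saved with a byte order mark, as some Windows editors do; neither is confirmed by the report, and the `EncodingCheck` class and its `encode` helper are hypothetical names introduced for illustration. It does not call Tika and makes no claim about the actual cause of the exception.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class EncodingCheck {
    // UTF-8 byte order mark that some Windows editors prepend when saving as UTF-8.
    static final byte[] UTF8_BOM = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF};

    // Encodes text in the given charset, optionally prefixed with the UTF-8 BOM.
    static byte[] encode(String text, Charset charset, boolean withUtf8Bom) {
        byte[] body = text.getBytes(charset);
        if (!withUtf8Bom) {
            return body;
        }
        byte[] out = new byte[UTF8_BOM.length + body.length];
        System.arraycopy(UTF8_BOM, 0, out, 0, UTF8_BOM.length);
        System.arraycopy(body, 0, out, UTF8_BOM.length, body.length);
        return out;
    }

    public static void main(String[] args) {
        // Shortened stand-in for the reported file (assumption: ASCII only, CRLF line ends).
        String sample = "azerty\r\n\r\n123456789012345 6789012345678901\r\n\r\nqwerty.\r\n";

        // Assumption: "ANSI" on the reporter's machine is Windows-1252.
        byte[] ansi = encode(sample, Charset.forName("windows-1252"), false);
        byte[] utf8 = encode(sample, StandardCharsets.UTF_8, false);
        byte[] utf8WithBom = encode(sample, StandardCharsets.UTF_8, true);

        System.out.println("ANSI equals UTF-8 without BOM: " + Arrays.equals(ansi, utf8));
        System.out.println("Extra bytes in UTF-8 with BOM: " + (utf8WithBom.length - ansi.length));
    }
}
```

If the two encodings do come out identical for the real attachment, the different behaviour between the "ANSI" and "UTF-8" copies would have to come from something other than the text itself, such as a leading BOM; comparing the attached test.txt with the UTF-8 copy byte by byte would settle that.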

      Attachments

        1. test.txt
          0.6 kB
          MRIT64

          People

            Assignee: Jukka Zitting
            Reporter: MRIT64

            Dates

              Created:
              Updated:
              Resolved: