Tika / TIKA-1970

Date not extracted from email saved as plain txt

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.14
    • Fix Version/s: 1.14, 2.0.0
    • Component/s: metadata
    • Labels: None
    • Environment: Debian Linux Jessie
      Java(TM) SE Runtime Environment (build 1.8.0_91-b14)
      Mac OS X Mail

    Description

      I have two email test files:

      (1) A file that has been created by using "save as" in Mac Mail (this creates a .txt file)
      (2) A file that has been created by dragging an email from Mac Mail to the Desktop (this creates an .eml file)

      If I send each file to the Tika server with

      curl -T filename http://localhost:9998/detect/stream

      I get the response "message/rfc822" for both files.

      If I run

      curl -T filename http://localhost:9998/meta

      I get the metadata, but for file (1) the date is not extracted, whereas for file (2) it is.
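
The comparison above can be scripted so that both files are checked the same way. The following is a minimal sketch, not part of the original report. It assumes that /meta returns its default CSV output with one "key","value" row per field, and that a date field can be recognised by a key containing "date" or "created". Both helper names (date_rows, check_date) are hypothetical, and the actual key names depend on the Tika version.

```shell
# Hypothetical helper: read /meta CSV output on stdin and print only the rows
# whose key (the first column) contains "date" or "created", case-insensitively.
# Assumption: rows look like "key","value".
date_rows() {
  grep -iE '^"?[^",]*(date|created)[^",]*"?,'
}

# Hypothetical helper: send one file to the /meta endpoint used in this report
# and say whether any date-like field came back.
check_date() {
  if curl -s -T "$1" http://localhost:9998/meta | date_rows; then
    echo "$1: date-like field found"
  else
    echo "$1: no date-like field found"
  fi
}
```

With a Tika server listening on localhost:9998, running check_date Testemail-nodate.txt and then check_date Testemail-date.eml should show the difference described here, provided the key-name assumption holds for the server version in use.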

      Attachments

        1. Testemail-nodate.txt
          0.2 kB
          Philipp Steinkrueger
        2. Testemail-date.eml
          0.5 kB
          Philipp Steinkrueger

            People

              Assignee: Unassigned
              Reporter: Philipp Steinkrueger (philipp.steinkrueger@uni-koeln.de)
              Votes: 0
              Watchers: 4
