Tika / TIKA-2280

message_from not extracted from Outlook emails


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: 1.14
    • Fix Version/s: None
    • Component/s: parser

    Description

      While the MESSAGE_FROM metadata field is extracted for both RFC 822 and Outlook emails, it doesn't include the sender's address for Outlook emails.

      For example, if the raw From field is "John Doe <john@example.com>", the Outlook email parser sets MESSAGE_FROM to "John Doe" while the RFC 822 email parser sets it to "John Doe <john@example.com>".

      As a workaround, I currently read the sender address from the RAW_HEADER_FROM field for Outlook emails, but it would be better if MESSAGE_FROM followed a single standard across email formats.
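
To illustrate the workaround of reading the address out of the raw From header, here is a minimal sketch in plain Java. It is not Tika code: the class `FromHeaderSketch` and its methods are hypothetical names, and the regular expression only covers the simple `Name <address>` shape from the example above, not the full RFC 5322 address grammar. How the raw header value is obtained from the metadata is left out on purpose.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Apache Tika.
final class FromHeaderSketch {

    // Optional (possibly quoted) display name, then an address in angle brackets.
    // Assumption: the header holds a single mailbox in "Name <address>" form.
    private static final Pattern NAME_ADDR =
            Pattern.compile("^\\s*\"?([^\"<]*?)\"?\\s*<([^<>\\s]+)>\\s*$");

    private FromHeaderSketch() {
    }

    // Returns the address part of a raw From value, or null if none is found.
    static String extractAddress(String rawFrom) {
        if (rawFrom == null) {
            return null;
        }
        Matcher m = NAME_ADDR.matcher(rawFrom);
        if (m.matches()) {
            return m.group(2);
        }
        // Fall back to a bare address such as "john@example.com".
        String trimmed = rawFrom.trim();
        return trimmed.contains("@") ? trimmed : null;
    }

    // Returns the display name of a raw From value, or null if there is none.
    static String extractName(String rawFrom) {
        if (rawFrom == null) {
            return null;
        }
        Matcher m = NAME_ADDR.matcher(rawFrom);
        if (m.matches() && !m.group(1).isEmpty()) {
            return m.group(1);
        }
        return null;
    }
}
```

With a helper along these lines, a caller could rebuild a value of the form `name <address>` for Outlook messages from the raw header, matching what the RFC 822 parser reports in MESSAGE_FROM.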

            People

              Assignee: Unassigned
              Reporter: Matthew Caruana Galizia (mcaruanagalizia)
              Votes: 0
              Watchers: 2
