Uploaded image for project: 'James Mailbox'
  1. James Mailbox
  2. MAILBOX-403

Email main body is also indexed as an attachment

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 3.6.0
    • None
    • None

    Description

      What

      I discovered that the main body part, holding the text of an email, and already indexed as part of textBody/htmlBody properties, is also indexed as an attachment.

      This behaviour is functionally wrong, as it returns attachment hits for terms contained in the body of the message.

      It also cause a larger index size, meaning more disk costs, and higher latencies.

      Definition of done

      Unit tests emonstrating ElasticSearch main bodies are no longer indexed as attachments.

      How

      Upon turning children subparts into attachment (flattening) only keep mime parts that explicitly have a content-disposition (either inline or attachment).

      This by the way avoids indexing multiparts as attachments (they were not filtered out...)

      Proposed fix: https://github.com/linagora/james-project/pull/4152

      Attachments

        Activity

          People

            Unassigned Unassigned
            btellier Benoit Tellier
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: