Enable importing of mails into solr through DIH. Take one or more mailbox credentials, download and index their content along with the content from attachments. The folders to fetch can be made configurable based on various criteria. Apache Tika is used for extracting content from different kinds of attachments. JavaMail is used for mail box related operations like fetching mails, filtering them etc.
The basic configuration for one mail box is as below:
The below is the list of all configuration available:
protocol (only "imaps" supported now)
folders - comma seperated list of folders.
If not specified, default folder is used. Nested folders can be specified like a/b/c
recurse - index subfolders. Defaults to true.
exclude - comma seperated list of patterns.
include - comma seperated list of patterns.
batchSize - mails to fetch at once in a given folder.
Only headers can be prefetched in Javamail IMAP.
readTimeout - defaults to 60000ms
conectTimeout - defaults to 30000ms
fetchSize - IMAP config. 32KB default
date/time in "yyyy-MM-dd HH:mm:ss" format, mails received after which will be fetched. Useful for delta import.
customFilter - class name.
processAttachement - defaults to true
The below are the indexed fields.