Description
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various document formats using existing parser libraries. It would be nice to have it integrated into Apache Clerezza, so that Resources could be easily enriched and auto-tagged with metadata once they are inside Clerezza.