Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-1458

Out-of-process text extraction for better protection agains JVM/memory/CPU problems

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • lucene, query

    Description

      This is a tracking / collection bug for solving problems with text extraction of
      documents (very large, broken, malicious, etc), causing JVM crashes, memory
      problems, excessive CPU usage.

      The basic TIKA feature to enable this fix is TIKA-416 [1]

      [1] https://issues.apache.org/jira/browse/TIKA-416

      Attachments

        Activity

          People

            Unassigned Unassigned
            mmarth Michael Marth
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: