Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3841

An exception occurred when parsing some word documents using tika, tika_exception

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Blocker
    • Resolution: Unresolved
    • 1.24, 2.4.1, 1.28.4
    • None
    • parser
    • None

    Description

      {
          "error": {
              "root_cause": [

      {             "type": "parse_exception",             "reason": "Error parsing document in field [content]"         }

      ],
              "type": "parse_exception",
              "reason": "Error parsing document in field [content]",
              "caused_by": {
                  "type": "tika_exception",
                  "reason": "Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@3b5e180a",
                  "caused_by":

      {                 "type": "array_index_out_of_bounds_exception",                 "reason": "351"             }

              }
          },
          "status": 400
      }

      Attachments

        Activity

          People

            Unassigned Unassigned
            lxz lxz
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated: