Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2488

Outlook PST Parser fails from NullPointerException

    Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 1.16
    • Fix Version/s: None
    • Component/s: parser
    • Labels:
    • Environment:

      Ubuntu 64 bit, openjdk version 1.8.0_31

      Description

      Getting this error when trying to parse PST file.
      commands used:
      java -jar tika-server-1.16.jar
      curl -T test.pst http://127.0.0.1:9998/tika --header "Accept: text/plain"

      WARN tika: Text extraction failed
      org.apache.tika.exception.TikaException: Unable to unpack document stream
      at org.apache.tika.parser.mbox.OutlookPSTParser.parse(OutlookPSTParser.java:95)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
      at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
      at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
      at org.apache.tika.server.resource.TikaResource.parse(TikaResource.java:322)
      at org.apache.tika.server.resource.TikaResource$5.write(TikaResource.java:421)
      at org.apache.cxf.jaxrs.provider.BinaryDataProvider.writeTo(BinaryDataProvider.java:169)
      at org.apache.cxf.jaxrs.utils.JAXRSUtils.writeMessageBody(JAXRSUtils.java:1389)
      at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.serializeMessage(JAXRSOutInterceptor.java:243)
      at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.processResponse(JAXRSOutInterceptor.java:119)
      at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.handleMessage(JAXRSOutInterceptor.java:82)
      at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
      at org.apache.cxf.interceptor.OutgoingChainInterceptor.handleMessage(OutgoingChainInterceptor.java:83)
      at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
      at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
      at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:252)
      at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:261)
      at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:76)
      at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088)
      at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024)
      at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
      at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
      at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
      at org.eclipse.jetty.server.Server.handle(Server.java:370)
      at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
      at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:973)
      at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1035)
      at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:647)
      at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:231)
      at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82)
      at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:696)
      at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:53)
      at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
      at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
      at java.lang.Thread.run(Thread.java:748)
      Caused by: org.apache.tika.exception.TikaException: Unable to unpack document stream
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseMailAttachments(OutlookPSTParser.java:241)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:121)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parse(OutlookPSTParser.java:92)
      ... 35 more
      Caused by: java.lang.NullPointerException
      at com.pff.PSTAttachment.getFileInputStream(PSTAttachment.java:119)
      at org.apache.tika.parser.mbox.OutlookPSTParser.parseMailAttachments(OutlookPSTParser.java:232)
      ... 41 more
      ERROR Problem with writing the data, class org.apache.tika.server.resource.TikaResource$5, ContentType: text/plain

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              kyle.goode Kyle Goode
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated: