Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-576

Manifold gets repeated service interruptions and stops

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: ManifoldCF next
    • Fix Version/s: ManifoldCF 1.1
    • Component/s: None
    • Labels:
      None
    • Environment:

      solr 4.0 manifoldcf v1.1-dev on windows 7

      Description

      Manifold gets repeated service interruptions and stops.
      Is there a way to get more detailed error information?
      such as, the document name/url/location that it's having a problem with?
      In v.5.1 these errors would appear at the very end (the last 130 to 184 document) and then stop.
      The solr logs always reported vague TIKA errors
      I'm unsure where the problems lie.
      Here's the manifoldcf log
      WARN 2012-12-04 10:27:40,722 (Worker thread '0') - Service interruption reported for job 1343845636068 connection 'LISA-DEV': Error 500 from ingestion request; ingestion will be retried again later
      ERROR 2012-12-04 10:27:40,754 (Worker thread '0') - Exception tossed: Repeated service interruptions - failure processing document: Ingestion HTTP error code 500
      org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated service interruptions - failure processing document: Ingestion HTTP error code 500
      at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585)
      Caused by: org.apache.manifoldcf.core.interfaces.ManifoldCFException: Ingestion HTTP error code 500
      at org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:1386)
      WARN 2012-12-04 10:27:40,847 (Worker thread '24') - Service interruption reported for job 1343845636068 connection 'LISA-DEV': Job no longer active

      And here's the solr log if it helps:
      org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: XML parse error at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:215) at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74) at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129) at org.apache.solr.core.SolrCore.execute(SolrCore.java:1561) at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:442) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:263) at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1337) at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:484) at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:119) at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:524) at

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              dmorana David Morana
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: