Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-1472

Confluence connector doesn't call activities.noDocument() properly

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • ManifoldCF 2.8.1
    • ManifoldCF 2.9
    • Confluence connector
    • None

    Description

      During crawling, the Confluence connector in one installation is throwing the following exception:

      java.lang.IllegalArgumentException: Unrecognized document identifier: 'att44634026'
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
              at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
      WARN 2017-11-21 10:00:14,373 (Worker thread '111') - Exception: Unrecognized document identifier: 'att69240163'
      java.lang.IllegalArgumentException: Unrecognized document identifier: 'att69240163'
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
              at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
      WARN 2017-11-21 10:00:14,379 (Worker thread '82') - Exception: Unrecognized document identifier: 'att56984899'
      java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56984899'
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
              at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
      WARN 2017-11-21 10:00:14,386 (Worker thread '47') - Exception: Unrecognized document identifier: 'att56986313'
      java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56986313'
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
              at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
              at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
              at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
      FATAL 2017-11-21 10:00:14,386 (Worker thread '132') - Error tossed: null
      java.lang.NullPointerException
      

      Attachments

        1. CONNECTORS-1472.patch
          2 kB
          Karl Wright
        2. CONNECTORS-1472-2.patch
          0.9 kB
          Karl Wright

        Activity

          People

            kwright@metacarta.com Karl Wright
            kwright@metacarta.com Karl Wright
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: