Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
ManifoldCF 2.8.1
-
None
Description
During crawling, the Confluence connector in one installation is throwing the following exception:
java.lang.IllegalArgumentException: Unrecognized document identifier: 'att44634026' at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) WARN 2017-11-21 10:00:14,373 (Worker thread '111') - Exception: Unrecognized document identifier: 'att69240163' java.lang.IllegalArgumentException: Unrecognized document identifier: 'att69240163' at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) WARN 2017-11-21 10:00:14,379 (Worker thread '82') - Exception: Unrecognized document identifier: 'att56984899' java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56984899' at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) WARN 2017-11-21 10:00:14,386 (Worker thread '47') - Exception: Unrecognized document identifier: 'att56986313' java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56986313' at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012) at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) FATAL 2017-11-21 10:00:14,386 (Worker thread '132') - Error tossed: null java.lang.NullPointerException