Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.9.0
Fix Version/s: None
Component/s: None
Description
I am trying to integrate Apache NiFi with Apache Atlas using the ReportLineageToAtlas reporting task, so that all my datasets and lineage are visible in Atlas. I am using NiFi 1.9 with the 'Complete Path' lineage strategy. My flow is a small test flow that reads a text file from HDFS and pushes the records to Kafka as well as to HDFS. I am seeing the following exception:
2019-11-27 07:39:47,637 ERROR [Timer-Driven Process Thread-20] o.a.n.a.reporting.ReportLineageToAtlas ReportLineageToAtlas[id=92385b7e-016e-1000-0000-000003b7ee8c] Skipping failed analyzing event ProvenanceEventRecord [eventId=19361944, eventType=DROP, eventTime=Wed Nov 27 07:32:41 UTC 2019, uuid=751b892e-c3e1-4c0b-a5f3-cba77d704470, fileSize=5, componentId=97663a52-cacf-17ac-b07a-2611465d790b, transitUri=null, sourceSystemFlowFileIdentifier=null, parentUuids=[], alternateIdentifierUri=null] due to java.lang.NullPointerException.: java.lang.NullPointerException
java.lang.NullPointerException: null
at org.apache.nifi.atlas.provenance.lineage.CompleteFlowPathLineage.analyzeLineageTree(CompleteFlowPathLineage.java:198)
at org.apache.nifi.atlas.provenance.lineage.CompleteFlowPathLineage.processEvent(CompleteFlowPathLineage.java:68)
at org.apache.nifi.atlas.reporting.ReportLineageToAtlas.lambda$consumeNiFiProvenanceEvents$6(ReportLineageToAtlas.java:716)
at org.apache.nifi.reporting.util.provenance.ProvenanceEventConsumer.consumeEvents(ProvenanceEventConsumer.java:220)
at org.apache.nifi.atlas.reporting.ReportLineageToAtlas.consumeNiFiProvenanceEvents(ReportLineageToAtlas.java:713)
at org.apache.nifi.atlas.reporting.ReportLineageToAtlas.onTrigger(ReportLineageToAtlas.java:665)
at org.apache.nifi.controller.tasks.ReportingTaskWrapper.run(ReportingTaskWrapper.java:44)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Attached are screenshots of the NiFi flow and the ReportLineageToAtlas configuration, as well as the NiFi log file.
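The trace shows the NullPointerException being thrown inside analyzeLineageTree while a DROP event is processed. Which reference is null at line 198 is not established in this report. Purely to illustrate the kind of defensive handling that avoids this class of failure, here is a minimal sketch. Every name in it (LineageGuardSketch, Event, resolvePath, analyze, the lookup map) is hypothetical and not taken from the NiFi source; it assumes, without confirmation, that the failure comes from a lookup that returns nothing for the event's component.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class LineageGuardSketch {

    // Hypothetical stand-in for a provenance event; NOT NiFi's ProvenanceEventRecord.
    static final class Event {
        final long id;
        final String type;
        final String componentId;

        Event(long id, String type, String componentId) {
            this.id = id;
            this.type = type;
            this.componentId = componentId;
        }
    }

    // Hypothetical lookup: wraps a possibly missing map entry in an Optional
    // so the caller has to handle the "not found" case explicitly.
    static Optional<String> resolvePath(Map<String, String> pathsByComponent, Event event) {
        return Optional.ofNullable(pathsByComponent.get(event.componentId));
    }

    // Returns a description of the analysis result instead of dereferencing
    // a null path when the component is unknown.
    static String analyze(Map<String, String> pathsByComponent, Event event) {
        return resolvePath(pathsByComponent, event)
                .map(path -> "event " + event.id + " -> " + path)
                .orElse("event " + event.id + " skipped: no path for component " + event.componentId);
    }

    public static void main(String[] args) {
        Map<String, String> paths = new HashMap<>();
        paths.put("known-component", "hdfs-to-kafka");

        // Known component: the path is resolved.
        System.out.println(analyze(paths, new Event(1L, "SEND", "known-component")));
        // Unknown component on a DROP event: reported and skipped, no NPE.
        System.out.println(analyze(paths, new Event(19361944L, "DROP", "97663a52-cacf-17ac-b07a-2611465d790b")));
    }
}
```

With a guard of this shape, an event whose component cannot be resolved is reported with its id and component and then skipped, rather than failing with a bare NullPointerException.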