Apache NiFi / NIFI-4545

Improve Hive processors provenance transit URL

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5.0
    • Component/s: Extensions
    • Labels: None

    Description

      Hive-related processors report NiFi provenance events with transit URLs in the format 'jdbc:hive2://<host1>:<port1>,<host2>:<port2>/dbName'. This URL identifies the Hive environment, but it is not descriptive enough to derive the names of the tables that are read or written by the query which generated the provenance event.
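
To illustrate what the transit URL does and does not carry, here is a minimal Python sketch that splits a URL of the form quoted above into its host/port pairs and database name. The function name `parse_hive_transit_url` is hypothetical and is not part of NiFi. The sketch assumes the plain 'jdbc:hive2://hosts/dbName' shape and simply drops any ';' session parameters after the database name.

```python
def parse_hive_transit_url(url):
    """Split a 'jdbc:hive2://h1:p1,h2:p2/dbName' URL into (hosts, database).

    Hypothetical helper for illustration only; it is not NiFi code.
    """
    prefix = "jdbc:hive2://"
    if not url.startswith(prefix):
        raise ValueError("not a jdbc:hive2 URL: " + url)

    # Everything before the first '/' is the comma-separated host list.
    authority, _, rest = url[len(prefix):].partition("/")

    # Assumption: anything after ';' is a session parameter, not the db name.
    database = rest.split(";", 1)[0]

    hosts = []
    for entry in authority.split(","):
        host, _, port = entry.partition(":")
        hosts.append((host, int(port) if port else None))
    return hosts, database


hosts, database = parse_hive_transit_url(
    "jdbc:hive2://host1:10000,host2:10001/dbName"
)
```

The result contains only the hosts and the database name. No table name can be recovered from the URL, which is the gap this issue addresses.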

      That table information can only be obtained by parsing the query. This JIRA improves the following Hive-related processors so that they write additional 'query.input.tables' and 'query.output.tables' FlowFile attributes, populated by parsing Hive queries with the Hive parser.

      Target Processors:

      • PutHiveQL
      • SelectHiveQL
      • PutHiveStreaming: This processor already knows its table name, so no query parsing is needed.
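
The idea of deriving the two attributes from a query can be sketched as follows. This is a simplified illustration only: it uses naive regular expressions instead of the Hive parser that the issue describes, so it will miss or misread many valid HiveQL constructs (CTEs, comments, quoted identifiers, and so on). The function name `table_attributes`, the lower-casing of names, and the comma-joined value format are assumptions made for this example.

```python
import re

# Naive stand-ins for a real parser (assumption: good enough for simple queries).
# Output tables: targets of INSERT INTO / INSERT OVERWRITE / CREATE TABLE.
_OUTPUT_RE = re.compile(
    r"\b(?:INSERT\s+(?:INTO|OVERWRITE)\s+(?:TABLE\s+)?"
    r"|CREATE\s+TABLE\s+(?:IF\s+NOT\s+EXISTS\s+)?)([\w.]+)",
    re.IGNORECASE,
)
# Input tables: names that directly follow FROM or JOIN.
_INPUT_RE = re.compile(r"\b(?:FROM|JOIN)\s+([\w.]+)", re.IGNORECASE)


def table_attributes(query):
    """Return a dict of FlowFile-style attributes for the given query.

    Hypothetical helper; the attribute names come from the issue text,
    everything else is an assumption.
    """
    inputs = sorted({name.lower() for name in _INPUT_RE.findall(query)})
    outputs = sorted({name.lower() for name in _OUTPUT_RE.findall(query)})

    attributes = {}
    if inputs:
        attributes["query.input.tables"] = ",".join(inputs)
    if outputs:
        attributes["query.output.tables"] = ",".join(outputs)
    return attributes


attrs = table_attributes(
    "INSERT INTO TABLE sales.daily "
    "SELECT * FROM staging.orders o JOIN dim.customers c ON o.cid = c.id"
)
```

For the sample query, `attrs` maps 'query.input.tables' to the two source tables and 'query.output.tables' to the insert target. A parser-based implementation such as the one the issue proposes would handle the cases this regex sketch cannot.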

            People

              Assignee: Koji Kawamura (ijokarumawak)
              Reporter: Koji Kawamura (ijokarumawak)