Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-5755

Allow PutParquet prosessor to dynamically set config properties for parquet lib

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 1.7.1
    • None
    • Core Framework
    • None

    Description

      PutParquet (NiFi 1.7.1), will fail to write parquet if incoming records contain arrays with null elements: 
      Schema snippet example:

      {              "name": "myfield",              "type" : ["null",

      { "type" : "array", "items" : ["null","string"], "default": null }

      ], "default": null
       },
       
      And a corresponding data example:
      "myfield" : [ null, "value1" ],
      "myfield" : [ null, "value2" ],
      "myfield" : [ "value3", null, "value4" ],
       
      Avro does not seem to have a problem with this, but Put parquet fails with the following error: "Array contains a null element at X".
       
      There is a parquet config to allow this: 
      parquet.avro.write-old-list-structure=false
       
      If the processor would have a box for toggling this value, or a text box to pass configs in general would be great!

      Attachments

        1. parq2.png
          55 kB
          Ken Tore Tallakstad

        Activity

          People

            bbende Bryan Bende
            kentore82 Ken Tore Tallakstad
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: