Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-11167

Add Excel Record Reader

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 2.0.0-M1, 1.23.0
    • Extensions
    • None

    Description

      A new Excel Record Reader should be implemented to support reading XSLX spreadsheet rows as NiFi Records. This Reader will enable integration with various record-oriented components, obviating the need for the narrowly focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader should not support the legacy binary XLS format.

      The ExcelReader should use a library that supports reading from a stream of rows to avoid consuming large amounts of heap memory during processing.

      The ExcelReader should support configurable properties to read selected sheets. With Excel supporting typed field values, some amount of field type mapping will be required. Additional input filtering properties should not be implemented as existing Processors like QueryRecord support a wide variety of filtering and projection use cases.

      Attachments

        1. Test Sheet-formula.xlsx
          11 kB
          Philipp Korniets
        2. Test ExcelReader.xlsx
          11 kB
          Philipp Korniets
        3. screenshot-1.png
          47 kB
          Philipp Korniets
        4. resulting.csv
          0.6 kB
          Daniel Stieglitz
        5. QueryRecord_configuration.png
          34 kB
          Daniel Stieglitz
        6. image-2023-11-29-15-51-08-386.png
          43 kB
          Philipp Korniets
        7. image-2023-11-28-18-22-07-446.png
          33 kB
          Philipp Korniets
        8. ExcelReaderConfiguration.png
          26 kB
          Daniel Stieglitz
        9. CSVRecordSetWriter_configuration.png
          40 kB
          Daniel Stieglitz

        Issue Links

          Activity

            People

              dstiegli1 Daniel Stieglitz
              exceptionfactory David Handermann
              Votes:
              1 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 10h 10m
                  10h 10m