Uploaded image for project: 'Apache NiFi'
  1. Apache NiFi
  2. NIFI-6348

Allow GenerateTableFetch to order on a column without a Max-Value column

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.10.0
    • Extensions
    • None

    Description

      Related to NIFI-6244, if no Max-Value Column is set for GenerateTableFetch, and Partition Size > 0 (meaning paging/partitioning is done), then no column has been specified for ordering the rows. When fetching multiple pages, this causes the ordering of each fetch to be arbitrary (i.e. different ordering for each fetch), which can result in missing and/or duplicate rows. In the case of SQL Server, the parser will actually throw an error when no ORDER BY clause is provided. NIFI-6244 aimed to fix that but it was not a correct solution so was reverted.

      This Jira proposes to add a "Custom ORDER BY Column" property, which will be used only if Max-Value Column is not set and Partition Size > 0. It is not required, but the documentation should explain that not using this field can result in the behavior above.

      Attachments

        Issue Links

          Activity

            People

              mattyb149 Matt Burgess
              mattyb149 Matt Burgess
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m