Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-3177

Expose snapshotTimestampMicros to Spark Read Options

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.13.0
    • Component/s: spark
    • Labels:

      Description

      If a spark application needs to read from the same table multiple times and that table has new records that may come in during the life of the application, you may get inconsistent scan results unless you persist the DataFrame. I'd like to expose snapshotTimestampMicros to the spark read options so I can set a timestamp before the first scan and use that for READ_AT_SNAPSHOT to keep all scans on the same table consistent throughout the run of the application. 

        Attachments

          Activity

            People

            • Assignee:
              kmccarthy Kevin J McCarthy
              Reporter:
              kmccarthy Kevin J McCarthy
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: