Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-2271

Follow-up items for timeline-server-based marker files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Follow-up items after HUDI-1138:

      • Implement upgrade/downgrade, rollback regarding markers
        • Rollback should be based on the markers in the file system, with marker type indicated by MARKERS.type, or assumed to be "direct", if no MARKERS.type
        • New write follows the `hoodie.write.markers.type` config to create markers
      • Verify multiple concurrent writers with timeline-server-based markers
      • Let's ensure the parallelism passed for local engine context is reasonable
      • Fix `Task not serializable` due to new APIs in FSUtils for recursive, level by level listing (`java.io.NotSerializableException: org.apache.hudi.common.fs.FSUtils$$Lambda$4224/1845791682`)
      • Optimize fs.exists(dirPath) call in marker creation in DirectMarkerFiles
      • Optimize future timeouts due to I/O failure in the marker handler
      • Test new marker strategy for all engines (spark, flink and java) w/o any additional changes

      Attachments

        Activity

          People

            guoyihua Ethan Guo
            guoyihua Ethan Guo
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: