Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-3082

[Phase 1] Unify MOR table access across Spark, Hive

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Blocker
    • Resolution: Duplicate
    • None
    • 0.11.0
    • None
    • 10

    Description

      This is Phase 1 of what outlined in HUDI-3081

       

      The goal is 

      • Unify Hive’s RecordReaders (`RealtimeCompactedRecordReader`, RealtimeUnmergedRecordReader)
        • These Readers should only differ in the way they handle the payload, everything else should remain constant
      • Abstract w/in common component (name TBD)
        • Listing current file-slices at the requested instant (handling the timeline)
        • Creating Record Iterator for the provided file-slice

      Attachments

        Issue Links

          Activity

            People

              alexey.kudinkin Alexey Kudinkin
              alexey.kudinkin Alexey Kudinkin
              Ethan Guo (this is the old account; please use "yihua"), Shiyan Xu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: