Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-163

Updates to HoodieRecord are missing during MergeHandle with ExternalSpillableMap

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Common Core
    • None

    Description

      Copy of https://github.com/apache/incubator-hudi/issues/773

      MergeHandle uses ExternalSpillableMap for storing all new inserts to a file. Today during merge handle if number of records which we are adding to a file are a lot then ExternalSpillableMap may end up writing it to disk and future updates done to in memory copy of HoodieRecord will not be reflected into on disk copy. This creates duplicate key issue if we are using HBaseIndex. Creating this issue to address this.

      https://github.com/apache/incubator-hudi/blob/master/hoodie-client/src/main/java/com/uber/hoodie/io/HoodieMergeHandle.java#L211 Here we are updating HoodieRecord after adding to the map.

       

      Attachments

        Issue Links

          Activity

            People

              ovjforu omkar vinit joshi
              vbalaji Balaji Varadarajan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: