Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-199

Fix inconsistency between File creation and HBase Index mapping when spark DAG is re-computed

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Major
    • Resolution: Unresolved
    • None
    • 0.5.0
    • index
    • None

    Description

      • When WriteStatus RDD lineage is re-triggered, index is not updated with new File ID created.
      • This is happening because updateLocation() is returning back input WriteStatus RDD, which does not have the index update operation chained in it.
      • While addressing this inconsistency issue, we also need to cache the WriteStatus RDD before updating index, so that there are less chances of the RDD being re-triggered.

      Attachments

        Activity

          People

            Venkatesh Rudraraju Venkatesh Rudraraju
            Venkatesh Rudraraju Venkatesh Rudraraju
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: