  1. Apache Hudi
  2. HUDI-8724

Bug fixes - Phase 1

Details

    • Type: Improvement
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • None
    • 1.0.1
    • None

    Sub-Tasks

      1. Difference between HoodieData.map and mapPartitions (Sub-task; Status: Open; Assignee: Unassigned)
      2. Fix ORC tests on Spark 3.5 (Sub-task; Status: Open; Assignee: Jonathan Vexler)
      3. Fix JSON Reader tests for Spark > 3.3 (Sub-task; Status: Open; Assignee: Unassigned)
      4. Incorrect partition pruning when TimestampBasedKeyGenerator is used in partition column (Sub-task; Status: Open; Assignee: Danny Chen)
      5. Incorrect type casting while reading HUDI table created with CustomKeyGenerator and unixtimestamp partitioning field (Sub-task; Status: Open; Assignee: Jonathan Vexler; Progress: 0%; Original Estimate: 6h; Remaining Estimate: 6h)
      6. Support YYYY-MM-DD partition format with hive (Sub-task; Status: Open; Assignee: Lokesh Jain)
      7. Support YYYY/MM/DD partition format with hive (Sub-task; Status: Open; Assignee: Lokesh Jain)
      8. Different parquet reader config on list-typed fields is used to read parquet file generated by clustering (Sub-task; Status: Open; Assignee: Jonathan Vexler)
      9. Writer failing with exception when precision is different for two decimal fields with same name (Sub-task; Status: Open; Assignee: Ethan Guo (this is the old account; please use "yihua"))
      10. Using CustomKeyGenerator fails w/ SparkHoodieTableFileIndex (Sub-task; Status: In Progress; Assignee: Jonathan Vexler; Progress: 0%; Original Estimate: 4h; Remaining Estimate: 4h)
      11. Use snapshot query as default in DefaultSource (Sub-task; Status: Patch Available; Assignee: Jonathan Vexler)
      12. Fix log file marker creation in file group initialization in metadata table writer (Sub-task; Status: In Progress; Assignee: Y Ethan Guo; Progress: 0%; Original Estimate: 2h; Remaining Estimate: 2h)
      13. Fix default value of bootstrap table config (Sub-task; Status: In Progress; Assignee: Y Ethan Guo; Progress: 50%; Original Estimate: Not Specified; Time Spent: 2h; Remaining Estimate: 2h)
      14. Revisit commitsMetadata fetching from timeline history in MergeOnReadIncrementalRelation (Sub-task; Status: Open; Assignee: Unassigned)
      15. Reduce spurious logs from Spark SQL write (Sub-task; Status: Open; Assignee: Unassigned; Progress: 0%; Original Estimate: 4h; Remaining Estimate: 4h)
      16. Proper cleanup of BitCaskDiskMap on storage (Sub-task; Status: Open; Assignee: Unassigned; Progress: 0%; Original Estimate: 10h; Remaining Estimate: 10h)
      17. Follow up with EMR to fix out-of-the-box experience for Hudi streamer continuous mode (Sub-task; Status: Open; Assignee: Unassigned)

        People

          Assignee: Unassigned
          Reporter: Y Ethan Guo (yihua)
          Votes: 0
          Watchers: 1

          Dates

            Created:
            Updated:

            Time Tracking

              Estimated: 26h
              Remaining: 28h
              Logged: 2h