Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-610

MOR table Impala read support

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Impala uses the JAVA based module call "frontend" to list all the files to scan and let the C++ based "backend" to do all the file scanning. 

      Merge Avro and Parquet could be difficult because it might need to have a custom merging logic like RealtimeCompactedRecordReader to be implemented in backend using C++, but I think it will be doable to have something like RealtimeUnmergedRecordReader which only need some changes in the frontend. 

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              garyli1019 Yanjia Gary Li
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: