DataFu / DATAFU-71

Create IncrementalAvroStorage UDF for incrementally processing date partitioned data

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Do

    Description

      Data is sometimes stored in HDFS in a time-partitioned layout, e.g. /some/input/yyyy/mm/dd. You may want to process this data incrementally, writing output in a matching layout such as /some/output/yyyy/mm/dd. It would be useful to have a UDF that handles the incremental processing for you.
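
      As an illustration of the idea only, and not of what the attached patch does, the core of incremental processing can be sketched as: find the input days that have no matching output day yet. The sketch below is plain Python rather than a Pig UDF, and the function names and sample dates are hypothetical.

```python
import re
from datetime import date

# Matches a trailing yyyy/mm/dd directory, e.g. /some/input/2013/09/02
_DAY_DIR = re.compile(r"/(\d{4})/(\d{2})/(\d{2})/?$")


def partition_date(path):
    """Return the date encoded in a .../yyyy/mm/dd path, or None if the
    path does not end in such a directory. Raises ValueError if the
    digits do not form a real calendar date."""
    match = _DAY_DIR.search(path)
    if match is None:
        return None
    year, month, day = map(int, match.groups())
    return date(year, month, day)


def pending_partitions(input_paths, output_paths):
    """Return the input paths whose day has no corresponding output day,
    ordered from oldest to newest."""
    done = {partition_date(p) for p in output_paths}
    pending = {}
    for path in input_paths:
        day = partition_date(path)
        if day is not None and day not in done:
            pending[day] = path
    return [pending[day] for day in sorted(pending)]


# Hypothetical directory listings; a real job would list these from HDFS.
inputs = [
    "/some/input/2013/09/03",
    "/some/input/2013/09/01",
    "/some/input/2013/09/02",
]
outputs = ["/some/output/2013/09/01"]
todo = pending_partitions(inputs, outputs)
```

      With these sample listings, only the two days without output remain to be processed, so a rerun after a partial failure would pick up where the previous run stopped.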

      Attachments

        1. DATAFU-71.patch
          74 kB
          Matthew Hayes

          People

            Assignee: Matthew Hayes (mhayes)
            Reporter: Matthew Hayes (mhayes)
            Votes: 0
            Watchers: 1

            Dates

              Created:
              Updated:
              Resolved: