Hadoop Common
  1. Hadoop Common
  2. HADOOP-3684

The data_join should allow the user to implement a customer cloning function

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.19.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Allowed user to overwrite clone function in a subclass of TaggedMapOutput class.

      Description

      Currently, the framework uses serialization/deserialization to clone the values passed to the resuce function.
      This amounts to a very heavy weight deep copy of the value objects.
      That is way too expensive. Although that may be a generic way to work for all possible value classes, thus good as a default way,
      the framework should allow the user to implemet an application specific yet efficient cloning function.

      1. H-3684.txt
        2 kB
        Runping Qi

        Activity

        Runping Qi created issue -
        Runping Qi made changes -
        Field Original Value New Value
        Attachment H-3684.txt [ 12385153 ]
        Runping Qi made changes -
        Release Note make it possible for the user to overwrite clone function in a subclass of TaggedMapOutput class
        Fix Version/s 0.19.0 [ 12313211 ]
        Status Open [ 1 ] Patch Available [ 10002 ]
        Runping Qi made changes -
        Status Patch Available [ 10002 ] Open [ 1 ]
        Runping Qi made changes -
        Attachment H-3684.txt [ 12385153 ]
        Runping Qi made changes -
        Attachment H-3684.txt [ 12385161 ]
        Runping Qi made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Chris Douglas made changes -
        Hadoop Flags [Reviewed]
        Assignee Runping Qi [ runping ]
        Description
        Currently, the framework uses serialization/deserialization to clone the values passed to the resuce function.
        This amounts to a very heavy weight deep copy of the value objects.
        That is way too expensive. Although that may be a generic way to work for all possible value classes, thus good as a default way,
        the framework should allow the user to implemet an application specific yet efficient cloning function.
         
        Currently, the framework uses serialization/deserialization to clone the values passed to the resuce function.
        This amounts to a very heavy weight deep copy of the value objects.
        That is way too expensive. Although that may be a generic way to work for all possible value classes, thus good as a default way,
        the framework should allow the user to implemet an application specific yet efficient cloning function.
         
        Chris Douglas made changes -
        Issue Type Bug [ 1 ] Improvement [ 4 ]
        Chris Douglas made changes -
        Resolution Fixed [ 1 ]
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Robert Chansler made changes -
        Release Note make it possible for the user to overwrite clone function in a subclass of TaggedMapOutput class Allowed user to overwrite clone function in a subclass of TaggedMapOutput class.
        Nigel Daley made changes -
        Status Resolved [ 5 ] Closed [ 6 ]
        Owen O'Malley made changes -
        Component/s mapred [ 12310690 ]

          People

          • Assignee:
            Runping Qi
            Reporter:
            Runping Qi
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development