Uploaded image for project: 'Tajo (Retired)'
  1. Tajo (Retired)
  2. TAJO-1555

Cleanup duplicated code of python functions

    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Function/UDF
    • None

    Description

      I'm working on supporting Python UDF at TAJO-1344. This is still a prototype, and has some problems. One of the problems is related to serialization/deserialization protocol. For easy implementation, I simply used CSV format to serialize/deserialize tuples. To do so, I copied some bunch of codes from the tajo-storage package. This will incur a maintenance issue in addition to the problem of low performance.

      To cleanup the duplicated codes, I think that we should use a well-known serialization/deserialization protocol such as protocol buffers.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jihoonson Jihoon Son
              Votes:
              1 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: