Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-375

LazySimpleSerDe to directly serialize (append) int/long/byte/short etc to UTF-8 buffer

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.3.0
    • 0.4.0
    • None
    • None
    • Reviewed

    Description

      LazySimpleSerDe currently serialize all data into a StringBuilder, and then convert it to String and then Text.
      Even if the data is of type int/long/byte/short, we still do that unnecessary conversion.

      We should directly serialize/append int/long/byte/short to a UTF-8 buffer.

      This is a very simple change, but it is expected to save 2-3% of the time of a typical mapper (on a group-by query with some int/long columns), and this blocks HIVE-266.

      Attachments

        1. HIVE-375.1.patch
          14 kB
          Zheng Shao
        2. HIVE-375.2.patch
          16 kB
          Zheng Shao

        Issue Links

          Activity

            People

              zshao Zheng Shao
              zshao Zheng Shao
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: