  1. Hive
  2. HIVE-12427 HiveServer2: Improve HiveServer2 JDBC/ODBC ResultSet performance - part1
  3. HIVE-14901

HiveServer2: Use user supplied fetch size to determine #rows serialized in tasks

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • 2.1.0
    • None
    • Component/s: HiveServer2, JDBC, ODBC

    Description

      Currently, hive.server2.thrift.resultset.max.fetch.size alone decides the maximum number of rows that we write in tasks. Ideally, we should instead use the user-supplied value (which can be extracted from the request parameter of ThriftCLIService.FetchResults) to decide how many rows to serialize into a blob in the tasks. hive.server2.thrift.resultset.max.fetch.size should still act as an upper bound on that value, so that tasks and HS2 do not run out of memory.
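
      The bounding rule in the description can be sketched roughly as below. This is only an illustration, not the code from the attached patches: the class FetchSizeSketch, the method rowsPerBlob, and the cap of 10000 are hypothetical, and falling back to the cap when the client sends no positive value is an assumption rather than documented behaviour.

```java
// Hypothetical helper for illustration; not taken from the HIVE-14901 patches.
final class FetchSizeSketch {

    private FetchSizeSketch() {
    }

    /**
     * Picks how many rows to serialize into one blob.
     *
     * @param requestedRows fetch size supplied by the client (assumed to arrive as a long)
     * @param maxFetchSize  stand-in for hive.server2.thrift.resultset.max.fetch.size
     */
    static int rowsPerBlob(long requestedRows, int maxFetchSize) {
        if (maxFetchSize <= 0) {
            throw new IllegalArgumentException("maxFetchSize must be positive");
        }
        if (requestedRows <= 0) {
            // Assumption: with no usable client value, fall back to the configured cap.
            return maxFetchSize;
        }
        // Honour the client's value, but never exceed the configured upper bound.
        return (int) Math.min(requestedRows, (long) maxFetchSize);
    }

    public static void main(String[] args) {
        int cap = 10_000; // illustrative value only
        System.out.println(rowsPerBlob(500L, cap));    // client asks for less than the cap
        System.out.println(rowsPerBlob(50_000L, cap)); // client asks for more than the cap
        System.out.println(rowsPerBlob(0L, cap));      // client supplies no usable value
    }
}
```

      With a cap of 10000, a request for 500 rows is honoured as is, while a request for 50000 rows is cut down to 10000, which is what keeps the memory use in tasks and HS2 bounded.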

      Attachments

        1. HIVE-14901.patch
          3 kB
          Norris Lee
        2. HIVE-14901.1.patch
          3 kB
          Norris Lee
        3. HIVE-14901.2.patch
          3 kB
          Norris Lee
        4. HIVE-14901.3.patch
          5 kB
          Norris Lee
        5. HIVE-14901.4.patch
          10 kB
          Norris Lee
        6. HIVE-14901.5.patch
          10 kB
          Norris Lee
        7. HIVE-14901.6.patch
          13 kB
          Norris Lee
        8. HIVE-14901.7.patch
          13 kB
          Norris Lee
        9. HIVE-14901.8.patch
          13 kB
          Norris Lee
        10. HIVE-14901.9.patch
          13 kB
          Norris Lee

            People

              Assignee: Norris Lee (norrisl)
              Reporter: Vaibhav Gumashta (vgumashta)
              Votes: 0
              Watchers: 8
