Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-14549 HiveServer2: Improve HiveServer2 JDBC/ODBC ResultSet performance - part2
  3. HIVE-14551

HiveServer2: Use vectorized data whenever available for writing final results

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.1.0
    • None
    • HiveServer2
    • None

    Description

      In ThriftJDBCBinarySerde, which we are using in FileSinkOperator to write final results, we buffer rows and store them into typed columns before writing a batch of rows to the result file. However, when vectorized rows batches are available from higher level operators, we should try to use them and avoid the extra penalty of converting from vector --> non-vector single row --> buffered thrift columns (equivalent to vector).

      Attachments

        Activity

          People

            Unassigned Unassigned
            vgumashta Vaibhav Gumashta
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated: