Hive / HIVE-8595

Slow IO when stdout directed to NAS with large blocksize

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 0.13.1
    • Fix Version/s: None
    • Component/s: Clients
    • Labels: None
    • Environment: nfs4 rsize=1048576,wsize=1048576

    Description

      IO is very slow when a SQL command file is executed with the following command line and stdout is redirected to a file on an nfs4-mounted NAS with a large block size (rsize=1048576, wsize=1048576):

      hive -f sqlscript.sql 2>results.log >results.tab

      Workaround (thousands of times faster):

      hive -f sqlscript.sql 2>results.log | cat >results.tab

      For instance, a command that I ran without the cat workaround finished executing 10 hours ago and is still writing its output, which after those 10 hours is in the 180 GB range.
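
      Because the cat step is easy to forget, the workaround could be wrapped in a small shell function. The following is only a sketch: run_via_cat is a hypothetical helper name, not part of Hive, and it does nothing beyond reproducing the pipeline shown in the workaround. The report does not identify why the direct redirect is slow, so the sketch makes no attempt to address the cause.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hive): run a command with its stdout
# piped through cat, as in the workaround, instead of redirected directly.
# Usage: run_via_cat OUTPUT_FILE LOG_FILE COMMAND [ARGS...]
run_via_cat() {
    out_file=$1
    log_file=$2
    shift 2
    # stderr goes straight to the log file; stdout goes through the pipe,
    # so cat, not the command itself, performs the writes to the target.
    "$@" 2>"$log_file" | cat >"$out_file"
}

# Example invocation with the file names from the report:
# run_via_cat results.tab results.log hive -f sqlscript.sql
```

      Note that in plain sh the exit status of a pipeline is that of its last command, so this function returns the status of cat rather than of hive; check results.log to see whether the script itself failed.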

          People

            Assignee: Unassigned
            Reporter: Kevin English (kenglish)
            Votes: 0
            Watchers: 1