[HIVE-11073] ORC FileDump utility ignores errors when writing output - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.2.0
Fix Version/s: 1.3.0, 2.0.0
Component/s: Hive
Labels:
- cli
- orc

Release Note:
orcfiledump exits if errors are detected when writing to stdout.
Flags:

Patch

Description

The Hive command line provides the --orcfiledump utility for dumping data contained within ORC files, specifically when using the -d option. Generally, it is useful to be able to pipe the data extracted into other commands and utilities to transform and control the data so that it is more manageable by the CLI user. A classic example is less.

When such command pipelines are currently constructed, the underlying implementation in org.apache.hadoop.hive.ql.io.orc.FileDump#printJsonData is oblivious to errors occurring when writing to its output stream. Such errors are common place when a user issues Ctrl+C to kill the leaf process. In this event the leaf process terminates immediately but the Hive CLI process continues to execute until the full contents of the ORC file has been read.

By making FileDump considerate of output stream errors the process will terminate as soon as the destination process exits (i.e. when the user kills less) and control will be returned to the user as expected.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-11073.1.patch
22/Jun/15 16:59
5 kB
Elliot West

Activity

People

Assignee:: Elliot West

Reporter:: Elliot West

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 22/Jun/15 16:56

Updated:: 16/Feb/16 23:52

Resolved:: 27/Jun/15 22:03