[HADOOP-1722] Make streaming to handle non-utf8 byte array - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.0.2, 0.21.0
Component/s: None
Labels:
None

Hadoop Flags:

Reviewed
Release Note:
Streaming allows binary (or other non-UTF8) streams.

Description

Right now, the streaming framework expects the output sof the steam process (mapper or reducer) are line
oriented UTF-8 text. This limit makes it impossible to use those programs whose outputs may be non-UTF-8
(international encoding, or maybe even binary data). Streaming can overcome this limit by introducing a simple
encoding protocol. For example, it can allow the mapper/reducer to hexencode its keys/values,
the framework decodes them in the Java side.
This way, as long as the mapper/reducer executables follow this encoding protocol,
they can output arabitary bytearray and the streaming framework can handle them.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HADOOP-1722-v0.20.1.patch
15/Oct/09 09:14
153 kB
Matthias Lehmann
HADOOP-1722-branch-0.19.patch
04/Mar/09 17:58
152 kB
Klaas Bosteels
HADOOP-1722-branch-0.18.patch
13/Feb/09 10:19
152 kB
Klaas Bosteels
HADOOP-1722-v6.patch
12/Feb/09 18:59
153 kB
Klaas Bosteels
HADOOP-1722-v5.patch
06/Feb/09 14:32
153 kB
Klaas Bosteels
HADOOP-1722-v4.patch
04/Feb/09 16:15
121 kB
Klaas Bosteels
HADOOP-1722-v4.patch
28/Jan/09 22:52
119 kB
Klaas Bosteels
HADOOP-1722-v3.patch
28/Jan/09 13:24
119 kB
Klaas Bosteels
HADOOP-1722-v2.patch
27/Jan/09 17:47
119 kB
Klaas Bosteels
HADOOP-1722.patch
26/Jan/09 17:24
114 kB
Klaas Bosteels

Issue Links

is related to

HADOOP-6901 Parsing large compressed files with HADOOP-1722 spawns multiple mappers per file

Resolved

relates to

MAPREDUCE-606 Implement a binary input/output format for Streaming

Resolved

HIVE-708 Add TypedBytes SerDe for transform

Closed

MAPREDUCE-5018 Support raw binary data with Hadoop streaming

Patch Available

Activity

People

Assignee:: Klaas Bosteels

Reporter:: Runping Qi

Votes:: 0 Vote for this issue

Watchers:: 15 Start watching this issue

Dates

Created:: 16/Aug/07 14:33

Updated:: 21/Feb/13 17:09

Resolved:: 13/Feb/09 04:13