Hadoop Map/Reduce
MAPREDUCE-2046

A CombineFileInputSplit cannot be less than a dfs block

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.22.0
    • Component/s: None
    • Labels: None

      Description

      I ran into this while testing some hive features.

      Whether we use HiveInputFormat or CombineHiveInputFormat, a split cannot be smaller than a dfs block.
      This is a problem if we want to increase the block size for older data to reduce memory consumption on the
      name node.

      It would be useful if the input split was independent of the dfs block size.

      Attachments

      1. patch-2046-ydist.txt (6 kB) - Amareshwari Sriramadasu
      2. combineFileInputFormatMaxSize2.txt (7 kB) - dhruba borthakur
      3. combineFileInputFormatMaxSize.txt (7 kB) - dhruba borthakur

        Activity

        Hudson added a comment -

        Integrated in Hadoop-Mapreduce-trunk-Commit #523 (See https://hudson.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/523/)

        Amareshwari Sriramadasu added a comment -

        Patch for Yahoo! distribution on top of MAPREDUCE-2021.

        Amareshwari Sriramadasu added a comment -

        +1
        I just committed this. Thanks, Dhruba!

        Joydeep Sen Sarma added a comment -

        +1 from my side.

        dhruba borthakur added a comment -

        Joydeep, Amareshwari: can you please review the patch again? Thanks.

        dhruba borthakur added a comment -

        If the remainder is between max and 2*max, then instead of creating splits of size max and left-max, we create splits of size left/2 and left/2. This is a heuristic to avoid creating really small splits.

        dhruba borthakur added a comment -

        Have to address Joydeep's comments.

        Joydeep Sen Sarma added a comment -

        One concern is whether we will have a lot of small 'runts' broken out, which will lead to inefficient IO (when those are combined into other splits). Suggestion:

        When carving up the block, if the remainder (R) is between max and 2*max, then instead of creating splits of size <max, R-max>, create splits of size <R/2, R/2>.

        Also, when packing blocks within a node/rack, it would be better to sort them by size first (ascending). I think it will lead to better packing; what do you think?
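
        A minimal sketch of that carving rule (the class and names below, such as SplitCarver and nextSplitLength, are illustrative and not taken from the patch):

          // Carve a block into pieces of at most maxSize, halving any remainder
          // that falls between maxSize and 2*maxSize.
          public class SplitCarver {

            // Length of the next piece to carve; assumes maxSize > 0.
            static long nextSplitLength(long left, long maxSize) {
              if (left > maxSize && left < 2 * maxSize) {
                // remainder between max and 2*max: halve it so we emit two
                // medium pieces instead of one max-sized piece plus a runt
                return left / 2;
              }
              return Math.min(maxSize, left);
            }

            public static void main(String[] args) {
              long blockLen = 160L * 1024 * 1024; // example: a 160 MB block
              long maxSize = 64L * 1024 * 1024;   // example: a 64 MB cap
              for (long left = blockLen, off = 0; left > 0; ) {
                long len = nextSplitLength(left, maxSize);
                System.out.println("piece at offset " + off + ", length " + len);
                off += len;
                left -= len;
              }
            }
          }

        With a 64 MB cap, a 160 MB block comes out as 64 MB + 48 MB + 48 MB rather than 64 MB + 64 MB + 32 MB, so no piece ends up smaller than half the cap unless the whole block already is.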

        Amareshwari Sriramadasu added a comment -

        +1. Patch looks good. Can you update the test results, as Hudson is not responding?

        dhruba borthakur added a comment -

        Right after calling getBlockLocations, we look at the block locations returned by HDFS and then split them into location entries, where each entry has a maximum length specified by maxSize.
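
        As an illustration of that flow (not the actual patch code), something along these lines breaks each block location into capped entries, using the public FileSystem.getFileBlockLocations call as a stand-in for the getBlockLocations lookup; the Chunk and BlockChunker names are hypothetical:

          import java.io.IOException;
          import java.util.ArrayList;
          import java.util.List;

          import org.apache.hadoop.fs.BlockLocation;
          import org.apache.hadoop.fs.FileStatus;
          import org.apache.hadoop.fs.FileSystem;
          import org.apache.hadoop.fs.Path;

          // One capped slice of a dfs block: offset, length, and the hosts holding it.
          class Chunk {
            final long offset;
            final long length;
            final String[] hosts;
            Chunk(long offset, long length, String[] hosts) {
              this.offset = offset; this.length = length; this.hosts = hosts;
            }
          }

          class BlockChunker {
            // Break every block location of a file into entries no longer than maxSize.
            static List<Chunk> chunk(FileSystem fs, Path file, long maxSize) throws IOException {
              FileStatus stat = fs.getFileStatus(file);
              BlockLocation[] blocks = fs.getFileBlockLocations(stat, 0, stat.getLen());
              List<Chunk> chunks = new ArrayList<Chunk>();
              for (BlockLocation blk : blocks) {
                long offset = blk.getOffset();
                long left = blk.getLength();
                while (left > 0) {
                  long len = (maxSize > 0) ? Math.min(maxSize, left) : left;
                  chunks.add(new Chunk(offset, len, blk.getHosts()));
                  offset += len;
                  left -= len;
                }
              }
              return chunks;
            }
          }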

        Namit Jain added a comment -

        Dhruba, I have also seen the same problem with HiveInputFormat, which I think (not sure) does not go through CombineFileInputSplit.
        Are we missing something else?

        Owen O'Malley added a comment -

        Ah. I don't know that one. If you want to reopen the bug for that specific input format, that would make sense.

        dhruba borthakur added a comment -

        I think that if you use org.apache.hadoop.mapreduce.lib.input.CombineFileInputFormat and set the maximum split size (via setMaxSplitSize) to be smaller than a dfs block size, the splits produced are still at least as big as a dfs block.
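
        For reference, the configuration being described looks roughly like the sketch below; it assumes the new-API CombineFileInputFormat, whose setMaxSplitSize method is protected and therefore has to be called from a subclass. The class name and the 64 MB value are examples, not from this issue.

          import java.io.IOException;

          import org.apache.hadoop.io.LongWritable;
          import org.apache.hadoop.io.Text;
          import org.apache.hadoop.mapreduce.InputSplit;
          import org.apache.hadoop.mapreduce.RecordReader;
          import org.apache.hadoop.mapreduce.TaskAttemptContext;
          import org.apache.hadoop.mapreduce.lib.input.CombineFileInputFormat;

          public class SmallSplitCombineFormat extends CombineFileInputFormat<LongWritable, Text> {

            public SmallSplitCombineFormat() {
              // Ask for splits of at most 64 MB, below a larger dfs block size.
              setMaxSplitSize(64L * 1024 * 1024);
            }

            @Override
            public RecordReader<LongWritable, Text> createRecordReader(
                InputSplit split, TaskAttemptContext context) throws IOException {
              // A real job would return a CombineFileRecordReader here; it is
              // omitted to keep the focus on the split-size cap.
              throw new UnsupportedOperationException("record reader omitted in this sketch");
            }
          }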

        Owen O'Malley added a comment -

        This isn't true. InputSplits can be arbitrarily sized by the InputFormat. With mapred.TextInputFormat, if you set the number of maps very high, you will generate a large number of maps. In the new mapreduce.lib.input.TextInputFormat, there are knobs that set the minimum and maximum split size.
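
        A minimal sketch of those knobs, using the static helpers on the new-API FileInputFormat; the class name SplitSizeKnobs and the 32 MB / 128 MB values are examples, not from this issue:

          import java.io.IOException;

          import org.apache.hadoop.conf.Configuration;
          import org.apache.hadoop.mapreduce.Job;
          import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
          import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;

          public class SplitSizeKnobs {
            // Configure a job whose TextInputFormat splits are bounded between 32 MB and 128 MB.
            public static Job configure(Configuration conf) throws IOException {
              Job job = new Job(conf);
              job.setInputFormatClass(TextInputFormat.class);
              FileInputFormat.setMinInputSplitSize(job, 32L * 1024 * 1024);
              FileInputFormat.setMaxInputSplitSize(job, 128L * 1024 * 1024);
              return job;
            }
          }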


          People

          • Assignee: dhruba borthakur
          • Reporter: Namit Jain
          • Votes: 0
          • Watchers: 11
