Issue Details (XML | Word | Printable)

Key: HADOOP-3350
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Major Major
Assignee: Chris Douglas
Reporter: Chris Douglas
Votes: 0
Watchers: 1
Operations

If you were logged in you would be able to see more operations.
Hadoop Common

distcp should permit users to limit the number of maps

Created: 06/May/08 02:56 AM   Updated: 22/Aug/08 07:50 PM
Component/s: None
Affects Version/s: None
Fix Version/s: 0.18.0

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works 3350-0.patch 2008-05-06 02:59 AM Chris Douglas 2 kB
Text File Licensed for inclusion in ASF works 3350-1.patch 2008-05-09 09:21 PM Chris Douglas 6 kB

Hadoop Flags: Reviewed
Resolution Date: 15/May/08 08:47 PM


 Description  « Hide
distcp provides no user-configurable setting to affect the number of maps run during a copy. For large clusters, the current calculation- min(#bytes / 256MB, #nodes * 20)- can be overly aggressive.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Chris Douglas added a comment - 06/May/08 02:59 AM
This retains the existing behavior by default, but observes a "distcp.max.map.tasks" parameter.

Hadoop QA added a comment - 06/May/08 06:32 AM
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12381473/3350-0.patch
against trunk revision 653638.

+1 @author. The patch does not contain any @author tags.

-1 tests included. The patch doesn't appear to include any new or modified tests.
Please justify why no tests are needed for this patch.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 release audit. The applied patch does not increase the total number of release audit warnings.

+1 core tests. The patch passed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2405/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2405/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2405/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2405/console

This message is automatically generated.


Owen O'Malley added a comment - 07/May/08 10:26 PM
You seem to be missing testcase and documentation.

Chris Douglas added a comment - 09/May/08 09:21 PM
Patch adds and documents a -m option to set the maximum number of maps, a distcp.bytes.per.map property governing the obvious, and a test case.

Hadoop QA added a comment - 09/May/08 11:59 PM
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12381797/3350-1.patch
against trunk revision 654941.

+1 @author. The patch does not contain any @author tags.

+1 tests included. The patch appears to include 3 new or modified tests.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

+1 findbugs. The patch does not introduce any new Findbugs warnings.

+1 release audit. The applied patch does not increase the total number of release audit warnings.

+1 core tests. The patch passed core unit tests.

+1 contrib tests. The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2442/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2442/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2442/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2442/console

This message is automatically generated.


Tsz Wo (Nicholas), SZE added a comment - 15/May/08 06:55 PM
+1 patch looks good

Chris Douglas added a comment - 15/May/08 08:47 PM
I just committed this.

Hudson added a comment - 16/May/08 12:52 PM