[HADOOP-36] Adding some uniformity/convenience to environment management - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.1.0
Component/s: conf
Labels: None

Description

Currently, "slaves" are loaded from ~/.slaves. It would be better to default to a file under the conf directory, something like conf/hadoop-slaves.

Perhaps also split the slaves list into separate sets for "datanodes" and "tasktracker" nodes, e.g. conf/hadoop-slaves-tasktracker and conf/hadoop-slaves-datanodes, or some similar split. It may be worth building in the assumption that the tasktracker set is a superset and thus implicitly includes the datanodes, but this might be a bad assumption.
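As a rough illustration of the proposed split, a script could look up a role-specific slaves file and fall back to the shared one. This is only a sketch under assumptions: the function name `resolve_slaves_file` and the `HADOOP_CONF_DIR` override are hypothetical and not part of this issue; the file names follow the ones suggested above.

```shell
#!/bin/sh
# Hypothetical helper: choose the slaves file for a given role.
# resolve_slaves_file and HADOOP_CONF_DIR are illustrative names, not
# taken from the actual hadoop scripts.

resolve_slaves_file() {
  role="${1:-}"                          # "datanodes", "tasktracker", or empty
  conf_dir="${HADOOP_CONF_DIR:-conf}"    # assumed override for the conf directory

  # An explicit HADOOP_SLAVES setting always wins.
  if [ -n "${HADOOP_SLAVES:-}" ]; then
    echo "$HADOOP_SLAVES"
    return 0
  fi

  # Prefer a role-specific list when one exists.
  if [ -n "$role" ] && [ -f "$conf_dir/hadoop-slaves-$role" ]; then
    echo "$conf_dir/hadoop-slaves-$role"
    return 0
  fi

  # Otherwise fall back to the shared list.
  echo "$conf_dir/hadoop-slaves"
}
```

Falling back to the shared file avoids assuming that one role's node set is a superset of the other; a site that wants different sets simply creates the role-specific files.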

Also, make sure all scripts source something like conf/hadoop-env.sh, so that the user can edit hadoop-env.sh to specify JAVA_HOME or an alternate HADOOP_SLAVES location. It would also be desirable to have a seed CLASSPATH here, possibly named HADOOP_CLASSPATH, to make it explicit and to keep the hadoop scripts from interacting with an otherwise-set system CLASSPATH variable.
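A minimal sketch of what such a shared preamble might look like, assuming each script calls it before doing any work. The names `load_hadoop_env`, `HADOOP_CONF_DIR`, and `JAVA_CLASSPATH` are hypothetical and chosen for illustration; only hadoop-env.sh, JAVA_HOME, HADOOP_SLAVES, and HADOOP_CLASSPATH come from the proposal itself.

```shell
#!/bin/sh
# Hypothetical preamble for the hadoop scripts. load_hadoop_env,
# HADOOP_CONF_DIR and JAVA_CLASSPATH are illustrative names only.

load_hadoop_env() {
  conf_dir="${HADOOP_CONF_DIR:-conf}"    # assumed override for the conf directory

  # Pull in user settings (JAVA_HOME, HADOOP_SLAVES, HADOOP_CLASSPATH) if present.
  if [ -f "$conf_dir/hadoop-env.sh" ]; then
    . "$conf_dir/hadoop-env.sh"
  fi

  # Default the slaves list to the conf directory instead of ~/.slaves.
  HADOOP_SLAVES="${HADOOP_SLAVES:-$conf_dir/hadoop-slaves}"

  # Seed the classpath from HADOOP_CLASSPATH only; the system CLASSPATH
  # is deliberately not consulted.
  JAVA_CLASSPATH="${HADOOP_CLASSPATH:-}"
}
```

With this shape, a user would put lines such as `JAVA_HOME=...` or `HADOOP_CLASSPATH=...` in conf/hadoop-env.sh, and any value already set for CLASSPATH in the login environment would have no effect on the hadoop scripts.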

These changes would probably be useful to the Nutch project, too.

People

Assignee: Unassigned

Reporter: Bryan Pendleton

Votes: 0

Watchers: 1

Dates

Created: 14/Feb/06 05:03

Updated: 03/Aug/06 17:46

Resolved: 16/Feb/06 07:03