
Time Tracking:
Original Estimate: 24h
Remaining Estimate: 24h
Time Spent: Not Specified

File Attachments:
getfs.patch | 2009-04-16 08:51 PM | Stephen Green | Size: 33 kB

Environment: Mac OS X 10.5 and Amazon's Elastic MapReduce
Resolution Date: 19/Apr/09 11:18 PM

Description:
All of the uses of org.apache.hadoop.fs.FileSystem.get use the single-argument version that takes a job configuration. This always returns the default file system type (which is usually HDFS), rather than the file system type named in the URIs of the input or output paths. This is particularly a problem on Amazon's Elastic MapReduce, where the input and output data typically reside in an org.apache.hadoop.fs.s3native.NativeS3FileSystem.

The only remaining instance is in org.apache.mahout.ga.watchmaker.MahoutEvaluator at line 64, where there's no obvious Path available to provide the first argument. Getting this one working will probably require refactoring that's beyond the scope of my Mahout understanding.
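
The difference described in this issue can be illustrated with a small stand-alone Java sketch. It does not use Hadoop: the class FsLookupSketch, its scheme registry, the default URI hdfs://namenode:9000/ and the bucket name are all hypothetical stand-ins chosen for illustration. In real Hadoop code the path-aware lookups are FileSystem.get(URI, Configuration) and Path.getFileSystem(Configuration). Which of the two the attached patch uses is not stated here.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;

/** Stand-alone model of scheme-based file system lookup. Not Hadoop code. */
class FsLookupSketch {
    // Hypothetical stand-in for the default file system set in a job configuration.
    static final URI DEFAULT_FS = URI.create("hdfs://namenode:9000/");

    // Hypothetical registry from URI scheme to file system implementation name.
    static final Map<String, String> IMPLS = new HashMap<>();
    static {
        IMPLS.put("hdfs", "org.apache.hadoop.hdfs.DistributedFileSystem");
        IMPLS.put("s3n", "org.apache.hadoop.fs.s3native.NativeS3FileSystem");
        IMPLS.put("file", "org.apache.hadoop.fs.LocalFileSystem");
    }

    // Models the single-argument lookup: the path is never consulted,
    // so the configured default always wins.
    static String getDefault() {
        return IMPLS.get(DEFAULT_FS.getScheme());
    }

    // Models a path-aware lookup: the scheme of the URI selects the
    // implementation, and a scheme-less URI falls back to the default.
    static String getFor(URI uri) {
        String scheme = uri.getScheme();
        if (scheme == null) {
            return getDefault();
        }
        String impl = IMPLS.get(scheme);
        if (impl == null) {
            throw new IllegalArgumentException("No file system for scheme: " + scheme);
        }
        return impl;
    }

    public static void main(String[] args) {
        URI input = URI.create("s3n://my-bucket/input/points.txt");
        // Default lookup ignores the s3n scheme of the input path.
        System.out.println(getDefault());
        // Path-aware lookup resolves the S3 native implementation.
        System.out.println(getFor(input));
    }
}
```

In this model, a job whose input lives under an s3n:// URI gets the HDFS implementation from getDefault() and the S3 native one from getFor(input), which mirrors the mismatch reported above. A call site with no path or URI in hand, as noted for MahoutEvaluator, can only perform the default lookup.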