[HADOOP-4952] Improved files system interface for the application writer. - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.21.0
Fix Version/s: 0.21.0
Component/s: fs
Labels:
None

Hadoop Flags:

Reviewed
Release Note:
New FileContext API introduced to replace FileSystem API. FileContext will be the version-compatible API for future releases. FileSystem API will be deprecated in the next release.

Description

Currently the FIleSystem interface serves two purposes:

an application writer's interface for using the Hadoop file system
a file system implementer's interface (e.g. hdfs, local file system, kfs, etc)

This Jira proposes that we provide a simpler interfaces for the application writer and leave the FilsSystem interface for the implementer of a filesystem.

Filesystem interface has a confusing set of methods for the application writer
We could make it easier to take advantage of the URI file naming
- Current approach is to get FileSystem instance by supplying the URI and then access that name space. It is consistent for the FileSystem instance to not accept URIs for other schemes, but we can do better.
- The special copyFromLocalFIle can be generalized as a copyFile where the src or target can be generalized to any URI, including the local one.
- The proposed scheme (below) simplifies this.

The client side config can be simplified.
- New config() by default uses the default config. Since this is the common usage pattern, one should not need to always pass the config as a parameter when accessing the file system.
- It does not handle multiple file systems too well. Today a site.xml is derived from a single Hadoop cluster. This does not make sense for multiple Hadoop clusters which may have different defaults.
- Further one should need very little to configure the client side:
  - Default files system.
  - Block size
  - Replication factor
  - Scheme to class mapping
- It should be possible to take Blocksize and replication factors defaults from the target file system, rather then the client size config. I am not suggesting we don't allow setting client side defaults, but most clients do not care and would find it simpler to take the defaults for their systems from the target file system.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

FileContext3.patch
27/Aug/09 02:37
65 kB
Sanjay Radia
FileContext5.patch
27/Aug/09 07:50
68 kB
Sanjay Radia
FileContext6.patch
31/Aug/09 03:46
70 kB
Sanjay Radia
FileContext7.patch
01/Sep/09 02:16
73 kB
Sanjay Radia
FileContext9.patch
04/Sep/09 18:17
79 kB
Sanjay Radia
FileContext-common10.patch
07/Sep/09 22:59
81 kB
Sanjay Radia
FileContext-common11.patch
08/Sep/09 21:35
83 kB
Sanjay Radia
FileContext-common12.patch
10/Sep/09 06:59
88 kB
Sanjay Radia
FileContext-common13.patch
13/Sep/09 16:55
88 kB
Sanjay Radia
FileContext-common14.patch
15/Sep/09 00:13
88 kB
Sanjay Radia
FileContext-common16.patch
16/Sep/09 05:44
88 kB
Sanjay Radia
FileContext-common18.patch
16/Sep/09 09:27
88 kB
Sanjay Radia
FileContext-common19.patch
16/Sep/09 15:16
90 kB
Sanjay Radia
FileContext-common21.patch
16/Sep/09 18:55
92 kB
Sanjay Radia
FileContext-common22.patch
17/Sep/09 04:30
91 kB
Sanjay Radia
FileContext-common24.patch
17/Sep/09 19:27
92 kB
Sanjay Radia
FileContext-common25.patch
17/Sep/09 20:32
92 kB
Sanjay Radia
FileContext-hdfs10.patch
07/Sep/09 22:59
5 kB
Sanjay Radia
FileContext-hdfs11.patch
08/Sep/09 21:35
8 kB
Sanjay Radia
Files.java
11/Aug/09 07:00
15 kB
Sanjay Radia
Files.java
29/Dec/08 21:02
11 kB
Sanjay Radia
FilesContext1.patch
25/Aug/09 01:49
53 kB
Sanjay Radia
FilesContext2.patch
25/Aug/09 16:41
66 kB
Sanjay Radia

Issue Links

blocks

HDFS-610 Add support for FileContext

Closed

HADOOP-6261 Junit tests for FileContextURI

Closed

HADOOP-6260 Unit tests for FileSystemContextUtil.

Closed

incorporates

HDFS-578 Support for using server default values for blockSize and replication when creating a file

Closed

is depended upon by

HADOOP-6356 Add a Cache for AbstractFileSystem in the new FileContext/AbstractFileSystem framework.

Open

is related to

HDFS-617 Support for non-recursive create() in HDFS

Closed

HDFS-618 Support for non-recursive mkdir in HDFS

Closed

relates to

HADOOP-6223 New improved FileSystem interface for those implementing new files systems.

Closed

HADOOP-6265 Remove deprecated protected methods added to FileSystem to support FileContext.

Open

HADOOP-6271 Fix FileContext to allow both recursive and non recursive create and mkdir

Closed

(2 is related to, 3 relates to)

Sub-Tasks

1.

New improved FileSystem interface for those implementing new files systems.

Closed

Sanjay Radia

Improved files system interface for the application writer.

Details

Description

Attachments

Attachments

Issue Links

Sub-Tasks

Activity

People

Dates