Issue Details (XML | Word | Printable)

Key: HADOOP-4952
Type: New Feature New Feature
Status: Resolved Resolved
Resolution: Fixed
Priority: Major Major
Assignee: Sanjay Radia
Reporter: Sanjay Radia
Votes: 0
Watchers: 25
Operations

If you were logged in you would be able to see more operations.
Hadoop Common

Improved files system interface for the application writer.

Created: 29/Dec/08 08:55 PM   Updated: 20/Nov/09 11:41 PM
Return to search
Component/s: fs
Affects Version/s: 0.21.0
Fix Version/s: 0.21.0

Time Tracking:
Issue & Sub-Tasks
Issue Only
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works FileContext-common10.patch 2009-09-07 10:59 PM Sanjay Radia 81 kB
Text File Licensed for inclusion in ASF works FileContext-common11.patch 2009-09-08 09:35 PM Sanjay Radia 83 kB
Text File Licensed for inclusion in ASF works FileContext-common12.patch 2009-09-10 06:59 AM Sanjay Radia 88 kB
Text File Licensed for inclusion in ASF works FileContext-common13.patch 2009-09-13 04:55 PM Sanjay Radia 88 kB
Text File Licensed for inclusion in ASF works FileContext-common14.patch 2009-09-15 12:13 AM Sanjay Radia 88 kB
Text File Licensed for inclusion in ASF works FileContext-common16.patch 2009-09-16 05:44 AM Sanjay Radia 88 kB
Text File Licensed for inclusion in ASF works FileContext-common18.patch 2009-09-16 09:27 AM Sanjay Radia 88 kB
Text File Licensed for inclusion in ASF works FileContext-common19.patch 2009-09-16 03:16 PM Sanjay Radia 90 kB
Text File Licensed for inclusion in ASF works FileContext-common21.patch 2009-09-16 06:55 PM Sanjay Radia 92 kB
Text File Licensed for inclusion in ASF works FileContext-common22.patch 2009-09-17 04:30 AM Sanjay Radia 91 kB
Text File Licensed for inclusion in ASF works FileContext-common24.patch 2009-09-17 07:27 PM Sanjay Radia 92 kB
Text File Licensed for inclusion in ASF works FileContext-common25.patch 2009-09-17 08:32 PM Sanjay Radia 92 kB
Text File Licensed for inclusion in ASF works FileContext-hdfs10.patch 2009-09-07 10:59 PM Sanjay Radia 5 kB
Text File Licensed for inclusion in ASF works FileContext-hdfs11.patch 2009-09-08 09:35 PM Sanjay Radia 8 kB
Text File Licensed for inclusion in ASF works FileContext3.patch 2009-08-27 02:37 AM Sanjay Radia 65 kB
Text File Licensed for inclusion in ASF works FileContext5.patch 2009-08-27 07:50 AM Sanjay Radia 68 kB
Text File Licensed for inclusion in ASF works FileContext6.patch 2009-08-31 03:46 AM Sanjay Radia 70 kB
Text File Licensed for inclusion in ASF works FileContext7.patch 2009-09-01 02:16 AM Sanjay Radia 73 kB
Text File Licensed for inclusion in ASF works FileContext9.patch 2009-09-04 06:17 PM Sanjay Radia 79 kB
Java Source File Licensed for inclusion in ASF works Files.java 2009-08-11 07:00 AM Sanjay Radia 15 kB
Java Source File Licensed for inclusion in ASF works Files.java 2008-12-29 09:02 PM Sanjay Radia 11 kB
Text File Licensed for inclusion in ASF works FilesContext1.patch 2009-08-25 01:49 AM Sanjay Radia 53 kB
Text File Licensed for inclusion in ASF works FilesContext2.patch 2009-08-25 04:41 PM Sanjay Radia 66 kB
Issue Links:
Blocker
 
Incorporates
 
Reference
dependent
 

Hadoop Flags: Reviewed
Release Note: New FileContext API introduced to replace FileSystem API. FileContext will be the version-compatible API for future releases. FileSystem API will be deprecated in the next release.
Resolution Date: 17/Sep/09 10:29 PM

Sub-Tasks  All   Open   
No sub-tasks match this view.

 Description  « Hide
Currently the FIleSystem interface serves two purposes:
  • an application writer's interface for using the Hadoop file system
  • a file system implementer's interface (e.g. hdfs, local file system, kfs, etc)

This Jira proposes that we provide a simpler interfaces for the application writer and leave the FilsSystem interface for the implementer of a filesystem.

  • Filesystem interface has a confusing set of methods for the application writer
  • We could make it easier to take advantage of the URI file naming
    • Current approach is to get FileSystem instance by supplying the URI and then access that name space. It is consistent for the FileSystem instance to not accept URIs for other schemes, but we can do better.
    • The special copyFromLocalFIle can be generalized as a copyFile where the src or target can be generalized to any URI, including the local one.
    • The proposed scheme (below) simplifies this.
  • The client side config can be simplified.
    • New config() by default uses the default config. Since this is the common usage pattern, one should not need to always pass the config as a parameter when accessing the file system.
    • It does not handle multiple file systems too well. Today a site.xml is derived from a single Hadoop cluster. This does not make sense for multiple Hadoop clusters which may have different defaults.
    • Further one should need very little to configure the client side:
      • Default files system.
      • Block size
      • Replication factor
      • Scheme to class mapping
    • It should be possible to take Blocksize and replication factors defaults from the target file system, rather then the client size config. I am not suggesting we don't allow setting client side defaults, but most clients do not care and would find it simpler to take the defaults for their systems from the target file system.


 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
No work has yet been logged on this issue.