Issue Details (XML | Word | Printable)

Key: HDFS-225
Type: New Feature New Feature
Status: Open Open
Priority: Major Major
Assignee: Enis Soztutar
Reporter: Michel Tourn
Votes: 10
Watchers: 25
Operations

If you were logged in you would be able to see more operations.
Hadoop HDFS

Expose HDFS as a WebDAV store

Created: 30/Aug/06 07:28 PM   Updated: 03/Aug/09 01:30 PM
Return to search
Component/s: None
Affects Version/s: None
Fix Version/s: None

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works hadoop-496-3.patch 2007-11-15 12:33 AM Michael Bieniosek 43 kB
Text File Licensed for inclusion in ASF works hadoop-496-4.patch 2007-11-16 10:25 PM Michael Bieniosek 38 kB
File Licensed for inclusion in ASF works hadoop-496-5.tgz 2008-01-26 07:18 AM Michael Bieniosek 7 kB
Text File Licensed for inclusion in ASF works hadoop-496-spool-cleanup.patch 2007-11-15 01:43 AM Michael Bieniosek 43 kB
Zip Archive hadoop-webdav.zip 2007-07-07 01:23 PM Albert Strasheim 15 kB
XML File Licensed for inclusion in ASF works jetty-slide.xml 2007-07-07 01:11 PM Albert Strasheim 2 kB
GZip Archive Licensed for inclusion in ASF works lib.webdav.tar.gz 2007-07-25 01:38 PM Enis Soztutar 2.14 MB
File Licensed for inclusion in ASF works slideusers.properties 2007-07-07 01:12 PM Albert Strasheim 0.0 kB
Text File Licensed for inclusion in ASF works webdav_wip1.patch 2007-07-25 01:38 PM Enis Soztutar 41 kB
Text File Licensed for inclusion in ASF works webdav_wip2.patch 2007-07-26 03:34 PM Enis Soztutar 43 kB
Image Attachments:

1. screenshot-1.jpg
(54 kB)
Issue Links:
Dependants
 
Reference
 


 Description  « Hide
WebDAV stands for Distributed Authoring and Versioning. It is a set of extensions to the HTTP protocol that lets users collaboratively edit and manage files on a remote web server. It is often considered as a replacement for NFS or SAMBA

HDFS (Hadoop Distributed File System) needs a friendly file system interface. DFSShell commands are unfamiliar. Instead it is more convenient for Hadoop users to use a mountable network drive. A friendly interface to HDFS will be used both for casual browsing of data and for bulk import/export.

The FUSE provider for HDFS is already available ( http://issues.apache.org/jira/browse/HADOOP-17 ) but it had scalability problems. WebDAV is a popular alternative.

The typical licensing terms for WebDAV tools are also attractive:
GPL for Linux client tools that Hadoop would not redistribute anyway.
More importantly, Apache Project/Apache license for Java tools and for server components.
This allows for a tighter integration with the HDFS code base.

There are some interesting Apache projects that support WebDAV.
But these are probably too heavyweight for the needs of Hadoop:
Tomcat servlet: http://tomcat.apache.org/tomcat-4.1-doc/catalina/docs/api/org/apache/catalina/servlets/WebdavServlet.html
Slide: http://jakarta.apache.org/slide/

Being HTTP-based and "backwards-compatible" with Web Browser clients, the WebDAV server protocol could even be piggy-backed on the existing Web UI ports of the Hadoop name node / data nodes. WebDAV can be hosted as (Jetty) servlets. This minimizes server code bloat and this avoids additional network traffic between HDFS and the WebDAV server.

General Clients (read-only):
Any web browser

Linux Clients:
Mountable GPL davfs2 http://dav.sourceforge.net/
FTP-like GPL Cadaver http://www.webdav.org/cadaver/

Server Protocol compliance tests:
http://www.webdav.org/neon/litmus/
A goal is for Hadoop HDFS to pass this test (minus support for Properties)

Pure Java clients:
DAV Explorer Apache lic. http://www.ics.uci.edu/~webdav/

WebDAV also makes it convenient to add advanced features in an incremental fashion:
file locking, access control lists, hard links, symbolic links.
New WebDAV standards get accepted and more or less featured WebDAV clients exist.
core http://www.webdav.org/specs/rfc2518.html
ACLs http://www.webdav.org/specs/rfc3744.html
redirects "soft links" http://greenbytes.de/tech/webdav/rfc4437.html
BIND "hard links" http://www.webdav.org/bind/
quota http://tools.ietf.org/html/rfc4331



 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Michel Tourn made changes - 06/Sep/06 08:33 PM
Field Original Value New Value
Link This issue is related to HADOOP-512 [ HADOOP-512 ]
Doug Cutting made changes - 15/Dec/06 09:44 PM
Assignee Sameer Paranjpye [ sameerp ]
Albert Strasheim made changes - 07/Jul/07 01:11 PM
Attachment jetty-slide.xml [ 12361349 ]
Albert Strasheim made changes - 07/Jul/07 01:12 PM
Attachment slideusers.properties [ 12361350 ]
Albert Strasheim made changes - 07/Jul/07 01:23 PM
Attachment hadoop-webdav.zip [ 12361351 ]
Enis Soztutar made changes - 25/Jul/07 01:38 PM
Attachment lib.webdav.tar.gz [ 12362526 ]
Attachment webdav_wip1.patch [ 12362527 ]
Enis Soztutar made changes - 25/Jul/07 01:39 PM
Assignee Sameer Paranjpye [ sameerp ] Enis Soztutar [ enis ]
Enis Soztutar made changes - 26/Jul/07 03:34 PM
Attachment webdav_wip2.patch [ 12362625 ]
Albert Strasheim made changes - 04/Aug/07 12:12 PM
Link This issue depends on HADOOP-1654 [ HADOOP-1654 ]
Pete Wyckoff made changes - 07/Nov/07 12:38 AM
Attachment screenshot-1.jpg [ 12369066 ]
Michael Bieniosek made changes - 15/Nov/07 12:33 AM
Attachment hadoop-496-3.patch [ 12369559 ]
Michael Bieniosek made changes - 15/Nov/07 01:43 AM
Attachment hadoop-496-spool-cleanup.patch [ 12369561 ]
Michael Bieniosek made changes - 16/Nov/07 10:25 PM
Attachment hadoop-496-4.patch [ 12369698 ]
Anurag Sharma made changes - 01/Dec/07 12:18 AM
Attachment fuse-j-patch.zip [ 12370717 ]
Attachment fuse-j-hadoopfs-0.zip [ 12370716 ]
Anurag Sharma made changes - 05/Dec/07 08:13 PM
Attachment fuse-j-patch.zip [ 12370717 ]
Anurag Sharma made changes - 05/Dec/07 08:14 PM
Attachment fuse-j-hadoopfs-0.zip [ 12370716 ]
Michael Bieniosek made changes - 26/Jan/08 07:18 AM
Attachment hadoop-496-5.tgz [ 12374118 ]
Owen O'Malley made changes - 20/Jun/09 07:42 AM
Component/s dfs [ 12310710 ]
Key HADOOP-496 HDFS-225
Project Hadoop Common [ 12310240 ] HDFS [ 12310942 ]