|
Kan Zhang made changes - 22/Oct/08 06:39 PM
Kan Zhang made changes - 22/Oct/08 06:42 PM
Kan Zhang made changes - 22/Oct/08 06:45 PM
Kan Zhang made changes - 22/Oct/08 06:46 PM
Arun C Murthy made changes - 22/Oct/08 08:25 PM
Arun C Murthy made changes - 16/Dec/08 10:33 PM
Arun C Murthy made changes - 16/Dec/08 10:33 PM
Arun C Murthy made changes - 16/Dec/08 10:33 PM
Arun C Murthy made changes - 22/Dec/08 10:15 PM
Arun C Murthy made changes - 22/Dec/08 10:53 PM
Arun C Murthy made changes - 23/Dec/08 05:49 PM
Kan Zhang made changes - 05/Mar/09 02:19 AM
Kan Zhang made changes - 24/Apr/09 04:41 PM
Vinod K V made changes - 07/Jul/09 05:07 AM
Kan Zhang made changes - 15/Jul/09 05:22 PM
Jeff Hammerbacher made changes - 22/Sep/09 11:00 PM
Kan Zhang made changes - 06/Oct/09 07:17 PM
Arun C Murthy made changes - 08/Oct/09 07:52 AM
Boris Shkolnik made changes - 22/Oct/09 07:05 PM
Kan Zhang made changes - 10/Nov/09 01:01 AM
Andrew Purtell made changes - 29/Nov/09 12:50 AM
Kan Zhang made changes - 08/Dec/09 06:52 PM
Kan Zhang made changes - 08/Dec/09 07:36 PM
Kan Zhang made changes - 08/Dec/09 08:08 PM
Kan Zhang made changes - 08/Dec/09 08:09 PM
Kan Zhang made changes - 08/Dec/09 08:09 PM
[
Permalink
| « Hide
]
Owen O'Malley added a comment - 18/Dec/09 10:16 PM
A security design overview for Hadoop.
Owen O'Malley made changes - 18/Dec/09 10:16 PM
Andrew Purtell made changes - 18/Dec/09 10:59 PM
Owen O'Malley made changes - 19/Dec/09 05:33 PM
Fixed title page to format better.
Owen O'Malley made changes - 19/Dec/09 05:33 PM
Kan Zhang made changes - 26/Dec/09 02:15 AM
I'm surprised I'm the first to comment: is the discussion going on elsewhere?
I read the design document over Christmas. Great to see a document with so much detail, thanks! I had some questions, and thought a couple of places could be clearer; my comments are below. ****** One thing that hasn't been covered (outside of assumptions) is more detail about how to operationally secure a Hadoop cluster in Unix-land. The assumptions section lays out some of these ("root" needs to be secure). Some things that I thought about: (1) data nodes node to write their data with a unix user that users don't have access to, and with appropriate permissions (or umask). (Looking at my local system, the DataNode has left blocks world-readable.) (2) We assume that the JT and NN are also run under unix accounts which users do not have access to. Since Data Nodes and the NameNode share a key, it's important to limit cluster membership. (This is critical for task trackers, too, since an evil task tracker could do nasty things.) What's the mechanism to limit cluster participation? Is there a central registry of what users can access HDFS and queues? Is there an "HDFS" superuser? In existing Hadoop, it's the username corresponding to the uid of the running the Namenode process.
It could also mean that the token is expired, no? I think this is made clearer in the following sentences.
What is the COPY access mode used for?
Somewhere else in the document, there's discussion of jobs having owners/groups, not just owners. Surely a superuser or cluster manager can kill jobs with appropriate permissions?
Will users still be able to use Hadoop in a "non-secure" manner? How much work would be involved in using a different security model? This is probably answered by the patch itself
Doug Cutting made changes - 15/Jan/10 08:32 PM
Kan Zhang made changes - 05/Feb/10 09:41 PM
Kan Zhang made changes - 08/Feb/10 10:38 PM
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||