Details
-
Improvement
-
Status: Closed
-
Critical
-
Resolution: Implemented
-
None
-
None
-
None
-
None
Description
Ticket for work in progress on new FileSystem abstractions. Previously, we (Yahoo) submitted a ticket that would add support for humongous (1 million region+) tables via a hierarchical layout (HBASE-13991). However open source is moving in a similar but not identical direction in the future and so the patch will not be merged into open source.
We will be working on a different patch now with folks from open source. It will create/add to 2 layers-- a path abstraction layer and a use-oriented abstraction layer. The path abstraction layer is epitomized by classes like FsUtils (and in the patch new classes like AFsLayout). The use oriented abstraction layer is epitomized by existing classes like MasterFileSystem/HRegionFileSystem (and possibly new classes later) that build on the path abstraction layer and focus on 'doing things' (eg creating regions) and less on the gritty details like the paths.
This work on abstracting and isolating the paths from the use cases will help Yahoo not diverge too much from open source with its internal 'Humongous' table hierarchical layout, while also helping open source move further towards the eventual goal of redoing the FS layout in a similar (but different) hierarchical layout later that focuses on data directory uniformity (unlike the humongous patch) and storing hierarchy in the meta table instead which enables new optimizations (see HBASE-14090.)
Attached to this ticket is some work we've done at Yahoo so far that will be put into an open source HBase branch for further collaboration. The patch is not meant to be complete yet and is a work in progress. (Please wait on patch comments/reviews.) It also includes some Yahoo-specific 'humongous' layout code that will be removed before submission in open source.
Attachments
Attachments
Issue Links
- Dependent
-
HBASE-6205 Support an option to keep data of dropped table for some time
- Closed
- duplicates
-
HBASE-16250 Remove direct use of Hadoop Path/FileSysem 2/5
- Closed
-
HBASE-16251 Remove direct use of Hadoop Path/FileSysem 3/5
- Closed
-
HBASE-16248 Remove direct use of Hadoop Path/FileSysem
- Closed
- is duplicated by
-
HBASE-16249 Remove direct use of Hadoop Path/FileSysem 1/5
- Closed
-
HBASE-16252 Remove direct use of Hadoop Path/FileSysem 4/5
- Closed
-
HBASE-16253 Remove direct use of Hadoop Path/FileSysem 5/5
- Closed
- is related to
-
HBASE-14090 Redo FS layout; let go of tables/regions/stores directory hierarchy in DFS
- Open
-
HBASE-14887 Use Orientated Abstractions
- Resolved