[HADOOP-6313] Expose flush APIs to application users - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.21.0, 0.22.0
Fix Version/s: 0.21.0
Component/s: fs
Labels:
None

Hadoop Flags:

Reviewed
Release Note:
FSOutputDataStream implement Syncable interface to provide hflush and hsync APIs to the application users.

Description

Earlier this year, Yahoo, Facebook, and Hbase developers had a roundtable discussion where we agreed to support three types of flush in HDFS (API1, 2, and 3) and the append project aims to implement API2. Here is a proposal to expose these APIs to application users.
1. Three flush APIs

API1: flushes out from the address space of client into the socket to the data nodes. On the return of the call there is no guarantee that that data is out of the underlying node and no guarantee of having reached a DN. New readers will eventually see this data if there are no failures.
API2: flushes out to all replicas of the block. The data is in the buffers of the DNs but not on the DN's OS buffers. New readers will see the data after the call has returned.
API3: flushes out to all replicas and all replicas have done posix fsync equivalent - ie the OS has flushed it to the disk device (but the disk may have it in its cache).

2. Support flush APIs in FS

FSDataOutputStream#flush supports API1
FSDataOutputStream implements Syncable interface defined below. If its wrapped output stream (i.e. each file system's stream) is Syncable, FSDataOutputStream#hflush() and hsync() call its wrapped output stream's hflush & hsync.
```
  public interface Syncable {
    public void hflush() throws IOException;  // support API2
    public void hsync() throws IOException;   // support API3
  }
```
In each file system, if only hflush() is implemented, hsync() by default calls hflush(). If only hsync() is implemented, hflush() by default calls flush().

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hflushCommon1.patch
26/Oct/09 23:58
5 kB
Hairong Kuang
hflushCommon2.patch
29/Oct/09 23:47
5 kB
Hairong Kuang
hflushCommon2.patch
29/Oct/09 22:04
5 kB
Hairong Kuang

Issue Links

blocks

HDFS-731 Support new Syncable interface in HDFS

Closed

relates to

HDFS-744 Support hsync in HDFS

Closed

Activity

People

Assignee:: Hairong Kuang

Reporter:: Hairong Kuang

Votes:: 0 Vote for this issue

Watchers:: 15 Start watching this issue

Dates

Created:: 14/Oct/09 17:19

Updated:: 24/Aug/10 20:40

Resolved:: 30/Oct/09 19:53