[HADOOP-17404] ABFS: Piggyback flush on Append calls for short writes - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.3.0
Fix Version/s: 3.3.1
Component/s: fs/azure
Labels:
- pull-request-available

Description

When Hflush or Hsync APIs are called, a call is made to store backend to commit the data that was appended.

If the data size written by Hadoop app is small, i.e. data size :

before any of HFlush/HSync call is made or

between 2 HFlush/Hsync API calls

is less than write buffer size, 2 separate calls, one for append and another for flush is made,

Apps that do such small writes eventually end up with almost similar number of calls for flush and append.

This PR enables Flush to be piggybacked onto append call for such short write scenarios.

NOTE: The changes is guarded over a config, and is disabled by default until relevant supported changes is made available on all store production clusters.

New Config added: fs.azure.write.enableappendwithflush

Attachments

Issue Links

links to

GitHub Pull Request #2509

PR Link - 2509

Activity

People

Assignee:: Sneha Vijayarajan

Reporter:: Sneha Vijayarajan

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 01/Dec/20 17:51

Updated:: 22/Jan/21 11:12

Resolved:: 22/Jan/21 11:12

Time Tracking

Estimated:

Not Specified

Remaining:

Logged: