Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Currently, the ForSt gives a direct buffer to ByteBufferWritableFSDataOutputStream, where the data will be written one byte by byte. According our perf, the statistics of hadoop based fs will be updated once for each byte, which takes a lot of CPU. Below is a flamegraph, where the statistics part is marked as purple (taking 8.14% of the overall CPU).
It might be better to copy to a heap buffer before invoking write.
Attachments
Attachments
Issue Links
- links to