Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.4.0
-
None
-
None
-
CDH 4.4
Description
We are using a Spooling source to read from files and have noticed that there are situations where the source will stop reading in the middle of a file (see attached example file). After much debugging, I have determine that this occurs if the buffer in ResettableFileInputStream happens to fill when in the middle of handling a UTF-8 character. In this case readChar() will return -1 which ends the read lines loop in LineSerializer. Running the spool source with the attached file should result in > 5400 events in the channel but now it will only return ~3300.
Attachments
Attachments
Issue Links
- duplicates
-
FLUME-2215 ResettableFileInputStream can't support ucs-4 character
-
- Closed
-
- relates to
-
FLUME-2535 DEFAULT_BUF_SIZE hits special unicode character on the last byte truncates the event stream
-
- Open
-