Description
ResettableFileInputStream.java:readChar() not handle ucs-4 character. it need 2 charBuf. it cause an unexpected termination。
a temporary solution:
if (res.isOverflow() && !charBuf.hasRemaining()){
logger.warn("decoder ucs-4 at postion: {}" , buf.position());
tmpBuf.clear();
res = decoder.decode(buf, tmpBuf, isEndOfInput);
incrPosition( buf.position() - start, false);
return '?';
}
Attachments
Attachments
Issue Links
- is duplicated by
-
FLUME-2360 Spooling Source: LineDeserializer incorrectly stops reading events if UTF-8 char occurs at end of buffer
- Open
-
FLUME-2182 Spooling Directory Source will not ingest data completely when a wide character appears at the edge of a buffer
- Resolved
-
FLUME-2241 Spooling Directory Source doesn't handle 2 byte UTF-8 encoded characters correctly
- Closed
- relates to
-
FLUME-2535 DEFAULT_BUF_SIZE hits special unicode character on the last byte truncates the event stream
- Open
-
FLUME-654 how to let flume support Chinese log file
- Resolved
- links to