[HARMONY-6640] UTF8 decoder doesn't properly decode supplementary characters - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 5.0M14
Fix Version/s: 5.0M16
Component/s: Classlib
Labels: None
Environment: Windows Vista
Patch Info: Patch Available

Description

While attempting to build Lucene, I discovered a problem with UTF-8 decoding.
(This actually prevents our tests from even compiling without a workaround.)

For any code point > 0xFFFF (a 4-byte UTF-8 sequence), the decoder doesn't properly
split the decoded code point into a surrogate pair.
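
For illustration only, here is a minimal sketch of the result a correct decoder should produce for one supplementary character. It is not the Harmony decoder and not the attached patch. The class and method names (SurrogatePairSketch, decodeFourByte, highSurrogate, lowSurrogate) are hypothetical, the sample character U+10400 is an arbitrary choice, and the code assumes a well-formed 4-byte sequence. It also uses java.nio.charset.StandardCharsets, which is newer than the Java 5 API level Harmony targets.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not taken from Harmony or the patch.
final class SurrogatePairSketch {

    // Decodes one 4-byte UTF-8 sequence into a code point.
    // Assumes the input is well formed; no validation is performed.
    static int decodeFourByte(byte b0, byte b1, byte b2, byte b3) {
        return ((b0 & 0x07) << 18)
             | ((b1 & 0x3F) << 12)
             | ((b2 & 0x3F) << 6)
             |  (b3 & 0x3F);
    }

    // High (leading) surrogate for a code point in U+10000..U+10FFFF.
    static char highSurrogate(int codePoint) {
        return (char) (0xD800 + ((codePoint - 0x10000) >> 10));
    }

    // Low (trailing) surrogate for a code point in U+10000..U+10FFFF.
    static char lowSurrogate(int codePoint) {
        return (char) (0xDC00 + ((codePoint - 0x10000) & 0x3FF));
    }

    public static void main(String[] args) {
        // U+10400 encoded as UTF-8: F0 90 90 80.
        byte[] utf8 = {(byte) 0xF0, (byte) 0x90, (byte) 0x90, (byte) 0x80};

        int cp = decodeFourByte(utf8[0], utf8[1], utf8[2], utf8[3]);
        System.out.printf("code point: U+%X%n", cp);
        System.out.printf("surrogates: %04X %04X%n",
                (int) highSurrogate(cp), (int) lowSurrogate(cp));

        // The platform decoder should yield the same two chars.
        String decoded = new String(utf8, StandardCharsets.UTF_8);
        System.out.println("decoded length: " + decoded.length());
        System.out.println("matches: " + (decoded.charAt(0) == highSurrogate(cp)
                && decoded.charAt(1) == lowSurrogate(cp)));
    }
}
```

On a conforming runtime the decoded string has length 2 and holds the same pair that Character.toChars returns for the code point. A mismatch on either point would be consistent with the behavior described above.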

Attachments

HARMONY-6640.patch, 02/Sep/10 13:37, 2 kB, Robert Muir
HARMONY-6640.patch, 02/Sep/10 13:28, 1 kB, Robert Muir
nio_char.jar, 14/Sep/10 19:15, 1.34 MB, Mark Hindess

People

Assignee: Tim Ellison

Reporter: Robert Muir

Votes: 0

Watchers: 0

Dates

Created: 02/Sep/10 13:26

Updated: 15/Sep/10 20:14

Resolved: 14/Sep/10 14:06