[LUCENE-2019] map unicode process-internal codepoints to replacement character - ASF JIRA

Details

Type: Improvement
Status: Reopened
Priority: Minor
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: core/index
Labels: None
Lucene Fields: New

Description

A spinoff from ~~LUCENE-2016~~.

Unicode defines several code points reserved for process-internal use (the noncharacters). We should not store these in the index.
Instead, they should be mapped to the replacement character (U+FFFD), so that they remain available for process-internal use.

For example, Lucene Java currently uses U+FFFF process-internally, so that code point cannot appear in the index without causing problems.
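
The mapping proposed above could look roughly like the following. This is a minimal sketch, not the contents of the attached patch: the class `NoncharacterMapper` and its methods are hypothetical names, and it assumes "process-internal codepoints" means the 66 Unicode noncharacters (U+FDD0..U+FDEF, plus the last two code points of every plane, such as U+FFFE and U+FFFF).

```java
// Hypothetical helper for illustration only; not part of Lucene's API.
final class NoncharacterMapper {
    private static final int REPLACEMENT_CHAR = 0xFFFD;

    // True for the 66 Unicode noncharacters:
    // U+FDD0..U+FDEF, and U+nFFFE / U+nFFFF in each of the 17 planes.
    static boolean isNoncharacter(int codePoint) {
        return (codePoint >= 0xFDD0 && codePoint <= 0xFDEF)
                || (codePoint & 0xFFFE) == 0xFFFE;
    }

    // Returns a copy of the input with every noncharacter replaced by U+FFFD.
    // All other code points (including unpaired surrogates) pass through unchanged.
    static String mapNoncharacters(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); ) {
            int cp = input.codePointAt(i);
            out.appendCodePoint(isNoncharacter(cp) ? REPLACEMENT_CHAR : cp);
            i += Character.charCount(cp); // advance by 1 or 2 UTF-16 units
        }
        return out.toString();
    }
}
```

Under these assumptions, a term containing U+FFFF would reach the index with U+FFFD in its place, leaving U+FFFF free for internal use. Where in the indexing chain such a mapping belongs is left open here.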

Attachments

LUCENE-2019.patch
29/Oct/09 21:35
1 kB
Robert Muir

People

Assignee: Unassigned

Reporter: Robert Muir

Votes: 0

Watchers: 0

Dates

Created: 29/Oct/09 18:18

Updated: 28/Aug/22 12:12