[LUCENE-7053] Remove deprecated BytesRef#getUTF8SortedAsUTF16Comparator(); remove natural comparator in favour of Java 8 one - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 6.0
Component/s: core/other
Labels:
None

Lucene Fields:

New

Description

Followup from ~~LUCENE-7052~~: This removes the legacy, deprecated getUTF8SortedAsUTF16Comparator() in the BytesRef class. I know originally we added the different comparators to be able to allow the index term dict to be sorted in different order. This never proved to be useful, as many Lucene queries rely on the default order. The only codec that used another byte order internally was the Lucene 3 one (but it used the unicode spaghetti algorithm to reorder its term enums at runtime).

This patch also removes the BytesRef-Comparator completely and just implements compareTo. So all code can rely on natural ordering.

This patch also cleans up other usages of natural order comparators, e.g. in ArrayUtil, because Java 8 natively provides a comparator.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-7053.patch
28/Feb/16 11:00
14 kB
Uwe Schindler
LUCENE-7053.patch
28/Feb/16 12:02
17 kB
Uwe Schindler
LUCENE-7053.patch
28/Feb/16 12:26
29 kB
Uwe Schindler
LUCENE-7053.patch
28/Feb/16 16:01
43 kB
Uwe Schindler

Issue Links

depends upon

LUCENE-7052 BytesRefHash.sort should always sort in unicode code point order

Resolved

Activity

People

Assignee:: Uwe Schindler

Reporter:: Uwe Schindler

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 28/Feb/16 10:59

Updated:: 28/Aug/22 14:50

Resolved:: 29/Feb/16 08:26