[LUCENE-2380] Add FieldCache.getTermBytes, to load term data as byte[] - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.0-ALPHA
Component/s: core/search
Labels:
None

Lucene Fields:

New

Description

With flex, a term is now an opaque byte[] (typically, utf8 encoded unicode string, but not necessarily), so we need to push this up the search stack.

FieldCache now has getStrings and getStringIndex; we need corresponding methods to load terms as native byte[], since in general they may not be representable as String. This should be quite a bit more RAM efficient too, for US ascii content since each character would then use 1 byte not 2.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-2380_direct_arr_access.patch
19/Jun/10 13:30
5 kB
Yonik Seeley
LUCENE-2380_enum.patch
14/Jun/10 22:51
9 kB
Yonik Seeley
LUCENE-2380_enum.patch
05/Jun/10 16:52
7 kB
Yonik Seeley
LUCENE-2380.patch
28/May/10 20:05
133 kB
Michael McCandless
LUCENE-2380.patch
25/May/10 10:49
129 kB
Michael McCandless
LUCENE-2380.patch
24/May/10 22:42
114 kB
Michael McCandless
LUCENE-2380.patch
18/May/10 17:21
92 kB
Michael McCandless

Issue Links

blocks

LUCENE-2364 Add support for terms in BytesRef format to Term, TermQuery, TermRangeQuery & Co.

Closed

is depended upon by

LUCENE-2426 change sort order to binary order

Closed

Activity

People

Assignee:: Michael McCandless

Reporter:: Michael McCandless

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 07/Apr/10 16:16

Updated:: 28/Aug/22 12:24

Resolved:: 03/Jun/10 18:38