Lucene - Core
  1. Lucene - Core
  2. LUCENE-2380

Add FieldCache.getTermBytes, to load term data as byte[]

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.0-ALPHA
    • Component/s: core/search
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      With flex, a term is now an opaque byte[] (typically, utf8 encoded unicode string, but not necessarily), so we need to push this up the search stack.

      FieldCache now has getStrings and getStringIndex; we need corresponding methods to load terms as native byte[], since in general they may not be representable as String. This should be quite a bit more RAM efficient too, for US ascii content since each character would then use 1 byte not 2.

      1. LUCENE-2380_direct_arr_access.patch
        5 kB
        Yonik Seeley
      2. LUCENE-2380_enum.patch
        9 kB
        Yonik Seeley
      3. LUCENE-2380_enum.patch
        7 kB
        Yonik Seeley
      4. LUCENE-2380.patch
        133 kB
        Michael McCandless
      5. LUCENE-2380.patch
        129 kB
        Michael McCandless
      6. LUCENE-2380.patch
        114 kB
        Michael McCandless
      7. LUCENE-2380.patch
        92 kB
        Michael McCandless

        Issue Links

          Activity

          No work has yet been logged on this issue.

            People

            • Assignee:
              Michael McCandless
              Reporter:
              Michael McCandless
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development