Details
- Improvement
- Status: Closed
- Major
- Resolution: Fixed
- 2.1
- None
Description
We should change the format of strings written to indexes so that the length of the string is in bytes, not Java characters. This issue has been discussed at:
http://www.mail-archive.com/java-dev@lucene.apache.org/msg01970.html
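To make the difference concrete, here is a minimal sketch that writes the same string with both kinds of length prefix. It is an illustration under stated assumptions, not the Lucene implementation: the class and method names are hypothetical, the prefix is a simple 7-bits-per-byte variable-length int, and the payload is standard UTF-8 (the actual index encoding may differ).

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; none of these names come from Lucene.
final class StringLengthPrefix {

    // Writes a non-negative int using 7 bits per byte, low-order bits first.
    // The high bit of each byte signals that more bytes follow.
    static void writeVInt(ByteArrayOutputStream out, int value) {
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80);
            value >>>= 7;
        }
        out.write(value);
    }

    // Old style: the prefix counts Java chars (UTF-16 code units).
    static byte[] writeWithCharLength(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeVInt(out, s.length());
        out.write(utf8, 0, utf8.length);
        return out.toByteArray();
    }

    // Proposed style: the prefix counts the encoded bytes that follow.
    static byte[] writeWithByteLength(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeVInt(out, utf8.length);
        out.write(utf8, 0, utf8.length);
        return out.toByteArray();
    }

    // With a byte-length prefix the reader can slice out (or skip) the
    // payload without decoding it character by character.
    static String readWithByteLength(byte[] data) {
        int pos = 0;
        int shift = 0;
        int length = 0;
        int b;
        do {
            b = data[pos++] & 0xFF;
            length |= (b & 0x7F) << shift;
            shift += 7;
        } while ((b & 0x80) != 0);
        return new String(data, pos, length, StandardCharsets.UTF_8);
    }
}
```

For any string containing non-ASCII characters the two prefixes differ, so a reader given only a char count has to decode the bytes to find where the string ends. With a byte count it can skip straight past the string.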
We must increment the file format number to indicate this change. At least the format number in the segments file should change.
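As a rough sketch of why the format number matters, a reader could consult it to decide how string lengths should be interpreted. The constants, class name, and header layout below are assumptions made for illustration; they are not the values or layout used in the actual segments file.

```java
import java.nio.ByteBuffer;

// Hypothetical illustration only; constants and layout are assumed.
final class SegmentsFormat {

    // Assumed format numbers: one for the old char-count strings,
    // one for the new byte-count strings.
    static final int FORMAT_CHAR_LENGTH = 1;
    static final int FORMAT_BYTE_LENGTH = 2;

    // Builds a 4-byte big-endian header holding only the format number.
    static byte[] header(int format) {
        return ByteBuffer.allocate(4).putInt(format).array();
    }

    // Returns true if strings in this index carry byte-length prefixes.
    // Unknown format numbers are rejected rather than guessed at.
    static boolean stringLengthsInBytes(byte[] header) {
        int format = ByteBuffer.wrap(header).getInt();
        switch (format) {
            case FORMAT_BYTE_LENGTH:
                return true;
            case FORMAT_CHAR_LENGTH:
                return false;
            default:
                throw new IllegalArgumentException("Unknown format number: " + format);
        }
    }
}
```

Rejecting unknown numbers means an older reader fails loudly on a newer index instead of misreading string lengths.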
I'm targeting this for 2.1; that is, we shouldn't commit it to trunk until after 2.0 is released, to minimize incompatible changes between 1.9 and 2.0 (other than the removal of deprecated features).
Attachments
Issue Links
- is related to
  - HADOOP-302 class Text (replacement for class UTF8) was: HADOOP-136 (Closed)
- relates to
  - LUCENE-509 Performance optimization when retrieving a single field from a document (Resolved)