Details
Task
Status: Open
Minor
Resolution: Unresolved
New
Description
Given that we now store the length of the field in norms, we could optimize the default norms format for the case where all norms are below 16 (0..15) and store each one on 4 bits. This would be picked up for short fields that have fewer than 16 terms (e.g. title fields) and would reduce disk utilization for those norms by a factor of 2.
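To make the proposal concrete, here is a minimal sketch of 4-bit packing, assuming norms arrive as a `long[]` of per-document values. `NibblePacker` and its methods are hypothetical names for illustration only; this is not Lucene's norms format or any existing Lucene API.

```java
/** Hypothetical helper (not part of Lucene): stores values in 0..15 two per byte. */
final class NibblePacker {
    private NibblePacker() {}

    /** Returns true if every value fits in 4 bits, i.e. lies in 0..15. */
    static boolean fitsInFourBits(long[] values) {
        for (long v : values) {
            if (v < 0 || v > 15) {
                return false;
            }
        }
        return true;
    }

    /** Packs two values per byte: even indexes in the low nibble, odd in the high nibble. */
    static byte[] pack(long[] values) {
        if (!fitsInFourBits(values)) {
            throw new IllegalArgumentException("all values must be in 0..15");
        }
        byte[] packed = new byte[(values.length + 1) / 2];
        for (int i = 0; i < values.length; i++) {
            int nibble = (int) values[i];
            if ((i & 1) == 0) {
                packed[i >>> 1] |= nibble;
            } else {
                packed[i >>> 1] |= nibble << 4;
            }
        }
        return packed;
    }

    /** Reads the value stored at the given logical index. */
    static int get(byte[] packed, int index) {
        int b = packed[index >>> 1] & 0xFF;
        return (index & 1) == 0 ? (b & 0x0F) : (b >>> 4);
    }
}
```

Compared with one byte per document, this halves the storage, but only when `fitsInFourBits` holds for the whole field; a single longer document forces a fallback to a wider encoding.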
Should this really be necessary given an iterator API? It seems too highly specialized and fragile (just like what the random-access APIs were doing), compared with using e.g. block-level compression like the posting lists.
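One possible reading of the block-level alternative is sketched below: values are split into fixed-size blocks, and each block is bit-packed with just enough bits for its own maximum. `BlockPackedValues` is a hypothetical class written for this illustration; it does not reproduce how Lucene encodes posting lists or norms, and the block size is an arbitrary parameter.

```java
/** Hypothetical sketch (not Lucene's codec): per-block bit packing of non-negative values. */
final class BlockPackedValues {
    private final int blockSize;
    private final int[] bitsPerValue; // bits used by each block
    private final long[][] blocks;    // packed words of each block

    BlockPackedValues(long[] values, int blockSize) {
        this.blockSize = blockSize;
        int numBlocks = (values.length + blockSize - 1) / blockSize;
        bitsPerValue = new int[numBlocks];
        blocks = new long[numBlocks][];
        for (int b = 0; b < numBlocks; b++) {
            int start = b * blockSize;
            int end = Math.min(start + blockSize, values.length);
            long max = 0;
            for (int i = start; i < end; i++) {
                if (values[i] < 0) {
                    throw new IllegalArgumentException("negative value");
                }
                max = Math.max(max, values[i]);
            }
            // Bits needed for the largest value in this block (at least 1, at most 63).
            int bits = Math.max(1, 64 - Long.numberOfLeadingZeros(max));
            bitsPerValue[b] = bits;
            long[] words = new long[((end - start) * bits + 63) / 64];
            for (int i = start; i < end; i++) {
                long bitPos = (long) (i - start) * bits;
                int word = (int) (bitPos >>> 6);
                int shift = (int) (bitPos & 63);
                words[word] |= values[i] << shift;
                if (shift + bits > 64) {
                    // The value straddles two words; store its high bits in the next word.
                    words[word + 1] |= values[i] >>> (64 - shift);
                }
            }
            blocks[b] = words;
        }
    }

    /** Bits per value chosen for the given block. */
    int bitsForBlock(int block) {
        return bitsPerValue[block];
    }

    /** Decodes the value at the given logical index. */
    long get(int index) {
        int b = index / blockSize;
        int bits = bitsPerValue[b];
        long bitPos = (long) (index % blockSize) * bits;
        int word = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        long v = blocks[b][word] >>> shift;
        if (shift + bits > 64) {
            v |= blocks[b][word + 1] << (64 - shift);
        }
        return v & ((1L << bits) - 1);
    }
}
```

With this layout a block of short fields still gets 4 bits or fewer per value, while one long document only widens its own block instead of disabling the optimization for the whole field.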