Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-5329

Make DocumentDictionary and co more lenient to dirty documents

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.7, 6.0
    • Component/s: core/search
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      Currently DocumentDictionary errors out whenever any document does not have value for any relevant stored fields. It would be nice to make it lenient and instead ignore the invalid documents.

      Another "issue" with the DocumentDictionary is that it only allows string fields as suggestions and binary fields as payloads. When exposing these dictionaries to solr (via https://issues.apache.org/jira/browse/SOLR-5378), it is inconvenient for the user to ensure that a suggestion field is a string field and a payload field is a binary field. It would be nice to have the dictionary "just work" whenever a string/binary field is passed to suggestion/payload field. The patch provides one solution to this problem (by accepting string or binary values), though it would be great if there are any other solution to this, without making the DocumentDictionary "too flexible"

        Attachments

        1. LUCENE-5329.patch
          14 kB
          Areek Zillur
        2. LUCENE-5329.patch
          21 kB
          Areek Zillur
        3. LUCENE-5329.patch
          21 kB
          Areek Zillur
        4. LUCENE-5329.patch
          32 kB
          Areek Zillur
        5. LUCENE-5329.patch
          31 kB
          Areek Zillur

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              areek Areek Zillur
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: