Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-5329

Make DocumentDictionary and co more lenient to dirty documents

Details

    • Improvement
    • Status: Reopened
    • Major
    • Resolution: Fixed
    • None
    • 4.7, 6.0
    • core/search
    • None
    • New, Patch Available

    Description

      Currently DocumentDictionary errors out whenever any document does not have value for any relevant stored fields. It would be nice to make it lenient and instead ignore the invalid documents.

      Another "issue" with the DocumentDictionary is that it only allows string fields as suggestions and binary fields as payloads. When exposing these dictionaries to solr (via https://issues.apache.org/jira/browse/SOLR-5378), it is inconvenient for the user to ensure that a suggestion field is a string field and a payload field is a binary field. It would be nice to have the dictionary "just work" whenever a string/binary field is passed to suggestion/payload field. The patch provides one solution to this problem (by accepting string or binary values), though it would be great if there are any other solution to this, without making the DocumentDictionary "too flexible"

      Attachments

        1. LUCENE-5329.patch
          31 kB
          Areek Zillur
        2. LUCENE-5329.patch
          32 kB
          Areek Zillur
        3. LUCENE-5329.patch
          21 kB
          Areek Zillur
        4. LUCENE-5329.patch
          21 kB
          Areek Zillur
        5. LUCENE-5329.patch
          14 kB
          Areek Zillur

        Activity

          People

            Unassigned Unassigned
            areek Areek Zillur
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: