Solr
  1. Solr
  2. SOLR-3246

UpdateRequestProcessor to extract Solr XML from rich documents

    Details

    • Type: New Feature New Feature
    • Status: Open
    • Priority: Minor Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: update
    • Labels:
      None

      Description

      This would be an update request handler to save a file with the xml that represents the document in an external directory. The original
      idea behind this was to add it to the processing chain of the ExtractingRequestHandler to store an already parsed version of the docs. This storage of pre-parsed documents will make the re indexing of the entire index faster (avoiding the Tika phase, and just sending the xml to the standard update processor).
      As a side effect, extracting the xml can make debugging of rich docs easier.

      1. SOLR-3246.patch
        23 kB
        Emmanuel Espina
      2. SOLR-3246.patch
        19 kB
        Emmanuel Espina

        Activity

        No work has yet been logged on this issue.

          People

          • Assignee:
            Unassigned
            Reporter:
            Emmanuel Espina
          • Votes:
            1 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:

              Development