Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-284

Parsing Rich Document Types

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.4
    • update
    • None

    Description

      I have developed a RichDocumentRequestHandler based on the CSVRequestHandler that supports streaming a PDF, Word, Powerpoint, Excel, or PDF document into Solr.

      There is a wiki page with information here: http://wiki.apache.org/solr/UpdateRichDocuments

      Attachments

        1. libs.zip
          4.78 MB
          Eric Pugh
        2. rich.patch
          81 kB
          Chris Harris
        3. rich.patch
          79 kB
          Chris Harris
        4. rich.patch
          79 kB
          Chris Harris
        5. rich.patch
          79 kB
          Chris Harris
        6. rich.patch
          404 kB
          Chris Harris
        7. rich.patch
          68 kB
          Chris Harris
        8. rich.patch
          4 kB
          Eric Pugh
        9. schema_update.patch
          3 kB
          Yonik Seeley
        10. SOLR-284.patch
          5 kB
          Grant Ingersoll
        11. SOLR-284.patch
          33 kB
          Yonik Seeley
        12. SOLR-284.patch
          124 kB
          Chris Harris
        13. SOLR-284.patch
          127 kB
          Chris Harris
        14. SOLR-284.patch
          124 kB
          Chris Harris
        15. SOLR-284.patch
          123 kB
          Chris Harris
        16. SOLR-284.patch
          123 kB
          Chris Harris
        17. SOLR-284.patch
          134 kB
          Grant Ingersoll
        18. SOLR-284.patch
          127 kB
          Grant Ingersoll
        19. SOLR-284-no-key-gen.patch
          6 kB
          Grant Ingersoll
        20. solr-word.pdf
          21 kB
          Grant Ingersoll
        21. source.zip
          17 kB
          Eric Pugh
        22. test.zip
          8 kB
          Eric Pugh
        23. test-files.zip
          1022 kB
          Chris Harris
        24. test-files.zip
          1.01 MB
          Eric Pugh
        25. un-hardcode-id.diff
          4 kB
          Chris Harris

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            gsingers Grant Ingersoll
            epugh Eric Pugh
            Votes:
            32 Vote for this issue
            Watchers:
            11 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment