SOLR-284: Parsing Rich Document Types

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4
    • Component/s: update
    • Labels: None

    Description

      I have developed a RichDocumentRequestHandler, based on the CSVRequestHandler, that supports streaming a PDF, Word, PowerPoint, or Excel document into Solr.

      There is a wiki page with information here: http://wiki.apache.org/solr/UpdateRichDocuments

      Attachments

        1. libs.zip
          4.78 MB
          Eric Pugh
        2. rich.patch
          81 kB
          Chris Harris
        3. rich.patch
          79 kB
          Chris Harris
        4. rich.patch
          79 kB
          Chris Harris
        5. rich.patch
          79 kB
          Chris Harris
        6. rich.patch
          404 kB
          Chris Harris
        7. rich.patch
          68 kB
          Chris Harris
        8. rich.patch
          4 kB
          Eric Pugh
        9. schema_update.patch
          3 kB
          Yonik Seeley
        10. SOLR-284.patch
          5 kB
          Grant Ingersoll
        11. SOLR-284.patch
          33 kB
          Yonik Seeley
        12. SOLR-284.patch
          124 kB
          Chris Harris
        13. SOLR-284.patch
          127 kB
          Chris Harris
        14. SOLR-284.patch
          124 kB
          Chris Harris
        15. SOLR-284.patch
          123 kB
          Chris Harris
        16. SOLR-284.patch
          123 kB
          Chris Harris
        17. SOLR-284.patch
          134 kB
          Grant Ingersoll
        18. SOLR-284.patch
          127 kB
          Grant Ingersoll
        19. SOLR-284-no-key-gen.patch
          6 kB
          Grant Ingersoll
        20. solr-word.pdf
          21 kB
          Grant Ingersoll
        21. source.zip
          17 kB
          Eric Pugh
        22. test.zip
          8 kB
          Eric Pugh
        23. test-files.zip
          1022 kB
          Chris Harris
        24. test-files.zip
          1.01 MB
          Eric Pugh
        25. un-hardcode-id.diff
          4 kB
          Chris Harris

            People

              Assignee: Grant Ingersoll (gsingers)
              Reporter: Eric Pugh (epugh)
              Votes: 32
              Watchers: 11
