Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-284

Parsing Rich Document Types

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.4
    • update
    • None

    Description

      I have developed a RichDocumentRequestHandler based on the CSVRequestHandler that supports streaming a PDF, Word, Powerpoint, Excel, or PDF document into Solr.

      There is a wiki page with information here: http://wiki.apache.org/solr/UpdateRichDocuments

      Attachments

        1. test-files.zip
          1.01 MB
          Eric Pugh
        2. libs.zip
          4.78 MB
          Eric Pugh
        3. rich.patch
          4 kB
          Eric Pugh
        4. source.zip
          17 kB
          Eric Pugh
        5. test.zip
          8 kB
          Eric Pugh
        6. rich.patch
          68 kB
          Chris Harris
        7. rich.patch
          404 kB
          Chris Harris
        8. test-files.zip
          1022 kB
          Chris Harris
        9. rich.patch
          79 kB
          Chris Harris
        10. rich.patch
          79 kB
          Chris Harris
        11. un-hardcode-id.diff
          4 kB
          Chris Harris
        12. rich.patch
          79 kB
          Chris Harris
        13. rich.patch
          81 kB
          Chris Harris
        14. SOLR-284.patch
          127 kB
          Grant Ingersoll
        15. SOLR-284.patch
          134 kB
          Grant Ingersoll
        16. solr-word.pdf
          21 kB
          Grant Ingersoll
        17. SOLR-284.patch
          123 kB
          Chris Harris
        18. SOLR-284.patch
          123 kB
          Chris Harris
        19. SOLR-284.patch
          124 kB
          Chris Harris
        20. SOLR-284.patch
          127 kB
          Chris Harris
        21. SOLR-284.patch
          124 kB
          Chris Harris
        22. SOLR-284-no-key-gen.patch
          6 kB
          Grant Ingersoll
        23. SOLR-284.patch
          33 kB
          Yonik Seeley
        24. schema_update.patch
          3 kB
          Yonik Seeley
        25. SOLR-284.patch
          5 kB
          Grant Ingersoll

        Issue Links

          Activity

            People

              gsingers Grant Ingersoll
              epugh Eric Pugh
              Votes:
              32 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: