TIKA-1857: Enhance PDFParser to extract text from XFA forms

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • None
    • Fix Version/s: 1.13
    • Component/s: parser
    • Patch

    Description

      Extract text from PDF Forms (XFA). Information about XFA: https://en.wikipedia.org/wiki/XFA
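
      To illustrate the kind of extraction requested above, here is a minimal sketch that assumes the XFA data packet has already been pulled out of the PDF as an XML string. It is not the Tika implementation and does not use the Tika or PDFBox APIs; the class name XfaTextSketch, the method extractFieldValues, and the sample form fields are hypothetical. It walks the XML with the JDK's DOM parser and collects the text of every non-empty leaf element.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only: not part of Tika or PDFBox.
public class XfaTextSketch {

    /** Returns "name: value" for every leaf element that has non-blank text. */
    public static List<String> extractFieldValues(String xfaXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Reject DOCTYPE declarations so untrusted form data cannot trigger XXE.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xfaXml)));

        List<String> values = new ArrayList<>();
        collect(doc.getDocumentElement(), values);
        return values;
    }

    // Depth-first walk; only elements without element children count as fields.
    private static void collect(Node node, List<String> out) {
        boolean hasElementChild = false;
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.ELEMENT_NODE) {
                hasElementChild = true;
                collect(child, out);
            }
        }
        if (!hasElementChild) {
            String text = node.getTextContent().trim();
            if (!text.isEmpty()) {
                out.add(node.getNodeName() + ": " + text);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        // Assumed sample packet; real XFA data is usually much larger.
        String sample =
            "<xfa:datasets xmlns:xfa=\"http://www.xfa.org/schema/xfa-data/1.0/\">"
            + "<xfa:data><form1><Name>Jane Doe</Name><City>Ottawa</City><Notes/></form1></xfa:data>"
            + "</xfa:datasets>";
        extractFieldValues(sample).forEach(System.out::println);
    }
}
```

      Treating only leaf elements as fields keeps container elements such as form1 out of the output. A real parser would also need to locate the XFA stream inside the PDF and decide how to handle the template packet, which this sketch leaves out.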

      Attachments

        1. xfa_in_govdocs1.txt (3 kB, Tim Allison)
        2. govdocs1_xfas.zip (8.26 MB, Tim Allison)
        3. doc8.pdf (109 kB, Kenneth Lui)
        4. 041617_filled_out.pdf (815 kB, Tim Allison)

            People

              Assignee: Unassigned
              Reporter: Pascal Essiembre (pascal.essiembre)
              Votes: 0
              Watchers: 7