Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Not a Problem
    • Affects Version/s: 1.3.1
    • Fix Version/s: None
    • Component/s: Text extraction
    • Labels:
      None
    • Environment:
      tika-0.8

      Description

      german umlaute are not recognized in this document
      http://www.computing.dcu.ie/~irehbein/SS08/uebung1/stts-guide.pdf

      Guidelines f
      
      ur das Tagging deutscher Textcorpora

      1. stts-guide.pdf
        386 kB
        Jukka Zitting

        Issue Links

          Activity

          Reinhard Schwab created issue -
          Reinhard Schwab made changes -
          Field Original Value New Value
          Affects Version/s 1.3.0 [ 12315175 ]
          Fix Version/s 1.3.0 [ 12315175 ]
          Jukka Zitting made changes -
          Attachment stts-guide.pdf [ 12457139 ]
          Jukka Zitting made changes -
          Fix Version/s 1.3.0 [ 12315175 ]
          Tilman Hausherr made changes -
          Link This issue is related to PDFBOX-1791 [ PDFBOX-1791 ]
          John Hewson made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Not a Problem [ 8 ]
          Andreas Lehmkühler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Unassigned
              Reporter:
              Reinhard Schwab
            • Votes:
              1 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development