Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.1
    • Component/s: fetcher
    • Labels:
      None
    • Patch Info:
      Patch Available

      Description

      Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.

      1. NUTCH-705.patch
        174 kB
        Dmitry Lihachev

        Issue Links

          Activity

          Dmitry Lihachev created issue -
          Dmitry Lihachev added a comment -

          This parser correctly handles non-ASCII input.

          Dmitry Lihachev made changes -
          Field Original Value New Value
          Attachment NUTCH-705.patch [ 12401087 ]
          Dmitry Lihachev made changes -
          Link This issue duplicates NUTCH-644 [ NUTCH-644 ]
          Sami Siren added a comment -

          I think that the patch contains some LGPL code that we cannot commit into the Apache repository.

          Andrzej Bialecki made changes -
          Fix Version/s 1.0.0 [ 12312443 ]
          Priority Major [ 3 ] Minor [ 4 ]
          Description Demoting this issue and moving to 1.1 - current patch is not suitable due to LGPL licensed parts.
          Fix Version/s 1.1 [ 12313609 ]
          Dmitry Lihachev added a comment -

          Yes, it looks a bit like a problem... How can we handle this?

          Sami Siren added a comment -

          I think we should start looking at Apache Tika for most (or all) of our parsers.

          Julien Nioche made changes -
          Link This issue is related to NUTCH-766 [ NUTCH-766 ]
          Julien Nioche added a comment -

          RTF parsing is now handled by the TikaPlugin (NUTCH-766). Please open an issue on Tika if the original problem with non-ASCII characters still occurs.

          Julien Nioche made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Lewis John McGibbney made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Transition         Time In Source Status  Execution Times  Last Executer         Last Execution Date
          Open -> Resolved   356d 6h 31m            1                Julien Nioche         18/Feb/10 10:48
          Resolved -> Closed 1188d 17h 6m           1                Lewis John McGibbney  22/May/13 04:54

            People

            • Assignee:
              Unassigned
              Reporter:
              Dmitry Lihachev
            • Votes:
              0
              Watchers:
              0
