Tika / TIKA-513

Support for the DjVu (Deja Vu) format

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: parser
    • Labels: None

      Description

      It would be great if Tika could provide a parser for the DjVu format. Any suggestions or thoughts?

        Activity

        Jukka Zitting added a comment -

        Is there a DjVu parser we could use?

        Timothy Truckle added a comment -

        There is an open-source implementation called DjVuLibre
        (http://djvu.sourceforge.net/).

        Here is a Java parser plugin:
        http://dlibra.psnc.pl/os/4.0.10/multiproject/dlibra-app-extension-fp-djvu/downloads.html
        The project page (http://dlibra.psnc.pl/) is in Polish, so I can't tell whether it's open source.
        Javadoc is here: http://dlibra.psnc.pl/os/4.0.10/multiproject/dlibra-app-extension-fp-djvu/apidocs/index.html

        There is also this: http://javadjvu.foxtrottechnologies.com/ (GPL)

        Nick Burch added a comment -

        Both DjVuLibre and JavaDjVu are GPL'd, so we couldn't host a parser based on them in the main Tika codebase.

        However, if someone were to pick one of these two libraries (whichever is easier to work with) and write a Tika Parser based on it, we could list it in the 3rd Party Parsers list: <http://wiki.apache.org/tika/3rd%20party%20parser%20plugins>. People who are happy with the library license could then choose to download and use the plugin.

        Jukka Zitting added a comment -

        Resolving as Won't Fix until there's an upstream library that we can use.

        Meanwhile, as noted by Nick, anyone can make a 3rd party parser plugin for Tika based on the existing libraries.
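
As a starting point for such a 3rd party plugin, the sketch below shows only the first step, recognising a DjVu stream before handing it to whichever library is chosen. It uses nothing but the Java standard library. The class name `DjVuSniffer` is hypothetical and not part of Tika, and the header layout it checks ("AT&T", then an IFF "FORM" chunk with a 4-byte length, then a form type of "DJVU" or "DJVM") is an assumption that should be verified against the DjVu specification.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Hypothetical helper (not part of Tika): checks for a DjVu file signature. */
final class DjVuSniffer {
    // Assumed layout: "AT&T" + "FORM" + 4-byte big-endian length + 4-byte form type.
    private static final byte[] PREFIX = "AT&TFORM".getBytes(StandardCharsets.US_ASCII);
    // Assumed form types: DJVU = single page, DJVM = multi-page document.
    private static final String[] FORM_TYPES = {"DJVU", "DJVM"};
    private static final int HEADER_LENGTH = 16;

    private DjVuSniffer() {}

    /** Returns true if the first 16 bytes look like a DjVu header. */
    static boolean isDjVu(byte[] header) {
        if (header == null || header.length < HEADER_LENGTH) {
            return false;
        }
        if (!Arrays.equals(Arrays.copyOfRange(header, 0, PREFIX.length), PREFIX)) {
            return false;
        }
        // Bytes 8..11 hold the chunk length; the form type follows at offset 12.
        String formType = new String(header, 12, 4, StandardCharsets.US_ASCII);
        return Arrays.asList(FORM_TYPES).contains(formType);
    }

    /** Reads up to 16 bytes from the stream and checks them; consumes those bytes. */
    static boolean isDjVu(InputStream in) throws IOException {
        byte[] header = new byte[HEADER_LENGTH];
        int read = in.readNBytes(header, 0, HEADER_LENGTH);
        return read == HEADER_LENGTH && isDjVu(header);
    }
}
```

A plugin would call `DjVuSniffer.isDjVu(stream)` on a stream it can rewind (for example one wrapped in a `BufferedInputStream` with `mark`/`reset`), since the check consumes the header bytes. Actual text extraction would still be delegated to the chosen GPL library.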


          People

          • Assignee: Unassigned
          • Reporter: Oleg Tikhonov
          • Votes: 0
          • Watchers: 1
