Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.16
    • Fix Version/s: None
    • Component/s: parser
    • Labels:
      None

      Description

      In theory, it's straightforward to implement support for PCX and DCX: both Commons Imaging and ImageIO (via TwelveMonkeys) can read these formats.

      In practice, however, I'm not really sure how to implement support. We obviously want to OCR the images, but Tesseract has no support for these formats. So where do we do the conversion to a BufferedImage? I tried to find out how JBIG2 files are handled, but I can't find that anywhere.
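
      To make the question concrete, here is a minimal sketch of the conversion step itself, not a proposal for where it should live in the parser. It assumes a PCX-capable ImageIO plugin (such as the TwelveMonkeys one mentioned above) is on the classpath, so that plain ImageIO.read can decode the input, and it re-encodes the result as PNG, which Tesseract can read. The class name PcxToPngSketch and the method toPng are hypothetical. Note that ImageIO.read decodes only the first image in a stream, so multi-page DCX files would need extra handling.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import javax.imageio.ImageIO;

// Hypothetical helper, for illustration only; not part of any existing parser.
public class PcxToPngSketch {

    /**
     * Decodes the stream with whatever ImageIO readers are registered
     * (assumption: a PCX plugin has been added to the classpath) and
     * re-encodes the first image as PNG.
     *
     * @return the PNG bytes, or null if no registered reader recognises the input
     */
    public static byte[] toPng(InputStream in) throws IOException {
        // ImageIO.read returns null when no registered ImageReader claims the stream.
        BufferedImage image = ImageIO.read(in);
        if (image == null) {
            return null;
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        // ImageIO.write returns false when no writer exists for the format name.
        if (!ImageIO.write(image, "png", out)) {
            throw new IOException("No PNG writer available");
        }
        return out.toByteArray();
    }
}
```

      The returned bytes could then be written to a temporary file and handed to the existing OCR step. Where that hand-off should happen is exactly the open question in this issue.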

            People

            • Assignee:
              Unassigned
              Reporter:
              mcaruanagalizia Matthew Caruana Galizia
            • Votes:
              0
              Watchers:
              5