Uploaded image for project: 'Apache Jena'
  1. Apache Jena
  2. JENA-601

Provide better support for compressed input formats

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Done
    • Jena 2.11.0
    • None
    • RIOT
    • None

    Description

      Currently Jena has little or not support for compressed input formats. There are the odd cases where some consideration is given e.g.

      • RDFLanguages.filenameToLang() strips off .gz extensions to help it correctly detect file types
      • HTTP responses can deal with compressed responses by virtue of Apache HttpClient

      What would be nice is to have a better strategy for handling compressed inputs. For example having a registry of known compression extensions e.g. .gz, .bz2, .deflate which ARQ would strip off when trying to deduce format from the filename.

      It would also be useful if the various locator implementations took compression into account when opening input streams as I'm fairly sure if you asked ARQ to open a foo.nt.gz file it would just open a raw input stream and then the reading would fail.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rvesse Rob Vesse
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: