Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10981

Allow update to load gzip files

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 6.6
    • 7.6, 8.0
    • SolrJ

    Description

      We currently import large CSV files. We store them in gzip files as they compress at around 80%.

      To import them we must gunzip them and then import them. After that we no longer need the decompressed files.

      This patch allows directly opening either URL, or local files that are gzipped.

      For URLs, to determine if the file is gzipped, it will check the content encoding=="gzip" or if the file ends in ".gz"

      For files, if the file ends in ".gz" then it will assume the file is gzipped.

      I have tested the patch with 4.10.4, 6.6.0, 7.0.1 and master from git.

      Attachments

        1. SOLR-10981.patch
          9 kB
          Andrew Lundgren
        2. SOLR-10981.patch
          11 kB
          Andrew Lundgren
        3. SOLR-10981.patch
          16 kB
          Andrew Lundgren
        4. SOLR-10981.patch
          18 kB
          Andrew Lundgren
        5. SOLR-10981.patch
          18 kB
          Andrew Lundgren
        6. SOLR-10981.patch
          18 kB
          Andrew Lundgren

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            dsmiley David Smiley Assign to me
            lundgren Andrew Lundgren
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment