Uploaded image for project: 'ORC'
  1. ORC
  2. ORC-374

Possible to reduce size of release tarballs?

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      We are building the Apache ORC C++ library as a dependency of Apache Arrow. I have noticed that the latest release tarball for ORC is about 13 MB.

      It looks like is caused by a combination of

      • Data files used for testing
      • Generated Javadoc

      Here's the du output

      $ du -d 2 -h .
      14M	./examples/expected
      23M	./examples
      12K	./proto
      48K	./cmake_modules
      40K	./site/develop
      12K	./site/security
      18M	./site/api
      24K	./site/_layouts
      16K	./site/_data
      16K	./site/js
      468K	./site/img
      8.0K	./site/help
      116K	./site/specification
      16K	./site/news
      8.0K	./site/talks
      520K	./site/fonts
      24K	./site/_sass
      88K	./site/_includes
      120K	./site/_posts
      108K	./site/_docs
      32K	./site/css
      20M	./site
      8.0K	./docker/centos7
      8.0K	./docker/centos6
      8.0K	./docker/ubuntu16-clang5
      8.0K	./docker/ubuntu12
      8.0K	./docker/debian8
      8.0K	./docker/debian7
      8.0K	./docker/ubuntu14
      8.0K	./docker/ubuntu16
      76K	./docker
      256K	./tools/test
      56K	./tools/src
      320K	./tools
      8.0K	./.git/info
      28K	./.git/refs
      52K	./.git/hooks
      32K	./.git/logs
      4.0K	./.git/branches
      22M	./.git/objects
      22M	./.git
      64K	./java/examples
      260K	./java/mapreduce
      2.3M	./java/core
      472K	./java/tools
      128K	./java/shims
      356K	./java/bench
      3.6M	./java
      708K	./c++/test
      104K	./c++/include
      664K	./c++/src
      948K	./c++/libs
      2.5M	./c++
      71M	.
      

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                wesmckinn Wes McKinney
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated: