Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-10986

hadoop tarball is twice as big as prev. version and 6 times as big unpacked

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Duplicate
    • 2.5.0
    • None
    • None
    • None

    Description

      I noticed that the binary tarball for 2.5.0 is almost 300MB, while 2.4.1 is only 132MB. Unpacking the latest tarball gives me 1.8 GB of stuff, with the majority in the "share" directory.

      $ cd hadoop-2.4.1
      $ du -sh *
      364K    bin
      356K    etc
      100K    include
      2,3M    lib
      128K    libexec
      24K     LICENSE.txt
      12K     NOTICE.txt
      12K     README.txt
      336K    sbin
      280M    share
      
       $ cd hadoop-2.5.0 
       $ du -sh *
      512K    bin
      332K    etc
      100K    include
      4,6M    lib
      128K    libexec
      336K    sbin
      1,8G    share
      

      I also saw some warnings from tar while unpacking:

      $ tar xf hadoop-2.5.0.tar.gz 
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      tar: Ignoring unknown extended header keyword `SCHILY.dev'
      tar: Ignoring unknown extended header keyword `SCHILY.ino'
      tar: Ignoring unknown extended header keyword `SCHILY.nlink'
      

      Attachments

        Issue Links

          Activity

            People

              kasha Karthik Kambatla
              fs111 André Kelpe
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: