[HADOOP-8368] Use CMake rather than autotools to build native code - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.0.0-alpha
Fix Version/s: 2.0.2-alpha
Component/s: None
Labels:
None

Target Version/s:
Hadoop Flags:

Incompatible change, Reviewed

Description

It would be good to use cmake rather than autotools to build the native (C/C++) code in Hadoop.

Rationale:
1. automake depends on shell scripts, which often have problems running on different operating systems. It would be extremely difficult, and perhaps impossible, to use autotools under Windows. Even if it were possible, it might require horrible workarounds like installing cygwin. Even on Linux variants like Ubuntu 12.04, there are major build issues because /bin/sh is the Dash shell, rather than the Bash shell as it is in other Linux versions. It is currently impossible to build the native code under Ubuntu 12.04 because of this problem.

CMake has robust cross-platform support, including Windows. It does not use shell scripts.

2. automake error messages are very confusing. For example, "autoreconf: cannot empty /tmp/ar0.4849: Is a directory" or "Can't locate object method "path" via package "Autom4te..." are common error messages. In order to even start debugging automake problems you need to learn shell, m4, sed, and the a bunch of other things. With CMake, all you have to learn is the syntax of CMakeLists.txt, which is simple.

CMake can do all the stuff autotools can, such as making sure that required libraries are installed. There is a Maven plugin for CMake as well.

3. Different versions of autotools can have very different behaviors. For example, the version installed under openSUSE defaults to putting libraries in /usr/local/lib64, whereas the version shipped with Ubuntu 11.04 defaults to installing the same libraries under /usr/local/lib. (This is why the FUSE build is currently broken when using OpenSUSE.) This is another source of build failures and complexity. If things go wrong, you will often get an error message which is incomprehensible to normal humans (see point #2).

CMake allows you to specify the minimum_required_version of CMake that a particular CMakeLists.txt will accept. In addition, CMake maintains strict backwards compatibility between different versions. This prevents build bugs due to version skew.

4. autoconf, automake, and libtool are large and rather slow. This adds to build time.

For all these reasons, I think we should switch to CMake for compiling native (C/C++) code in Hadoop.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HADOOP-8368-b2.003.trimmed.patch
11/Jun/12 18:26
43 kB
Colin McCabe
HADOOP-8368-b2.003.rm.patch
11/Jun/12 18:26
2 kB
Colin McCabe
HADOOP-8368-b2.002.trimmed.patch
04/Jun/12 18:45
43 kB
Colin McCabe
HADOOP-8368-b2.002.rm.patch
04/Jun/12 18:45
2 kB
Colin McCabe
HADOOP-8368-b2.001.trimmed.patch
02/Jun/12 22:51
47 kB
Colin McCabe
HADOOP-8368-b2.001.rm.patch
02/Jun/12 22:51
2 kB
Colin McCabe
HADOOP-8368-b2.001.patch
02/Jun/12 22:51
259 kB
Colin McCabe
HADOOP-8368.030.trimmed.patch
08/Jun/12 20:30
44 kB
Colin McCabe
HADOOP-8368.030.rm.patch
08/Jun/12 20:30
2 kB
Colin McCabe
HADOOP-8368.030.patch
07/Jun/12 22:08
101 kB
Colin McCabe
HADOOP-8368.030.patch
08/Jun/12 08:15
101 kB
Colin McCabe
HADOOP-8368.029.patch
07/Jun/12 19:57
102 kB
Colin McCabe
HADOOP-8368.028.trimmed.patch
06/Jun/12 19:54
44 kB
Colin McCabe
HADOOP-8368.028.rm.patch
06/Jun/12 19:54
2 kB
Colin McCabe
HADOOP-8368.026.trimmed.patch
30/May/12 23:02
47 kB
Colin McCabe
HADOOP-8368.026.rm.patch
30/May/12 23:02
2 kB
Colin McCabe
HADOOP-8368.025.trimmed.patch
30/May/12 01:21
56 kB
Colin McCabe
HADOOP-8368.024.trimmed.patch
29/May/12 22:22
56 kB
Colin McCabe
HADOOP-8368.023.trimmed.patch
26/May/12 01:23
58 kB
Colin McCabe
HADOOP-8368.021.trimmed.patch
25/May/12 19:13
50 kB
Colin McCabe
HADOOP-8368.020.trimmed.patch
25/May/12 04:42
50 kB
Colin McCabe
HADOOP-8368.020.rm.patch
25/May/12 04:41
2 kB
Colin McCabe
HADOOP-8368.018.trimmed.patch
23/May/12 22:25
65 kB
Colin McCabe
HADOOP-8368.016.trimmed.patch
23/May/12 18:31
65 kB
Colin McCabe
HADOOP-8368.015.trimmed.patch
23/May/12 04:57
53 kB
Colin McCabe
HADOOP-8368.014.trimmed.patch
23/May/12 04:21
53 kB
Colin McCabe
HADOOP-8368.012.rm.patch
22/May/12 15:44
3 kB
Colin McCabe
HADOOP-8368.012.patch
19/May/12 11:11
445 kB
Colin McCabe
HADOOP-8368.012.half.patch
22/May/12 15:40
221 kB
Colin McCabe
HADOOP-8368.010.patch
18/May/12 22:12
443 kB
Colin McCabe
HADOOP-8368.009.patch
18/May/12 19:46
443 kB
Colin McCabe
HADOOP-8368.008.patch
17/May/12 20:49
445 kB
Colin McCabe
HADOOP-8368.007.patch
17/May/12 20:29
445 kB
Colin McCabe
HADOOP-8368.006.patch
17/May/12 19:02
445 kB
Colin McCabe
HADOOP-8368.005.patch
17/May/12 07:15
444 kB
Colin McCabe
HADOOP-8368.001.patch
09/May/12 22:32
34 kB
Colin McCabe

Issue Links

blocks

HDFS-3250 Get the fuse-dfs test running

Resolved

is depended upon by

MAPREDUCE-4267 mavenize pipes

Closed

is duplicated by

HDFS-3607 log a message when fuse_dfs is not built

Closed

is related to

HADOOP-8482 give a better error message if native build dependencies are missng

Open

HADOOP-8488 test-patch.sh gives +1 even if the native build fails.

Closed

HADOOP-8480 The native build should honor -DskipTests

Closed

HADOOP-8481 update BUILDING.txt to talk about cmake rather than autotools

Closed

relates to

HADOOP-8538 CMake builds fail on ARM

Closed

HADOOP-8620 Add -Drequire.fuse and -Drequire.snappy

Closed

(2 is related to, 2 relates to)

Use CMake rather than autotools to build native code

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates