  1. Hadoop Map/Reduce
  2. MAPREDUCE-2390

JobTracker and TaskTrackers fail with a misleading error if one of the mapreduce.cluster.dir directories has unusable permissions or is unavailable.


Details

    • jobtracker, tasktracker

    Description

      To reproduce, set the mapred.local.dir property to a list of several directories. Before starting the JT, change the permissions of one of these directories to 'd---------', and then start the JT/TT. The JT tries to ignore this directory, but then fails with an odd and misleading message claiming that its configured address is already in use.

      Fixing the permission clears this issue!
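
Until the startup message is made clearer, a quick pre-flight check of the configured directories can point at the real culprit. The following is only a minimal sketch, not Hadoop code: the class `LocalDirCheck` and the method `findUnusableDirs` are hypothetical names, and the sketch assumes the value of mapred.local.dir is passed in as a comma-separated string. Note that a process running as root may still report a 'd---------' directory as usable.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

public class LocalDirCheck {
    /**
     * Returns the entries of a comma-separated directory list that are not
     * usable: missing, not a directory, or lacking read/write/execute access
     * for the current user. (Hypothetical helper, not part of Hadoop.)
     */
    static List<String> findUnusableDirs(String commaSeparatedDirs) {
        List<String> unusable = new ArrayList<>();
        for (String entry : commaSeparatedDirs.split(",")) {
            String path = entry.trim();
            if (path.isEmpty()) {
                continue; // skip empty entries such as a trailing comma
            }
            File dir = new File(path);
            boolean usable = dir.isDirectory()
                    && dir.canRead()
                    && dir.canWrite()
                    && dir.canExecute();
            if (!usable) {
                unusable.add(path);
            }
        }
        return unusable;
    }

    public static void main(String[] args) {
        // Pass the same value that mapred.local.dir is set to.
        String dirs = args.length > 0 ? args[0] : System.getProperty("java.io.tmpdir");
        for (String bad : findUnusableDirs(dirs)) {
            System.err.println("Unusable local dir: " + bad);
        }
    }
}
```

Running it with the configured list as the single argument prints one line per directory that the current user cannot fully access.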

      This was also reported on the mailing list by Ted Yu quite a few months ago, but I had forgotten to file a bug for it here. It still seems to happen. A log is included below.

      2011-03-17 00:40:32,321 WARN org.apache.hadoop.mapred.JobTracker: Error starting tracker: java.io.IOException: Cannot create toBeDeleted in /home/hack/.tmplocalz/2
              at org.apache.hadoop.util.MRAsyncDiskService.<init>(MRAsyncDiskService.java:86)
              at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2189)
              at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2022)
              at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:276)
              at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:268)
              at org.apache.hadoop.mapred.JobTracker.main(JobTracker.java:4712)
      
      2011-03-17 00:40:33,322 INFO org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager: Updating the current master key for generating delegation tokens
      2011-03-17 00:40:33,322 INFO org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager: Starting expired delegation token remover thread, tokenRemoverScanInterval=60 min(s)
      2011-03-17 00:40:33,322 INFO org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager: Updating the current master key for generating delegation tokens
      2011-03-17 00:40:33,322 INFO org.apache.hadoop.mapred.JobTracker: Scheduler configured with (memSizeForMapSlotOnJT, memSizeForReduceSlotOnJT, limitMaxMemForMapTasks, limitMaxMemForReduceTasks) (-1, -1, -1, -1)
      2011-03-17 00:40:33,322 INFO org.apache.hadoop.util.HostsFileReader: Refreshing hosts (include/exclude) list
      2011-03-17 00:40:33,350 INFO org.apache.hadoop.mapred.JobTracker: Starting jobtracker with owner as hack
      2011-03-17 00:40:33,351 FATAL org.apache.hadoop.mapred.JobTracker: java.net.BindException: Problem binding to localhost/127.0.0.1:8021 : Address already in use
              at org.apache.hadoop.ipc.Server.bind(Server.java:227)
              at org.apache.hadoop.ipc.Server$Listener.<init>(Server.java:314)
              at org.apache.hadoop.ipc.Server.<init>(Server.java:1411)
              at org.apache.hadoop.ipc.RPC$Server.<init>(RPC.java:510)
              at org.apache.hadoop.ipc.RPC.getServer(RPC.java:471)
              at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2112)
              at org.apache.hadoop.mapred.JobTracker.<init>(JobTracker.java:2022)
              at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:276)
              at org.apache.hadoop.mapred.JobTracker.startTracker(JobTracker.java:268)
              at org.apache.hadoop.mapred.JobTracker.main(JobTracker.java:4712)
      Caused by: java.net.BindException: Address already in use
              at sun.nio.ch.Net.bind(Native Method)
              at sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:126)
              at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:59)
              at org.apache.hadoop.ipc.Server.bind(Server.java:225)
              ... 9 more
      
      2011-03-17 00:40:33,352 INFO org.apache.hadoop.mapred.JobTracker: SHUTDOWN_MSG: 
      /************************************************************
      SHUTDOWN_MSG: Shutting down JobTracker at QDuo/127.0.0.1
      ************************************************************/
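
One plausible reading of the log above, not confirmed in this report: both stack traces pass through the same JobTracker.startTracker line (276), the first failing at JobTracker.java:2189 and the second at the RPC.getServer call at JobTracker.java:2112. That would be consistent with a retried startup finding the port still held by a listener left over from the first attempt. The sketch below only illustrates that general failure pattern with plain java.net sockets. `RetryBindDemo`, `startOnce`, and `demo` are hypothetical names, and nothing here is taken from the Hadoop sources.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.ServerSocket;
import java.util.ArrayList;
import java.util.List;

public class RetryBindDemo {
    // Sockets left open by failed attempts. Holding them here keeps the demo
    // deterministic; it stands in for whatever keeps the first listener alive.
    static final List<ServerSocket> LEAKED = new ArrayList<>();

    /** One startup attempt: bind first, then run a later step that may fail. */
    static ServerSocket startOnce(InetSocketAddress addr, boolean laterStepFails)
            throws IOException {
        ServerSocket server = new ServerSocket();
        try {
            server.bind(addr);
        } catch (IOException e) {
            server.close();
            throw e;
        }
        if (laterStepFails) {
            LEAKED.add(server); // the bound socket is never closed: the leak
            throw new IOException("Cannot create toBeDeleted (simulated)");
        }
        return server;
    }

    /** Returns "bound" if the retry succeeds, else the exception's simple name. */
    static String demo() {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        try {
            try {
                // Port 0 lets the OS pick a free port for the first attempt.
                startOnce(new InetSocketAddress(loopback, 0), true);
            } catch (IOException first) {
                System.out.println("WARN attempt 1: " + first.getMessage());
            }
            int port = LEAKED.get(0).getLocalPort();
            // Retry on the same port while the first socket is still open.
            try (ServerSocket second =
                    startOnce(new InetSocketAddress(loopback, port), false)) {
                return "bound";
            } catch (IOException secondFailure) {
                System.out.println("FATAL attempt 2: " + secondFailure);
                return secondFailure.getClass().getSimpleName();
            }
        } finally {
            for (ServerSocket s : LEAKED) {
                try { s.close(); } catch (IOException ignored) { }
            }
            LEAKED.clear();
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In this sketch the first failure is the informative one, and the second only reports the port conflict. If the JT behaves similarly, closing the listener when a later startup step fails, or not retrying on such errors, would keep the original message visible.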
      

      The list conversation in context, at search-hadoop.com:
      http://search-hadoop.com/m/FzN7iqreL/problem+starting+cdh3b2+jobtracker&subj=problem+starting+cdh3b2+jobtracker

      I'll try to investigate and post the exact problem / solution soon.

          People

            qwertymaniac Harsh J