Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-12

InputFormat used in job must be in JobTracker classpath (not loaded from job JAR)

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 0.1.0
    • None
    • None
    • ~20 node nutch mapreduce environment, running SVN trunk, on Linux

    Description

      During development, I've been creating/tweaking custom InputFormat implementations. However, when you try to run a job against a running cluster, you get:
      Exception in thread "main" java.io.IOException: java.lang.RuntimeException: java.lang.RuntimeException: java.lang.ClassNotFoundException: my.custom.InputFormat
      at org.apache.nutch.ipc.Client.call(Client.java:294)
      at org.apache.nutch.ipc.RPC$Invoker.invoke(RPC.java:127)
      at $Proxy0.submitJob(Unknown Source)
      at org.apache.nutch.mapred.JobClient.submitJob(JobClient.java:259)
      at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:288)
      at com.parc.uir.wikipedia.WikipediaJob.main(WikipediaJob.java:85)

      This error goes away if I restart the TaskTrackers/JobTracker with a classpath which includes the needed code. Other classes (Mapper, Reducer) appear to be available out of the jar file specified in the JobConf, but not the InputFormat. Obviously, it's less than idea to have to restart the JobTracker whenever there's a change to a job-specific class.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            bpendleton Bryan Pendleton
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment