Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-955

Hadoop doesn't schedule the tasks close to the data

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: 0.6.1
    • Component/s: None
    • Labels:
      None

      Description

      Hadoop relies on locations for data in input splits being represented as hostnames and not ip addresses. Currently in my testing tasks are more often then not being scheduled on a node that does not contain the data requested.

        Attachments

          Activity

            People

            • Assignee:
              johanoskarsson Johan Oskarsson
              Reporter:
              johanoskarsson Johan Oskarsson
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: