Uploaded image for project: 'CarbonData'
  1. CarbonData
  2. CARBONDATA-3426

Fix Load performance degrade by fixing task distribution

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.6.0
    • None
    • None

    Description

      Problem: Load performance degrade by fixing task distribution issue.

      Cause: Consider 3 node cluster (host name a,b,c with IP1, IP2, IP3 as ip address), to launch load task, host name is required from NewCarbonDataLoadRDD in getPreferredLocations(). But if the driver is a (IP1), 

      result is IP1, b,c instead of a,b,c. Hence task was not launching to one executor which is same ip as driver.

      getLocalhostIPs is modified in current version recently and instead of IP it was returning address, hence local ip hostanme was removed instead of address.

      solution: Revert the change in getLocalhostIPs as it is not used in any other flow.  

       

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              Ajantha_Bhat Ajantha Bhat
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1.5h
                  1.5h