Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-2711

Improve performance of GetTableLocations for large tables

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.9.0
    • Fix Version/s: 1.10.0
    • Component/s: master, perf
    • Labels:
      None

      Description

      Currently the response for TableLocations returns a separate copy of the TSInfoPB for every replica. These PBs are relatively large and include string UUIDs, hostnames, etc. For large tables, a response may have up to 3000 replicas. In most clusters, the table itself will be on far fewer than 3000 unique hosts, so it makes sense to return the replicas as indexes into a response-level list rather than separately duplicate the TS info in each of the replica PBs.

      Additionally, we can consider a reader-writer lock on TSDescriptor so that these requests can proceed more in parallel.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                wdberkeley William Berkeley
                Reporter:
                tlipcon Todd Lipcon
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: