Comdev GSOC / GSOC-66

Add HTrace distributed tracing for s3 and other alternative Hadoop FS implementations

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Won't Fix

    Description

      The Apache HTrace distributed tracing framework gives developers and system administrators an end-to-end view of system performance, in a manner similar to X-Trace or Dapper. See http://htrace.incubator.apache.org/

      S3, GCS, WASB, and other cloud blob stores are becoming increasingly important in Hadoop, but they do not have distributed tracing yet. Adding it would let us collect data such as the probability distributions of PUT and GET requests to S3 and their impact on MapReduce jobs. We should add HTrace distributed tracing for S3 and the other alternative Hadoop FS implementations.
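
To make the idea concrete, here is a minimal sketch of what per-request tracing of a blob store could record. It deliberately does not use the HTrace API: `BlobStoreTracingSketch`, `SpanLog`, `traced`, and `percentileNanos` are hypothetical names invented for illustration, the spans are kept in memory rather than sent to a span receiver, and request latency is assumed to be the quantity of interest.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.Supplier;

/** Hypothetical sketch of span collection for blob store calls; not the HTrace API. */
public class BlobStoreTracingSketch {

    /** One completed span: operation name plus elapsed nanoseconds. */
    public static final class Span {
        final String op;
        final long nanos;
        Span(String op, long nanos) { this.op = op; this.nanos = nanos; }
    }

    /** Collects spans in memory; a real tracer would ship them elsewhere. */
    public static final class SpanLog {
        private final List<Span> spans = new ArrayList<>();

        /** Runs the call and records its duration under the given operation name. */
        public <T> T traced(String op, Supplier<T> call) {
            long start = System.nanoTime();
            try {
                return call.get();
            } finally {
                // Recorded even if the call throws, so failed requests are visible too.
                record(op, System.nanoTime() - start);
            }
        }

        public synchronized void record(String op, long nanos) {
            spans.add(new Span(op, nanos));
        }

        public synchronized int count(String op) {
            int n = 0;
            for (Span s : spans) if (s.op.equals(op)) n++;
            return n;
        }

        /** Nearest-rank percentile (0 < p <= 100) of durations for one operation. */
        public synchronized long percentileNanos(String op, double p) {
            List<Long> durations = new ArrayList<>();
            for (Span s : spans) if (s.op.equals(op)) durations.add(s.nanos);
            if (durations.isEmpty()) {
                throw new IllegalArgumentException("no spans for " + op);
            }
            Collections.sort(durations);
            int rank = (int) Math.ceil(p / 100.0 * durations.size());
            return durations.get(Math.max(rank, 1) - 1);
        }
    }

    public static void main(String[] args) {
        SpanLog log = new SpanLog();
        // Stand-ins for blob store requests; a real FS client would do network I/O here.
        String body = log.traced("GET", () -> "object-bytes");
        log.traced("PUT", () -> body.length());
        System.out.println("GET median ns: " + log.percentileNanos("GET", 50));
        System.out.println("PUT median ns: " + log.percentileNanos("PUT", 50));
    }
}
```

With spans like these recorded per request, the distribution for each operation type can be summarized (here by percentile) and compared against job-level timings. In an actual implementation the wrapper would sit inside the Hadoop FS client for each store and would use HTrace's own tracer and span receivers instead of an in-memory list.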

            People

              Assignee: Unassigned
              Reporter: Colin McCabe (cmccabe)
              Votes: 1
              Watchers: 5

              Dates

                Created:
                Updated:
                Resolved: