Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-31

pattern match for bulk-import in-progress markers is not scalable

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.3.5-incubating
    • gc
    • None

    Description

      When you ask HDFS for tables//bulk_/processing_proc_*, the NameNode seems to be shipping back all the file names, and the client is performing the pattern match. This results in very bad NameNode performance during accumulo garbage collection.

      This is fixed in the trunk: markers are put in the METADATA table.

      However, the fix is needed in very large accumulo installations presently using 1.3.x.

      Attachments

        Activity

          People

            ecn Eric C. Newton
            ecn Eric C. Newton
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 2h
                2h
                Remaining:
                Remaining Estimate - 2h
                2h
                Logged:
                Time Spent - Not Specified
                Not Specified