HBase / HBASE-10413

TableSplit.getLength returns 0


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.96.1.1
    • Fix Version/s: 0.98.1, 0.99.0
    • Component/s: Client, mapreduce
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: TableSplit.getLength() now returns the size of the region in bytes. The M/R framework uses it for better scheduling.

    Description

      InputSplits should be sorted by length, but TableSplit does not contain a real getLength implementation:

      @Override
      public long getLength() {
        // Not clear how to obtain this... seems to be used only for sorting splits
        return 0;
      }

      This is causing us scheduling problems: we have jobs that are supposed to finish within a limited time, but they often get stuck in the last mapper, which is working on a large region.
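To illustrate why the missing length matters, here is a minimal sketch. It assumes each mapper's running time is roughly proportional to its split length, that there is a fixed number of mapper slots, and that the framework starts the largest splits first when lengths are known. The class and method names (SplitSchedulingSketch, makespan, largestFirst) are hypothetical and are not part of Hadoop or HBase.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical simulation; it does not use or mirror real Hadoop/HBase classes.
final class SplitSchedulingSketch {

    // Returns the time at which the last mapper finishes when splits are
    // started in list order, each on whichever slot becomes free first.
    static long makespan(List<Long> splitCosts, int slots) {
        if (slots < 1) {
            throw new IllegalArgumentException("slots must be >= 1");
        }
        PriorityQueue<Long> freeAt = new PriorityQueue<>();
        for (int i = 0; i < slots; i++) {
            freeAt.add(0L);
        }
        long finish = 0L;
        for (long cost : splitCosts) {
            long end = freeAt.poll() + cost; // earliest free slot takes the split
            freeAt.add(end);
            finish = Math.max(finish, end);
        }
        return finish;
    }

    // Orders splits largest first, which is only possible when the
    // lengths are real values rather than all zeros.
    static List<Long> largestFirst(List<Long> splitCosts) {
        List<Long> sorted = new ArrayList<>(splitCosts);
        sorted.sort(Comparator.reverseOrder());
        return sorted;
    }

    public static void main(String[] args) {
        // Four small regions and one large region that happens to come last.
        List<Long> costs = List.of(10L, 10L, 10L, 10L, 40L);
        System.out.println(makespan(costs, 2));               // 60
        System.out.println(makespan(largestFirst(costs), 2)); // 40
    }
}
```

With two slots, the large region scheduled last leaves one mapper running alone at the end. Starting it first lets the small regions fill the other slot in parallel.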

      Can we implement this method?
      What is the best way?

      We were thinking about estimating the size from the size of the files on HDFS.
      We would like to get the Scanner from TableSplit, use startRow, stopRow, and the column families to find the corresponding region, and then compute the size on HDFS for that region and column family.
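The file-size idea above could be sketched roughly as follows. This is an assumption-heavy illustration: it uses the local filesystem through java.nio.file as a stand-in for HDFS, assumes a directory layout of region directory / column family / store files, and uses hypothetical names (RegionSizeSketch, estimateRegionSize, demoSize). A real implementation would go through the Hadoop FileSystem API and would first have to resolve the region from the split's row range.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Set;

// Hypothetical sketch: the local filesystem stands in for HDFS here.
final class RegionSizeSketch {

    // Sums the sizes of the regular files directly under regionDir/<family>
    // for each requested column family.
    static long estimateRegionSize(Path regionDir, Set<String> families) {
        long total = 0L;
        for (String family : families) {
            Path familyDir = regionDir.resolve(family);
            if (!Files.isDirectory(familyDir)) {
                continue; // no directory for this family, so it contributes 0 bytes
            }
            try (DirectoryStream<Path> files = Files.newDirectoryStream(familyDir)) {
                for (Path file : files) {
                    if (Files.isRegularFile(file)) {
                        total += Files.size(file);
                    }
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
        return total;
    }

    // Builds a throwaway region directory with two families and measures one.
    static long demoSize() {
        try {
            Path region = Files.createTempDirectory("region-sketch");
            Path cf1 = Files.createDirectories(region.resolve("cf1"));
            Path cf2 = Files.createDirectories(region.resolve("cf2"));
            Files.write(cf1.resolve("storefile-a"), new byte[1024]);
            Files.write(cf1.resolve("storefile-b"), new byte[2048]);
            Files.write(cf2.resolve("storefile-c"), new byte[512]);
            // Only cf1 is requested; "missing" has no directory and is skipped.
            return estimateRegionSize(region, Set.of("cf1", "missing"));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demoSize()); // 3072
    }
}
```

Restricting the sum to the scanned column families keeps the estimate closer to what the mapper will actually read. It still ignores data that has not yet been flushed to files.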

      Update:
      This ticket was about a production issue. I talked with the person who worked on it, and he said our production issue was probably not directly caused by getLength() returning 0.

      Attachments

        1. 10413.addendum (1 kB, Ted Yu)
        2. 10413-7.patch (24 kB, Ted Yu)
        3. HBASE-10413.patch (24 kB, Lukas Nalezenec)
        4. HBASE-10413-2.patch (24 kB, Lukas Nalezenec)
        5. HBASE-10413-3.patch (23 kB, Lukas Nalezenec)
        6. HBASE-10413-4.patch (23 kB, Lukas Nalezenec)
        7. HBASE-10413-5.patch (23 kB, Lukas Nalezenec)
        8. HBASE-10413-6.patch (25 kB, Lukas Nalezenec)

            People

              Assignee: Lukas Nalezenec
              Reporter: Lukas Nalezenec
              Votes: 0
              Watchers: 8
