Uploaded image for project: 'Phoenix'
  1. Phoenix
  2. PHOENIX-2649

GC/OOM during BulkLoad

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 4.7.0
    • 4.7.0
    • None
    • None
    • Mac OS, Hadoop 2.7.2, HBase 1.1.2

    Description

      Phoenix fails to complete bulk load of 40Mb csv data with GC heap error during Reduce phase. The problem is in the comparator for TableRowkeyPair. It expects that the serialized value was written using zero-compressed encoding, but at least in my case it was written in regular way. So, trying to obtain length for table name and row key it always get zero and reports that those byte arrays are equal. As the result, the reducer receives all data produced by mappers in one reduce call and fails with OOM.

      Attachments

        1. PHOENIX-2649-4.patch
          4 kB
          Sergey Soldatov
        2. PHOENIX-2649-3.patch
          4 kB
          Sergey Soldatov
        3. PHOENIX-2649-2.patch
          4 kB
          Sergey Soldatov
        4. PHOENIX-2649-1.patch
          6 kB
          maghamravikiran
        5. PHOENIX-2649.patch
          5 kB
          maghamravikiran

        Activity

          People

            sergey.soldatov Sergey Soldatov
            sergey.soldatov Sergey Soldatov
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: