Hadoop Common / HADOOP-6986

SequenceFile.Reader should distinguish between Network IOE and Parsing IOE

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 0.20-append, 0.21.1, 0.22.0
    • Fix Version/s: None
    • Component/s: io
    • Labels: None

    Description

      The SequenceFile.Reader API should give the user an easy way to distinguish between a network/low-level IOException (IOE) and a parsing IOE. The use case appeared recently in the HBase project:

      Originally, if a RegionServer got an IOE from HDFS while opening a region file, it would abort the open and let the HMaster reassign the region, on the assumption that the failure was a network problem likely to disappear at a later time or on a different partition of the network. However, if HBase gets a parsing exception, we want to log the problem and continue opening the region anyway, because a parsing failure is idempotent (it recurs identically on every attempt) and retries won't fix it.

      Although this problem was found in HBase, it seems to be a generic one: callers need an easier way to tell idempotent errors from transient ones.
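
      The distinction described above could be exposed to callers through the exception type. The following is a minimal sketch only, not the approach taken in the attached patches and not an existing Hadoop API: ParsingIOException, FileOpener, Action, and tryOpen are hypothetical names introduced for illustration. It assumes the reader would throw a dedicated IOException subclass for parse failures, so a caller can catch it before the general IOException handler.

```java
import java.io.IOException;

public class OpenPolicySketch {

    /** Hypothetical marker type for parse failures; NOT part of Hadoop's SequenceFile API. */
    public static class ParsingIOException extends IOException {
        public ParsingIOException(String message) {
            super(message);
        }
    }

    /** Hypothetical outcomes a caller such as a RegionServer might choose between. */
    public enum Action { OPENED, SKIP_AND_LOG, ABORT_AND_REASSIGN }

    /** Hypothetical stand-in for "open and read a file"; any IOException may escape. */
    public interface FileOpener {
        void open() throws IOException;
    }

    /** Maps the kind of failure to a recovery decision. */
    public static Action tryOpen(FileOpener opener) {
        try {
            opener.open();
            return Action.OPENED;
        } catch (ParsingIOException e) {
            // The subclass must be caught first. A parse failure recurs on
            // every attempt, so retrying elsewhere would not help.
            return Action.SKIP_AND_LOG;
        } catch (IOException e) {
            // Any other IOException is treated as possibly transient
            // (for example a network failure), so give up and let another
            // node try later.
            return Action.ABORT_AND_REASSIGN;
        }
    }

    public static void main(String[] args) {
        System.out.println(tryOpen(() -> { }));
        System.out.println(tryOpen(() -> { throw new ParsingIOException("bad header"); }));
        System.out.println(tryOpen(() -> { throw new IOException("connection reset"); }));
    }
}
```

      With a type-based split like this, existing callers that only catch IOException would keep their current behavior, while callers that care about the difference can add the narrower catch clause.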

      Attachments

        1. HADOOP-6986_20-append.patch (5 kB, Nicolas Spiegelberg)
        2. HADOOP-6986_0.21.patch (5 kB, Nicolas Spiegelberg)

            People

              Assignee: Unassigned
              Reporter: Nicolas Spiegelberg (nspiegelberg)
              Votes: 0
              Watchers: 6
