Hive
  1. Hive
  2. HIVE-3182

Add an option in hive to ignore corrupt data

    Details

      Description

      In many scenarios, we have seen java.lang.InternalError due to corruption.
      This may be due to LZMA or some other kind of corrupt data. It would be
      useful to add an option in hive to ignore corrupt data, or ignore any internal
      errors. Typically, this should only be used to handle corrupt data.

        Activity

        Hide
        Edward Capriolo added a comment -

        Doesn't hadoop offer skip failed row options?

        Show
        Edward Capriolo added a comment - Doesn't hadoop offer skip failed row options?
        Hide
        Carl Steinbach added a comment -

        Any options that are added should be narrowly defined and specific to a particular class of errors. We want to avoid the situation where a user enables this property to mask one type of problem only to end up missing other problems which are important.

        Show
        Carl Steinbach added a comment - Any options that are added should be narrowly defined and specific to a particular class of errors. We want to avoid the situation where a user enables this property to mask one type of problem only to end up missing other problems which are important.

          People

          • Assignee:
            Namit Jain
            Reporter:
            Namit Jain
          • Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

            • Created:
              Updated:

              Development