Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-5623

ORC accessing array column that's empty will fail with java out of bound exception

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.11.0
    • 1.3.0, 2.0.0
    • File Formats

    Description

      In our ORC tests we saw that queries that work on RCFile failed on the corresponding ORC version with Java IndexOutOfBoundsException in OrcStruct.java. The queries failed b/c the table has an array type column and there are rows with an empty array. We noticed that the getList(Object list, int i) method in OrcStruct.java simply returns the i-th element from list without checking if list is not null or if i is within valid range. After fixing that the queries run fine. The fix is really simple, but maybe there are other similar cases that need to be handled.
      The fix is to check if listObj is null and if i falls within range:

          public Object getListElement(Object listObj, int i) {
            if (listObj == null) {
                return null;
            }
            List list = ((List) listObj);
            if (i < 0 || i >= list.size()) {
                return null;
            }
            return list.get(i);
          }
      

      Attachments

        1. HIVE-5623.patch
          1 kB
          Prasanth Jayachandran

        Issue Links

          Activity

            People

              prasanth_j Prasanth Jayachandran
              ericchu30 Eric Chu
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: