Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-20827

Inconsistent results for empty arrays

    XMLWordPrintableJSON

Details

    Description

      LazySimpleDeserializeRead parses an empty array wrong. For example, a line ',' in a text file table with a delimiter ',' and schema 'array<int>, array<array<string>>' shows [null], [[""]], instead of [], [] with MapReduce engine and vectorized execution enabled. LazySimpleDeserializeRead has following code; 

      switch (complexField.complexCategory) {
      case LIST:
        {
          // Allow for empty string, etc.
          final boolean isNext = (fieldPosition <= complexFieldEnd);
      

      Empty string value read should be only applied to string families, not to other data types. 

      Attachments

        1. HIVE-20827.1.patch
          12 kB
          Teddy Choi

        Issue Links

          Activity

            People

              teddy.choi Teddy Choi
              teddy.choi Teddy Choi
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 0.5h
                  0.5h