Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1671

Incorrect results reported by drill when we have more than 10 flattens (2048 records)

    XMLWordPrintableJSON

Details

    Description

      git.commit.id.abbrev=60aa446
      I ran the below test against the private branch of Jason which has some patches for bugs related to flatten which are not yet merged into the master.

      The data is in such a way that each array within the record contains only 2 records. So with each flatten added to the query the no of rows should get doubled

      The below query works as expected

      0: jdbc:drill:schema=dfs.drillTestDir>select count(*) from (select id, flatten(evnts1), flatten(evnts2), flatten(evnts3), flatten(evnts4), flatten(evnts5), flatten(evnts6), flatten(evnts7), flatten(evnts8), flatten(evnts9), flatten(evnts10) from `json_kvgenflatten/many-arrays-50.json`) ;
      +------------+
      |   EXPR$0   |
      +------------+
      | 1024       |
      +------------+
      

      However the below query reports incorrect results. The correct output is 2048.

      0: jdbc:drill:schema=dfs.drillTestDir> select count(*) from (select id, flatten(evnts1), flatten(evnts2), flatten(evnts3), flatten(evnts4), flatten(evnts5), flatten(evnts6), flatten(evnts7), flatten(evnts8), flatten(evnts9), flatten(evnts10), flatten(evnts11) from `json_kvgenflatten/many-arrays-50.json`) ;
      +------------+
      |   EXPR$0   |
      +------------+
      | 2047       |
      +------------+
      

      From here on no matter how many flattens we add to the query, the output still remains the same. However the duration of the query seems to more and more with each new flatten added.

      I attached the data file. Let me know if you have any questions.

      Attachments

        1. many-arrays-50.json
          10 kB
          Rahul Kumar Challapalli

        Activity

          People

            Unassigned Unassigned
            rkins Rahul Kumar Challapalli
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: