Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1743

IndexOutOfBoundsException when reading the publicly available yelp jsongit.commit.id.abbrev=108d29f dataset

    XMLWordPrintableJSON

Details

    Description

      git.commit.id.abbrev=108d29f

      Query

      select attributes from `json_kvgenflatten/yelp-business-small.json`;
      

      I am seeing the below error. I narrowed down the data set so that it contains 4500 records. When I tried to split it further (2 files of 2225 records each), the error did not show up in either of the split. It might have something to do with the record batch boundaries.

      java.lang.IndexOutOfBoundsException: index: 16384, length: 4 (expected: range(0, 16384))
      

      I attached the dataset and the error log files. Let me know if you need anything else.

      Attachments

        1. error.log
          6 kB
          Rahul Kumar Challapalli
        2. yelp-business-small.json
          3.62 MB
          Rahul Kumar Challapalli
        3. DRILL-1743.1.patch.txt
          1 kB
          Hanifi Gunes

        Issue Links

          Activity

            People

              parthc Parth Chandra
              rkins Rahul Kumar Challapalli
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: