  1. Apache Drill
  2. DRILL-1743

IndexOutOfBoundsException when reading the publicly available yelp json dataset

Details

    Description

      git.commit.id.abbrev=108d29f

      Query

      select attributes from `json_kvgenflatten/yelp-business-small.json`;
      

      I am seeing the error below. I narrowed the data set down to 4500 records. When I split it further (2 files of 2225 records each), the error did not show up in either split, so it might have something to do with record batch boundaries.

      java.lang.IndexOutOfBoundsException: index: 16384, length: 4 (expected: range(0, 16384))
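
      The message reads like a buffer bounds check: a 4-byte access starting at index 16384 begins exactly at the end of a 16384-byte buffer, which holds 4096 four-byte values. The sketch below only illustrates that arithmetic. It is not Drill's code; `BoundsCheckSketch`, `checkIndex` and `tryRead` are hypothetical names, and the assumption that the failing access is a 4-byte read comes only from `length: 4` in the message.

```java
public class BoundsCheckSketch {
    // Hypothetical helper: rejects an access of `length` bytes at `index`
    // that does not fit inside a buffer of `capacity` bytes.
    static void checkIndex(int index, int length, int capacity) {
        if (index < 0 || length < 0 || index > capacity - length) {
            throw new IndexOutOfBoundsException(String.format(
                "index: %d, length: %d (expected: range(0, %d))",
                index, length, capacity));
        }
    }

    // Returns the exception message, or null when the access fits.
    static String tryRead(int index, int length, int capacity) {
        try {
            checkIndex(index, length, capacity);
            return null;
        } catch (IndexOutOfBoundsException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        int capacity = 16384; // bytes, taken from the reported message

        // Last valid 4-byte slot: bytes 16380..16383.
        System.out.println(tryRead(16380, 4, capacity));

        // One slot past the end: the access reported in this issue.
        System.out.println(tryRead(16384, 4, capacity));
    }
}
```

      An off-by-one of this kind would be consistent with the failure depending on where a batch fills up, although the message alone does not confirm that.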
      

      I attached the dataset and the error log files. Let me know if you need anything else.

      Attachments

        1. yelp-business-small.json
          3.62 MB
          Rahul Kumar Challapalli
        2. error.log
          6 kB
          Rahul Kumar Challapalli
        3. DRILL-1743.1.patch.txt
          1 kB
          Hanifi Gunes

          People

            parthc Parth Chandra
            rkins Rahul Kumar Challapalli
            Votes: 0
            Watchers: 3
