Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1593

Hash join does not support schema changes in joins on JSON files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Cannot Reproduce
    • None
    • 0.7.0
    • Execution - Data Types
    • None

    Description

      Tried the following query and it is failing with the hash join doesnt support schema changes error. The columns queries in this case do not necessarily change schemas.

      SELECT  b.name, sum(r.votes.cool)
      FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
       r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
      ON (r.business_id = b.business_id)
      GROUP BY  b.name
      ORDER BY sum(r.votes.cool) DESC
      LIMIT 10;
      

      0: jdbc:drill:zk=local> SELECT b.name, sum(r.votes.cool)
      . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
      . . . . . . . . . . . > r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
      . . . . . . . . . . . > ON (r.business_id = b.business_id)
      . . . . . . . . . . . > GROUP BY b.name
      . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
      . . . . . . . . . . . > LIMIT 10;
      Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]

      Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)

      Adding the explain plan.

      0: jdbc:drill:zk=local> explain plan for SELECT b.name, sum(r.votes.cool)
      . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
      . . . . . . . . . . . > r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
      . . . . . . . . . . . > ON (r.business_id = b.business_id)
      . . . . . . . . . . . > GROUP BY b.name
      . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
      . . . . . . . . . . . > LIMIT 10
      . . . . . . . . . . . > ;
      ----------------------+

      text json

      ----------------------+

      00-00 Screen
      00-01 Project(name=[$0], EXPR$1=[$1])
      00-02 SelectionVectorRemover
      00-03 Limit(fetch=[10])
      00-04 SelectionVectorRemover
      00-05

      ----------------------+

      Attachments

        Activity

          People

            hgunes Hanifi Gunes
            Neeraja Neeraja
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: