Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1713

sqlline dumps core when querying yelp JSON dataset

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Cannot Reproduce
    • 0.6.0
    • 0.7.0
    • Client - CLI
    • None
    • Mac OS X, Drill 0.6.0, invoked as /opt/drill/bin/sqlline -u jdbc:drill:zk=local

    Description

      Dataset is 113MB of JSON from the yelp dataset challenge, data available here: http://www.yelp.com/dataset_challenge

      sqlline crashes many lines into the output.

      My query:

      SELECT
      CAST(t.votes.funny AS INT) AS funny,
      CAST(t.votes.useful AS INT) AS useful,
      CAST(t.votes.cool AS INT) AS cool,
      t.user_id,
      t.review_id,
      CAST(t.stars AS INT) as stars,
      t.`date`,
      t.text,
      t.type,
      t.business_id
      FROM dfs.`/Users/vince/Desktop/data/yelp/yelp_academic_dataset_review.json` AS t

      Attachments

        Activity

          People

            hgunes Hanifi Gunes
            vicenteg Vince Gonzalez
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: