Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-238

parquet reader hangs when reading nullable columns

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.4.0
    • None
    • None

    Description

      Running this physical plan:

      {
      head : {
      type : "APACHE_DRILL_PHYSICAL",
      version : 1,
      generator :

      { type : "optiq", info : "na" }

      },
      graph : [ {
      pop : "parquet-scan",
      @id : 1,
      entries : [

      { path : "/tmp/tpc-h/customer" }

      ],
      storageengine :

      { type : "parquet", dfsName : "file:///" }

      ,
      ref : "_MAP",
      fragmentPointer : 0
      }, {
      pop : "project",
      @id : 2,
      exprs : [

      { ref : "output.$f0", expr : "1" }

      ],
      child : 1
      }, {
      pop : "streaming-aggregate",
      @id : 4,
      child : 2,
      keys : [ ],
      exprs : [

      { ref : "EXPR$0", expr : "count($f0) " }

      ]
      },

      { pop : "screen", @id : 6, child : 4 }

      ]
      }

      against parquet data generated with Pig. The reader hangs. the last message in the log is this:

      records to read in this pass: 4000

      Attachments

        Activity

          People

            jaltekruse Jason Altekruse
            sphillips Steven Phillips
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: