Apache Drill / DRILL-3266

Drill's hive storage plugin cannot find RegexSerDe


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.1.0
    • Component/s: Storage - Hive
    • Labels: None

    Description

      git.commit.id.abbrev=5f26b8b

      Hive DDL:

      CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
      remote_host STRING,
      remote_logname STRING,
      remote_user STRING,
      request_time STRING,
      first_line STRING,
      http_status STRING,
      bytes STRING
      )
      ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
      WITH SERDEPROPERTIES (
      "input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\") (-|[0-9]*) (-|[0-9]*)",
      "output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
      )
      LOCATION '/drill/testdata/hive_storage/raw_access_log';
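
      The "input.regex" in the DDL above can be sanity-checked outside Hive and Drill. The following is a minimal Python sketch, not part of the original report: it applies the same pattern (after Hive string unescaping) to a hypothetical access-log line and maps the seven capture groups to the table's columns. The sample line and the helper name parse_line are assumptions for illustration, and the sketch assumes the SerDe requires the pattern to match the whole line.

```python
import re

# Pattern from the DDL after Hive string unescaping:
# "\\[" in the DDL becomes "\[" and "\"" becomes '"'.
INPUT_REGEX = (
    r'([^ ]*) ([^ ]*) ([^ ]*) (-|\[[^\]]*\]) '
    r'([^ "]*|"[^"]*") (-|[0-9]*) (-|[0-9]*)'
)

# Column names in the order declared in the DDL.
COLUMNS = [
    "remote_host", "remote_logname", "remote_user", "request_time",
    "first_line", "http_status", "bytes",
]


def parse_line(line):
    """Return a column -> value dict, or None if the line does not match."""
    match = re.fullmatch(INPUT_REGEX, line)  # assumption: whole-line match
    if match is None:
        return None
    return dict(zip(COLUMNS, match.groups()))


# Hypothetical sample line; it is not taken from the attached apache_log.txt.
SAMPLE = (
    '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
    '"GET /apache_pb.gif HTTP/1.0" 200 2326'
)

row = parse_line(SAMPLE)
for name in COLUMNS:
    print(name, "=", row[name])
```

      A successful match here only confirms that the pattern and data agree. It says nothing about whether the SerDe class itself can be loaded, which is the failure reported below.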
      

      When I query this table from sqlline, I get the following error:

      0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
      Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe not found)
      
      
      [Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010] (state=,code=0)
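
      One way to narrow down a ClassNotFoundException like the one above is to check which jars in a given directory actually contain the missing class. The sketch below is a hypothetical diagnostic helper, not part of the original report or of Drill: it treats each jar as a zip archive and looks for the class file entry. The function names and the directory passed in are assumptions.

```python
import zipfile
from pathlib import Path

# Fully qualified class name from the error message.
CLASS_NAME = "org.apache.hadoop.hive.contrib.serde2.RegexSerDe"


def class_entry(class_name):
    """Convert a dotted class name to its path inside a jar."""
    return class_name.replace(".", "/") + ".class"


def jars_containing(class_name, jar_dir):
    """Return the names of jars under jar_dir that contain class_name."""
    entry = class_entry(class_name)
    hits = []
    for jar in sorted(Path(jar_dir).rglob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as archive:
                if entry in archive.namelist():
                    hits.append(jar.name)
        except zipfile.BadZipFile:
            # Skip files that have a .jar suffix but are not valid archives.
            continue
    return hits
```

      Calling jars_containing(CLASS_NAME, jar_dir) with jar_dir set to the directory holding the Drillbit's jars (a path that depends on the installation) returns an empty list if no jar there ships the class, which would be consistent with the error above.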
      

      The query works from Hive without any issues. I have attached the data set and the error log.

      Attachments

        1. DRILL-3266-1.patch
          8 kB
          Venki Korukanti
        2. error.log
          13 kB
          Rahul Kumar Challapalli
        3. apache_log.txt
          5 kB
          Rahul Kumar Challapalli


          People

            Assignee: Venki Korukanti
            Reporter: Rahul Kumar Challapalli