  1. Apache AsterixDB
  2. ASTERIXDB-2129

UTF8StringUtil key normalization failure


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed

    Description

      This query:

      SELECT text, c
      FROM (
        SELECT h.text AS text, datetime_from_unix_time_in_ms(to_bigint(t.timestamp_ms)) AS time
        FROM aca_int AS t
        UNNEST t.hashtags h
        WHERE t.isRelated = 1 AND t.`SA-OM` IS NOT MISSING AND t.createdDate IS NOT MISSING
      ) AS g
      GROUP BY g.text AS text WITH c AS count(g.time)
      ORDER BY c DESC;

      fails with the following error when the unnested hashtag field text is part of a closed schema:

      {{Oct 11, 2017 7:10:05 AM org.apache.hyracks.control.cc.dataset.DatasetDirectoryService reportJobFailure
      INFO: job JID:4 failed and is being reported to DatasetDirectoryService
      org.apache.hyracks.api.exceptions.HyracksDataException: java.lang.IllegalArgumentException
          at org.apache.hyracks.api.exceptions.HyracksDataException.create(HyracksDataException.java:134)
          at org.apache.hyracks.control.common.utils.ExceptionUtils.setNodeIds(ExceptionUtils.java:63)
          at org.apache.hyracks.control.nc.Task.run(Task.java:362)
          at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
          at java.lang.Thread.run(Thread.java:748)
      Caused by: java.lang.IllegalArgumentException
          at org.apache.hyracks.util.string.UTF8StringUtil.charAt(UTF8StringUtil.java:60)
          at org.apache.hyracks.util.string.UTF8StringUtil.normalize(UTF8StringUtil.java:228)
          at org.apache.hyracks.dataflow.common.data.normalizers.UTF8StringNormalizedKeyComputerFactory$1.normalize(UTF8StringNormalizedKeyComputerFactory.java:33)
          at org.apache.asterix.dataflow.data.nontagged.keynormalizers.AWrappedAscNormalizedKeyComputerFactory$1.normalize(AWrappedAscNormalizedKeyComputerFactory.java:46)
          at org.apache.hyracks.dataflow.std.sort.AbstractFrameSorter.sort(AbstractFrameSorter.java:139)
          at org.apache.hyracks.dataflow.std.sort.AbstractSortRunGenerator.flushFramesToRun(AbstractSortRunGenerator.java:60)
          at org.apache.hyracks.dataflow.std.sort.AbstractSortRunGenerator.close(AbstractSortRunGenerator.java:50)
          at org.apache.hyracks.dataflow.std.sort.AbstractSorterOperatorDescriptor$SortActivity$1.close(AbstractSorterOperatorDescriptor.java:132)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:119)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.std.StreamSelectRuntimeFactory$1.close(StreamSelectRuntimeFactory.java:112)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.std.AssignRuntimeFactory$1.close(AssignRuntimeFactory.java:119)
          at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.close(AbstractOneInputOneOutputOneFramePushRuntime.java:60)
          at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$2.close(AlgebricksMetaOperatorDescriptor.java:140)
          at org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.close(IndexSearchOperatorNodePushable.java:243)
          at org.apache.hyracks.algebricks.runtime.operators.std.EmptyTupleSourceRuntimeFactory$1.close(EmptyTupleSourceRuntimeFactory.java:65)
          at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$1.initialize(AlgebricksMetaOperatorDescriptor.java:104)
          at org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable.lambda$runInParallel$1(SuperActivityOperatorNodePushable.java:204)
          at java.util.concurrent.FutureTask.run(FutureTask.java:266)
          ... 3 more
      }}
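
      For readers unfamiliar with key normalization, the trace shows the sorter asking a normalized-key computer for a short integer prefix of each string, and the character decoder rejecting a byte along the way. The sketch below is only an illustration of that general technique under stated assumptions: the class Utf8PrefixSketch and its method signatures are hypothetical, it is not the Hyracks UTF8StringUtil source, and it does not claim to reproduce the root cause of this issue. It shows how a decoder limited to 1- to 3-byte sequences can throw IllegalArgumentException when it is pointed at a byte that is not a valid start of a character.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; NOT the Hyracks UTF8StringUtil implementation.
class Utf8PrefixSketch {

    // Decode the character that starts at offset s (1- to 3-byte sequences only).
    static char charAt(byte[] b, int s) {
        int c = b[s] & 0xff;
        switch (c >> 4) {
            case 0: case 1: case 2: case 3: case 4: case 5: case 6: case 7:
                return (char) c; // 0xxxxxxx
            case 12: case 13:
                return (char) (((c & 0x1f) << 6) | (b[s + 1] & 0x3f)); // 110xxxxx 10xxxxxx
            case 14:
                return (char) (((c & 0x0f) << 12) | ((b[s + 1] & 0x3f) << 6) | (b[s + 2] & 0x3f));
            default:
                // Continuation byte (10xxxxxx) or 4-byte lead (1111xxxx):
                // this sketch treats neither as a valid character start.
                throw new IllegalArgumentException();
        }
    }

    // Number of bytes used by the character that starts at offset s.
    static int charSize(byte[] b, int s) {
        int c = b[s] & 0xff;
        switch (c >> 4) {
            case 0: case 1: case 2: case 3: case 4: case 5: case 6: case 7:
                return 1;
            case 12: case 13:
                return 2;
            case 14:
                return 3;
            default:
                throw new IllegalArgumentException();
        }
    }

    // Pack the first two characters into a non-negative int usable as a sort prefix.
    // The loop is bounded by the byte range [start, start + byteLen), so it never
    // decodes past the end of the string.
    static int normalize(byte[] b, int start, int byteLen) {
        long nk = 0;
        int offset = start;
        int end = start + byteLen;
        for (int i = 0; i < 2; i++) {
            nk <<= 16;
            if (offset < end) {
                nk += charAt(b, offset) & 0xffff;
                offset += charSize(b, offset);
            }
        }
        return (int) (nk >> 1);
    }

    public static void main(String[] args) {
        byte[] ab = "ab".getBytes(StandardCharsets.UTF_8);
        byte[] ac = "ac".getBytes(StandardCharsets.UTF_8);
        // Keys preserve the ordering of the two-character prefixes.
        System.out.println(normalize(ab, 0, ab.length) < normalize(ac, 0, ac.length));

        byte[] e = "\u00e9".getBytes(StandardCharsets.UTF_8); // two bytes: 0xC3 0xA9
        try {
            charAt(e, 1); // offset 1 is a continuation byte, not a character start
        } catch (IllegalArgumentException ex) {
            System.out.println("rejected non-start byte");
        }
    }
}
```

      In this sketch the decoder fails whenever the offset it is given does not land on a character boundary, or when the data contains a byte pattern outside the supported forms. Either condition would surface as a bare IllegalArgumentException from charAt, which is the shape of the exception in the trace above.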

          People

            wyk Wail Y. Alkowaileet
            imaxon Ian Maxon