Uploaded image for project: 'Apache Hop (Retired)'
  1. Apache Hop (Retired)
  2. HOP-4187

Merge join on Beam mixes up fields

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • 2.0.0
    • 2.1.0
    • Beam
    • None

    Description

      See the [test case here|https://github.com/fbarthez/BeamJoinSpike.]

       

      2022/09/02 15:05:06 - Hop - Pipeline opened.
      2022/09/02 15:05:06 - Hop - Launching pipeline [join-spike]...
      2022/09/02 15:05:06 - Hop - Started the pipeline execution.
      2022/09/02 15:05:14 - General - Created Apache Beam pipeline with name 'join-spike'
      2022/09/02 15:05:14 - General - Handled transform (INPUT) : Agents
      2022/09/02 15:05:14 - General - Handled transform (INPUT) : Sites
      2022/09/02 15:05:14 - General - Handled Merge Join (TRANSFORM) : Merge join
      2022/09/02 15:05:14 - General - Handled generic transform (TRANSFORM) : Select values, gets data from 1 previous transform(s), targets=0, infos=0
      2022/09/02 15:05:14 - General - Handled transform (OUTPUT) : Write valid agents, gets data from Select values
      2022/09/02 15:05:14 - join-spike - Executing this pipeline using the Beam Pipeline Engine with run configuration 'Beam-Direct'
      2022/09/02 15:05:14 - join-spike - ERROR: Error starting the Beam pipeline
      2022/09/02 15:05:14 - join-spike - ERROR: org.apache.hop.core.exception.HopException: 
      2022/09/02 15:05:14 - join-spike - Error executing pipeline with runner Direct
      2022/09/02 15:05:14 - join-spike - java.lang.RuntimeException: Error converting Hop data to string lines
      2022/09/02 15:05:14 - join-spike - 
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.engines.BeamPipelineEngine.executePipeline(BeamPipelineEngine.java:258)
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.engines.BeamPipelineEngine.lambda$startThreads$0(BeamPipelineEngine.java:305)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.lang.Thread.run(Thread.java:829)
      2022/09/02 15:05:14 - join-spike - Caused by: org.apache.beam.sdk.Pipeline$PipelineExecutionException: java.lang.RuntimeException: Error converting Hop data to string lines
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DirectRunner$DirectPipelineResult.waitUntilFinish(DirectRunner.java:373)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DirectRunner$DirectPipelineResult.waitUntilFinish(DirectRunner.java:341)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:218)
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.engines.BeamPipelineEngine.executePipeline(BeamPipelineEngine.java:246)
      2022/09/02 15:05:14 - join-spike -     ... 2 more
      2022/09/02 15:05:14 - join-spike - Caused by: java.lang.RuntimeException: Error converting Hop data to string lines
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.core.fn.HopToStringFn.processElement(HopToStringFn.java:148)
      2022/09/02 15:05:14 - join-spike - Caused by: org.apache.hop.core.exception.HopException: 
      2022/09/02 15:05:14 - join-spike - Error getting String from field site_id Integer on index 1 in input: [id Integer], [site_id Integer], [site_name String], [telephone String], [agent_name String], native value found: Mbabane
      2022/09/02 15:05:14 - join-spike - 
      2022/09/02 15:05:14 - join-spike - site_id Integer : There was a data type error: the data type of java.lang.String object [Mbabane] does not correspond to value meta [Integer]
      2022/09/02 15:05:14 - join-spike - 
      2022/09/02 15:05:14 - join-spike - 
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.core.fn.HopToStringFn.processElement(HopToStringFn.java:135)
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.core.fn.HopToStringFn$DoFnInvoker.invokeProcessElement(Unknown Source)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.repackaged.direct_java.runners.core.SimpleDoFnRunner.invokeProcessElement(SimpleDoFnRunner.java:228)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.repackaged.direct_java.runners.core.SimpleDoFnRunner.processElement(SimpleDoFnRunner.java:187)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.repackaged.direct_java.runners.core.SimplePushbackSideInputDoFnRunner.processElementInReadyWindows(SimplePushbackSideInputDoFnRunner.java:79)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.ParDoEvaluator.processElement(ParDoEvaluator.java:244)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DoFnLifecycleManagerRemovingTransformEvaluator.processElement(DoFnLifecycleManagerRemovingTransformEvaluator.java:54)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DirectTransformExecutor.processElements(DirectTransformExecutor.java:165)
      2022/09/02 15:05:14 - join-spike -     at org.apache.beam.runners.direct.DirectTransformExecutor.run(DirectTransformExecutor.java:129)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
      2022/09/02 15:05:14 - join-spike -     at java.base/java.lang.Thread.run(Thread.java:829)
      2022/09/02 15:05:14 - join-spike - Caused by: org.apache.hop.core.exception.HopValueException: 
      2022/09/02 15:05:14 - join-spike - site_id Integer : There was a data type error: the data type of java.lang.String object [Mbabane] does not correspond to value meta [Integer]
      2022/09/02 15:05:14 - join-spike - 
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.core.row.value.ValueMetaBase.getString(ValueMetaBase.java:1944)
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.core.row.RowMeta.getString(RowMeta.java:301)
      2022/09/02 15:05:14 - join-spike -     at org.apache.hop.beam.core.fn.HopToStringFn.processElement(HopToStringFn.java:111)
      2022/09/02 15:05:14 - join-spike -     ... 13 more
      

      Attachments

        Issue Links

          Activity

            People

              mcasters Matt Casters
              fbarthez Fabian Peters
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: