Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-25471

wrong result if table transfrom to DataStream then keyby sum in Batch Mode

    XMLWordPrintableJSON

Details

    Description

      I have a dataStream with 6 lines datas like this:

      Row.of("Alice"),
      Row.of("alice"),
      Row.of("Bob"),
      Row.of("lily"),
      Row.of("lily"),
      Row.of("lily")

      then  make it to  table  with one column "word"

      then sql transform : select upper(word) from tmp_table

      then change to dataStream

      then keyby sum.

       

      in batch mode:

      I think correct result is:

      > (BOB,1)
      > (ALICE,2)
      > (LILY,3)

       

      but the result is :

      > (BOB,1)

      if i set different parallelism ,the result is different.

       

      the source file  and pom is in attach.

       is  a bug?

      pelease help me!!!

       

       

       

       

       

       

       

       

       

      Attachments

        1. TableToDataStreamBatchWordCount-1.scala
          2 kB
          zhangzh
        2. pom.xml
          12 kB
          zhangzh

        Issue Links

          Activity

            People

              paul8263 Yao Zhang
              zhangzihan zhangzh
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated: