Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-2274

beam on spark runner run much slower than using spark

Details

    • Test
    • Status: Resolved
    • P2
    • Resolution: Duplicate
    • None
    • Not applicable
    • runner-spark
    • None

    Description

      I run a job,read hdfs files using Read.from(HDFSFileSource.from()) and do some ParDo.of functions. and I also run the same job, read hdfs file using sc.textFile(file) and do some RDDs.but I find beam job is much slower than spark job.Is there something that beam should improve or something wrong with my system and my code?thank you.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              yuntian liyuntian
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: