  Beam / BEAM-3217

Add a performance test for HadoopInputFormatIO

Details

    • Type: Test
    • Status: Resolved
    • Priority: P2
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: 2.4.0
    • Component/s: io-java-hadoop-format
    • Labels: None

    Description

      We should add a large-scale performance test for HadoopInputFormatIO. The test should use the PerfKitBenchmarker-based performance testing framework [1] to manage a Kubernetes-based multi-node data store and to publish benchmark results.

      Example input format implementation to use: DBInputFormat to connect to a Postgres instance.
      https://github.com/hanborq/hadoop/blob/master/src/mapred/org/apache/hadoop/mapreduce/lib/db/DBInputFormat.java
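      As a rough illustration of the setup described above, the sketch below lists the Hadoop configuration properties that a DBInputFormat-backed read through HadoopInputFormatIO would likely need. It uses only plain Java and string keys so that it runs without Beam or Hadoop on the classpath. The class name DbInputFormatConfigSketch, the table name, the column names, the credentials, and the value class com.example.TestRowDBWritable are all hypothetical, and the mapreduce.jdbc.* key names are assumed to match the DBConfiguration constants of the Hadoop version in use.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: collects the properties a DBInputFormat read would need.
class DbInputFormatConfigSketch {

  static Map<String, String> buildProperties(
      String jdbcUrl, String user, String password, String table, String valueClass) {
    Map<String, String> props = new LinkedHashMap<>();

    // Properties that HadoopInputFormatIO reads to pick the input format and its types.
    props.put("mapreduce.job.inputformat.class",
        "org.apache.hadoop.mapreduce.lib.db.DBInputFormat");
    props.put("key.class", "org.apache.hadoop.io.LongWritable"); // DBInputFormat key type
    props.put("value.class", valueClass);

    // JDBC connection settings (assumed DBConfiguration key names).
    props.put("mapreduce.jdbc.driver.class", "org.postgresql.Driver");
    props.put("mapreduce.jdbc.url", jdbcUrl);
    props.put("mapreduce.jdbc.username", user);
    props.put("mapreduce.jdbc.password", password);

    // What to read: table, columns, ordering, and the DBWritable row class.
    props.put("mapreduce.jdbc.input.table.name", table);
    props.put("mapreduce.jdbc.input.field.names", "id,name"); // hypothetical columns
    props.put("mapreduce.jdbc.input.orderby", "id ASC");
    props.put("mapreduce.jdbc.input.class", valueClass);
    return props;
  }

  // Sample values only; a real test would take these from pipeline options.
  static Map<String, String> sample() {
    return buildProperties(
        "jdbc:postgresql://localhost:5432/postgres",
        "postgres",
        "secret",
        "test_table",
        "com.example.TestRowDBWritable");
  }

  public static void main(String[] args) {
    sample().forEach((key, value) -> System.out.println(key + " = " + value));
  }
}
```

      In an actual test these entries would be set on an org.apache.hadoop.conf.Configuration and passed to HadoopInputFormatIO.read().withConfiguration(...), with the Postgres host and credentials supplied by the test framework rather than hard-coded.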

      Example docker image to use: https://hub.docker.com/_/postgres/

      [1] https://beam.apache.org/documentation/io/testing/


          People

            Assignee: Lukasz Gajowy (ŁukaszG)
            Reporter: Chamikara Madhusanka Jayalath (chamikara)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 6h 10m