Details
- Test
- Status: Resolved
- P2
- Resolution: Fixed
- None
- None
Description
Some time ago we implemented the 'queries' from
http://datalab.cs.pdx.edu/niagara/NEXMark/
as Google Dataflow pipelines. We found them useful
for uncovering performance problems with the SDK, our runners,
and our service. Many of those problems manifested only under
high load, during multi-day runs, or with a high 'backlog' on the incoming
Pub/Sub subscriptions.
We therefore think they would be useful for other runners.
Disclaimer: Though the original 'queries' were proposed as a way to
benchmark 'continuous SQL' implementations, we have so far used them
only for internal A/B and regression testing and have not validated
them as representative of customer workloads. We would therefore
discourage their use for competitive benchmarks without more work.
Issue Links
- relates to
  - BEAM-775 Remove Aggregators from the Java SDK (Resolved)
  - BEAM-2111 ClassCastException in streaming Spark pipeline. (Open)
  - BEAM-2499 Support Custom Windows in Spark runner (Open)
  - BEAM-2408 Flink unbounded source does not emit watermarks when there are multiple Readers (Resolved)
  - BEAM-1772 Support merging WindowFn other than IntervalWindow on Flink Runner (Resolved)
  - BEAM-1802 Spark Runner does not shutdown correctly when executing multiple pipelines in sequence (Resolved)
  - BEAM-1810 Spark runner combineGlobally uses Kryo serialization (Resolved)
  - BEAM-2029 NullPointerException when using multi output ParDo in Spark runner in streaming mode. (Resolved)
  - BEAM-2409 Spark runner produces exactly twice the number of results in streaming mode when use triggers to re-window results on global window. (Resolved)
  - BEAM-1948 Null pointer exception in DirectRunner.DirectPipelineResult.getAggregatorValues() (Resolved)
  - BEAM-593 Support unblocking run() in FlinkRunner and cancel() and waitUntilFinish() in FlinkRunnerResult (Open)
  - BEAM-1035 Support for new State API in SparkRunner (Open)
  - BEAM-1115 Support for new Timer API in Spark runner (Open)
  - BEAM-1037 Support for new State API in ApexRunner (Resolved)
  - BEAM-1114 Support for new Timer API in Apex runner (Resolved)
  - BEAM-2112 Add support for PCollectionView in spark runner in streaming mode (Resolved)
  - BEAM-2847 Stop streaming pipeline when global watermark reaches +INF (Open)
  - BEAM-2971 Nexmark: migrate WinningBids to state API in place of custom window merge. (Open)
  - BEAM-2972 Nexmark: create a query that illustrates a simple custom window merge (Open)
  - BEAM-829 Implement Metrics support for Apex runner (Resolved)
  - BEAM-1261 State API should allow state to be managed in different windows (Open)
- links to