Description
This is an umbrella issue to refactor all benchmarks to use a common style using main-method (instead of `test` method) and saving the result as a separate file (instead of embedding as comments). This is not only for consistency, but also for making the benchmark-automation easy. SPARK-25339 is finished as a reference model.
Completed
- FilterPushdownBenchmark.scala (
SPARK-25339)
Candidates
- AggregateBenchmark.scala
- AvroWriteBenchmark.scala (
SPARK-24777) - ColumnarBatchBenchmark.scala
- CompressionSchemeBenchmark.scala
- DataSourceReadBenchmark.scala
- DataSourceWriteBenchmark.scala (
SPARK-24777) - DatasetBenchmark.scala
- ExternalAppendOnlyUnsafeRowArrayBenchmark.scala
- HashBenchmark.scala
- HashByteArrayBenchmark.scala
- JoinBenchmark.scala
- KryoBenchmark.scala
- MiscBenchmark.scala
- ObjectHashAggregateExecBenchmark.scala
- OrcReadBenchmark.scala
- PrimitiveArrayBenchmark.scala
- SortBenchmark.scala
- SynthBenchmark.scala
- TPCDSQueryBenchmark.scala
- UDTSerializationBenchmark.scala
- UnsafeArrayDataBenchmark.scala
- UnsafeProjectionBenchmark.scala
- WideSchemaBenchmark.scala
Candidates will be reviewed and converted as a subtask of this JIRA.
Attachments
Issue Links
- relates to
-
SPARK-24777 Add write benchmark for AVRO
- Resolved