Details
Type: Sub-task
Status: Open
Priority: Major
Resolution: Unresolved
Description
I am exploring an e-commerce events time-series dataset that can be used for periodic benchmarks. The dataset is here: https://www.kaggle.com/mkechinov/ecommerce-behavior-data-from-multi-category-store. Since this is a non-commercial use of the dataset, solely for performance benchmarks, I think it qualifies as fair use.
My goal is to use this dataset to benchmark indexing, faceting, and join performance.
I will use the benchmarking suite developed as part of SOLR-10317 and SOLR-13933. Details here: https://searchscale.com/blog/solr-bench/
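
To make the indexing and faceting goals above a little more concrete, here is a rough sketch of how rows from the dataset might be turned into Solr documents and how a terms facet request could be built. It is not part of the benchmarking suite. The column names are my assumption about the CSV layout and should be checked against the actual download, the sample row is made up, and the helper names (row_to_doc, rows_to_docs, terms_facet_request) are hypothetical. Nothing is sent to Solr here; the code only builds the payloads.

```python
import csv
import io
import json

# Assumed column layout of the dataset's CSV files (verify against the real files).
# The single data row below is synthetic, for illustration only.
SAMPLE_CSV = (
    "event_time,event_type,product_id,category_id,category_code,brand,price,user_id,user_session\n"
    "2019-10-01 00:00:00 UTC,view,1001,2001,,example-brand,35.79,3001,session-0001\n"
)


def row_to_doc(row, doc_id):
    """Map one CSV row to a flat dict that could be posted to Solr as JSON."""
    doc = {"id": str(doc_id)}
    for key, value in row.items():
        if value == "":
            continue  # skip empty columns instead of indexing empty strings
        doc[key] = value
    if "price" in doc:
        doc["price"] = float(doc["price"])  # numeric field, useful for range facets
    return doc


def rows_to_docs(csv_text):
    """Parse CSV text and convert every row to a document with a sequential id."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row_to_doc(row, i) for i, row in enumerate(reader)]


def terms_facet_request(field, limit=10):
    """Build a JSON request body with a terms facet over `field` and no result rows."""
    return {
        "query": "*:*",
        "limit": 0,
        "facet": {field: {"type": "terms", "field": field, "limit": limit}},
    }


docs = rows_to_docs(SAMPLE_CSV)
print(json.dumps(docs[0], indent=2))
print(json.dumps(terms_facet_request("event_type")))
```

In a real run, the list of documents would be posted in batches to a collection's update handler and the facet body to its query handler; batch size and the schema for each field are choices the benchmark would need to pin down.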