Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Provide a more intuitive way to construct nested dataset:
```python
- instead of using confusing factory function
dataset([
factory("s3://old-taxi-data", format="parquet"),
factory("local/path/to/new/data", format="csv")
])
- let the user to construct a new dataset directly from dataset objects
dataset([
dataset("s3://old-taxi-data", format="parquet"),
dataset("local/path/to/new/data", format="csv")
])
```
In the future we might want to introduce a new Dataset class which wraps functionality of both the dataset actory and the materialized dataset enabling optimizations over rediscovery of already materialized datasets.
Attachments
Issue Links
- links to