Details
-
Task
-
Status: Resolved
-
Trivial
-
Resolution: Fixed
-
None
-
None
-
None
Description
It'd be nice to be able to pass in a file-based list of fetch keys and have the pipes-iterator just work. This would be equivalent to the -fileList option in the current tika-batch.
This is useful for processing only a specific subset of files within a directory or s3 bucket.
There's some overlap with the CSV pipes iterator, but this is simpler and can live in tika-core because of no extra dependencies.