Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3463

Add FileListIterator as a pipes-iterator

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Trivial
    • Resolution: Fixed
    • None
    • 2.0.0
    • None
    • None

    Description

      It'd be nice to be able to pass in a file-based list of fetch keys and have the pipes-iterator just work. This would be equivalent to the -fileList option in the current tika-batch.

      This is useful for processing only a specific subset of files within a directory or s3 bucket.

      There's some overlap with the CSV pipes iterator, but this is simpler and can live in tika-core because of no extra dependencies.

      Attachments

        Activity

          People

            tallison Tim Allison
            tallison Tim Allison
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: