Uploaded image for project: 'Apache Hop (Retired)'
  1. Apache Hop (Retired)
  2. HOP-4239

Consider a binary file data type

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Normal
    • Resolution: Fixed
    • None
    • Migrated to GHI
    • API, Pipelines
    • None

    Description

      We could consider a new value type called "Binary file" which stores large data sets in (temporary) files instead of keeping data in memory.

      This would solve use-cases where large amounts of data need to be transferred from one location to another.  

      For example, Oracle BLOB fields have the ability to contain up to 4GB or data.  That is more than fits in a Binary field and it would be a shame to waste memory on it in the first place.

      When writing the data to another database or location you can stream the data back out from the file.

      The trick would be that we would need a way to clean the temporary files back up at the end of a pipeline.  The constructor of the ValueMetaBinaryFile() would need to have a hook to a mechanism to remove the file when no longer needed.

      Attachments

        Activity

          People

            Unassigned Unassigned
            mcasters Matt Casters
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: