Uploaded image for project: 'Apache Hop (Retired)'
  1. Apache Hop (Retired)
  2. HOP-3474

s3 is slower than it should be

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • Migrated to GHI
    • VFS
    • None

    Description

      It seems that we are losing precious time somewhere in VFS when listing objects from S3.

      Further investigation is needed, attached is a sample pipeline to list items via the AWS SDK v1,v2 and using get file names.

      The get file names transform is much slower. see if it is the VFS driver, or the transform.

      But as also the browser is much slower than listing the files via the SDK I think something is wrong in the VFS driver.

      Attachments

        1. s3access.hpl
          11 kB
          Hans Van Akelyen

        Issue Links

          Activity

            People

              Unassigned Unassigned
              hansva Hans Van Akelyen
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: