  1. Flink
  2. FLINK-33059

Support transparent compression for file-connector for all file input formats

Details

    Description

      Some FileInputFormats don't use FileInputFormat#createSplits, which would detect that a file is non-splittable and handle read boundaries correctly. Instead, they create their splits manually from FileSourceSplit. If the input files are compressed, the split length is then determined by the compressed file length, which leads to this bug. For compressed files, we should force reading the whole file split, as is already done for binary input formats. Parallelism is still applied at the file level, as it is now.
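The following is a minimal, plain-JDK sketch of one plausible way a split length taken from the compressed file can go wrong, and of the proposed "read the whole file" behaviour. It is not Flink code and does not use Flink APIs: the class `CompressedSplitSketch` and the methods `gzip`, `readBounded`, and `readWholeFile` are hypothetical names chosen for illustration, and gzip stands in for whichever codec is actually used. It assumes the reader applies the split length as a limit on decompressed bytes.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical illustration only; none of these names exist in Flink.
final class CompressedSplitSketch {

    /** Compresses the given bytes with gzip, standing in for a compressed input file. */
    static byte[] gzip(byte[] raw) {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(bos)) {
            gz.write(raw);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bos.toByteArray();
    }

    /**
     * Assumed problematic behaviour: the split length (the compressed file size)
     * is used as a limit on the number of DECOMPRESSED bytes to read.
     */
    static byte[] readBounded(byte[] compressedFile, long splitLength) {
        return read(compressedFile, splitLength);
    }

    /** Proposed behaviour: ignore the split length and read until end of stream. */
    static byte[] readWholeFile(byte[] compressedFile) {
        return read(compressedFile, Long.MAX_VALUE);
    }

    /** Decompresses at most {@code limit} bytes from the gzip data. */
    private static byte[] read(byte[] compressedFile, long limit) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        long remaining = limit;
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(compressedFile))) {
            while (remaining > 0) {
                int n = in.read(buf, 0, (int) Math.min(buf.length, remaining));
                if (n < 0) {
                    break; // end of the decompressed stream
                }
                out.write(buf, 0, n);
                remaining -= n;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Build a highly compressible sample "file".
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 1000; i++) {
            sb.append("flink,compressed,record\n");
        }
        byte[] raw = sb.toString().getBytes(StandardCharsets.UTF_8);
        byte[] file = gzip(raw);

        System.out.println("uncompressed bytes: " + raw.length);
        System.out.println("compressed bytes:   " + file.length);
        System.out.println("bounded read bytes: " + readBounded(file, file.length).length);
        System.out.println("whole-file bytes:   " + readWholeFile(file).length);
    }
}
```

In this sketch the bounded read stops early because the compressed size is smaller than the decompressed content, while the whole-file read recovers every record. Since a gzip stream cannot be entered at an arbitrary offset, treating each compressed file as a single unit of work matches the file-level parallelism described above.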

            People

              echauchot Etienne Chauchot
              Votes: 0
              Watchers: 1