Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-12741

Read multiple files keeping track of file names (Python)

Details

    • Improvement
    • Status: Resolved
    • P3
    • Resolution: Duplicate
    • 2.31.0
    • Missing
    • io-py-files

    Description

      When reading lines from text files with multiple patterns it is sometimes useful to keep track of the file names from which the lines originated. Example: read tab-delimited files and map their lines to column headers coming from separate files.

      It would be nice to have a ReadAllFromTextWithFilename transform, which modifies ReadAllFromText transform in a similar way as ReadFromTextWithFilename modifies  the ReadFromText transform to produce tuples of file names paired with text lines.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              EugeneNikolaiev Eugene Nikolaiev
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 50m
                  1h 50m