Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-12741

Read multiple files keeping track of file names (Python)

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: P3
    • Resolution: Duplicate
    • Affects Version/s: 2.31.0
    • Fix Version/s: None
    • Component/s: io-py-files
    • Labels:

      Description

      When reading lines from text files with multiple patterns it is sometimes useful to keep track of the file names from which the lines originated. Example: read tab-delimited files and map their lines to column headers coming from separate files.

      It would be nice to have a ReadAllFromTextWithFilename transform, which modifies ReadAllFromText transform in a similar way as ReadFromTextWithFilename modifies  the ReadFromText transform to produce tuples of file names paired with text lines.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                EugeneNikolaiev Eugene Nikolaiev
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 50m
                  1h 50m