Uploaded image for project: 'Apache Apex Malhar'
  1. Apache Apex Malhar
  2. APEXMALHAR-1633

Add a BlockReader to library where records are separated by end of line character and doesn't require record start validation

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0
    • adapters other
    • None

    Description

      We have an AbstractLineReader in library that efficiently parses lines from a block. It is efficient because it doesn't read ahead of the block boundary unless it is absolutely necessary. But this reader makes it necessary for being able to have a way to validate the start of record.

      For a lot of cases this is not possible. In case of CSV files, we don't always have a way of validating start of a row. In this scenario what we can do is that a reader should always start reading from the first eol character. The implications of this is that a block would always be read ahead of the block boundary.

      Attachments

        Activity

          People

            csingh Chandni Singh
            csingh Chandni Singh
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: