Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-9152

Hadoop Downloader Range not correct

Details

    • Bug
    • Status: Triage Needed
    • P1
    • Resolution: Unresolved
    • None
    • Missing
    • io-py-tfrecord
    • None

    Description

      I found that the `HdfsDownloader.get_range(self, start, end)' get range [start, end] rather than [start, end). This will cause an error while reading hdfs file.

       

      File "/data/anaconda3/envs/tfdv1.15/lib/python3.7/site-packages/apache_beam/io/tfrecordio.py", line 127, in read_record
      buf = file_handle.read(buf_length_expected)
      File "/data/anaconda3/envs/tfdv1.15/lib/python3.7/site-packages/apache_beam/io/filesystemio.py", line 123, in readinto
      b[:len(data)] = data
      ValueError: memoryview assignment: lvalue and rvalue have different structures

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              Benjzhou Fangyuan Zhou
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated: