Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-7826

Problem loading ISO-8859-1 into BigQuery using DataFlow

Details

    • Bug
    • Status: Open
    • P3
    • Resolution: Unresolved
    • 2.8.0
    • None
    • io-java-gcp, io-java-text
    • None

    Description

      Hi all,
      I'm trying to load an ISO-8859-1 file into BigQuery using DataFlow. I've built a template with Apache Beam Java. Everything works well but when I check the content of the Bigquery table I see that some characters like 'ñ' or accents 'á','é', etc. haven't been stored propertly, they have been stored as �.

      I've tried several charset changing before write into BigQuery. Also, I've created a special ISOCoder passed to the pipeline using the method setCoder(), but nothing works.

      Attachments

        Activity

          People

            Unassigned Unassigned
            Israel Israel Gómez
            Votes:
            1 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: