Uploaded image for project: 'Sqoop (Retired)'
  1. Sqoop (Retired)
  2. SQOOP-2561

Special Character removal from Column name as avro data results in duplicate column and fails the import

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.4.6
    • 1.4.7
    • None
    • cdh5.3.2

    Description

      When a Special character like '$' or '#' are present in column name, sqoop/avro removes those special character. In some cases it leads to duplicate column.
      e.g. If we have COL$1 and COL1$ in the schema, it removes both of them and creates the duplicate column as COL1 and it results in failure of the SQOOP import job as a avro data. The same table can be loaded without --as-avarodata flag.

      The similar issue is raised in, https://issues.apache.org/jira/browse/SQOOP-1361 - which i suppose is fixed and the fix is creating this new issue.

      Attachments

        1. 0001-SQOOP-2561.patch
          5 kB
          VISHNU S NAIR

        Activity

          People

            vishnusn VISHNU S NAIR
            SureshDeoda Suresh Deoda
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: