  1. Sqoop
  2. SQOOP-2561

Special Character removal from Column name as avro data results in duplicate column and fails the import

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.4.6
    • Fix Version/s: 1.4.7
    • Component/s: None
    • Labels:
    • Environment:

      cdh5.3.2

      Description

      When a special character such as '$' or '#' is present in a column name, Sqoop/Avro removes it. In some cases this produces duplicate column names.
      For example, if the schema contains both COL$1 and COL1$, the special character is removed from each, two columns named COL1 result, and the Sqoop import job fails when importing as Avro data. The same table can be loaded without the --as-avrodatafile flag.
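
The collision described above can be reproduced with a small sketch. This is not Sqoop's actual code: the function names `strip_special` and `find_collisions` are hypothetical, and the cleanup rule (drop every character that is not a letter, digit, or underscore) is an assumption based only on the behaviour reported here.

```python
import re
from collections import defaultdict


def strip_special(name):
    # Assumed cleanup rule (not taken from Sqoop's source):
    # drop every character that is not a letter, digit, or underscore.
    return re.sub(r"[^A-Za-z0-9_]", "", name)


def find_collisions(columns):
    # Group the original column names by their cleaned form and
    # keep only the cleaned names claimed by more than one column.
    groups = defaultdict(list)
    for col in columns:
        groups[strip_special(col)].append(col)
    return {clean: originals for clean, originals in groups.items()
            if len(originals) > 1}


# COL$1 and COL1$ both collapse to COL1, which is the duplicate
# that makes the Avro import fail.
print(find_collisions(["ID", "COL$1", "COL1$", "NAME#"]))
# prints {'COL1': ['COL$1', 'COL1$']}
```

A check like this, run against a table's column list before the import, would show whether the table is affected.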

      A similar issue was raised in https://issues.apache.org/jira/browse/SQOOP-1361, which I suppose has been fixed, and that fix appears to have introduced this new issue.

        Attachments

        1. 0001-SQOOP-2561.patch
          5 kB
          VISHNU S NAIR

          Activity

            People

            • Assignee: VISHNU S NAIR (vishnusn)
            • Reporter: Suresh Deoda (SureshDeoda)
            • Votes: 0
            • Watchers: 5

              Dates

              • Created:
                Updated:
                Resolved: