Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-3623

Allow non-XML representable separator characters in the ImportTSV tool

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.90.1
    • 0.92.0
    • mapreduce
    • Cloudera Hadoop/HBase (3B4)

    • Reviewed
    • Allow use of non-XML friendly characters as separators in the ImportTSV tool.
    • importtsv, mapreduce, configuration, xml, serialization

    Description

      The current importtsv functionality will not work if one passes a non-XML representable character as the separator character (say, an escape character - \u001b, fairly common in use).

      -Dimporttsv.separator=$'\x1b' # This param fails the submitter when serialized.
      

      While this is a limitation with the Configuration class's being serialized as an XML, it can be circumvented by applying a suitable encoding that makes a string XML-compatible.

      Attachments

        Activity

          People

            Unassigned Unassigned
            qwertymaniac Harsh J
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 0.5h
                0.5h
                Remaining:
                Remaining Estimate - 0.5h
                0.5h
                Logged:
                Time Spent - Not Specified
                Not Specified