[CSV-58] Unescape handling needs rethinking - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: Patch Needed, 1.x
Component/s: Parser
Labels:
None

Description

The current escape parsing converts <esc><char> to plain <char> if the <char> is not one of the special characters to be escaped.

This can affect unicode escapes if the <esc> character is backslash.

One way round this is to specifically check for <char> == 'u', but it seems wrong to only do this for 'u'.

Another solution would be to leave <esc><char> as is unless the <char> is one of the special characters.

There are several possible ways to treat unrecognised escapes:

treat it as if the escape char had not been present (current behaviour)
leave the escape char as is
throw an exception

Attachments

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

commons-csv.diff
26/Jun/12 05:26
35 kB
Anirudha Khanna

Issue Links

is related to

CSV-56 Do not use exotic escape characters for sequences like \r or \n

Open

Activity

People

Assignee:: Unassigned

Reporter:: Sebb

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 14/Mar/12 15:37

Updated:: 10/Jul/14 19:43