Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.20.204.0, 1.0.3, 0.21.0, 2.0.0-alpha
-
Linux
-
TextInputFormat record delimiter
Description
TextInputFormat delimiter bug scenario , a character sequence of the input text, in which the first character matches with the first character of delimiter, and the remaining input text character sequence matches with the entire delimiter character sequence from the starting position of the delimiter.
eg delimiter ="record";
and Text =" record 1:- name = Gelesh e mail = gelesh.hadoop@gmail.com Location Bangalore record 2: name = sdf .. location =Bangalorrecord 3: name .... "
Here string "=Bangalorrecord 3: " satisfy two conditions
1) contains the delimiter "record"
2) The character / character sequence immediately before the delimiter (ie ' r ') matches with first character (or character sequence ) of delimiter. (ie "=Bangalor" ends with and Delimiter starts with same character/char sequence 'r' ),
Here the delimiter is not encountered by the program resulting in improper value text in map that contains the delimiter