Affects Version/s: core 1.4.3
Fix Version/s: None
Environment: OS: Windows 2003 SP2, MyEclipse 6.0 / Tomcat 5.5, Athlon 500+
I have a .doc file that contains data inside a table, and I want to parse the table to get its values. Ordinary parsing (I mean using StringTokenizer) does not work for the table, because it returns unwanted special characters along with the text. So I want to convert the .doc to a .txt file, after which splitting the values should be easy, but I have not managed to do it. Can anyone please tell me how to parse the values of an MS Word table?
We also need to know how to index a .doc file with the special characters excluded.
When we show an excerpt, these special characters make it unreadable.
Thanks in advance.
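For illustration, here is a minimal sketch of one way to handle the special characters once the raw text has been extracted from the .doc. It rests on an assumption that is not confirmed anywhere in this report: that the unwanted characters are the control characters Word's binary format uses as table marks, in particular U+0007 at the end of each cell. The class and method names (`WordTableText`, `splitCells`, `clean`) are hypothetical. Row boundaries cannot be recovered reliably from the text alone, so the sketch returns a flat list of cell values; a Word-aware library such as Apache POI HWPF, which models tables, rows and cells, would be the better route if the row structure matters.

```java
import java.util.ArrayList;
import java.util.List;

public class WordTableText {
    // Assumption: each table cell in the extracted text ends with the BEL character (U+0007).
    private static final char CELL_MARK = '\u0007';

    /** Returns the non-empty cell values found in the raw text, in document order. */
    public static List<String> splitCells(String raw) {
        List<String> cells = new ArrayList<String>();
        for (String part : raw.split(String.valueOf(CELL_MARK))) {
            String value = clean(part);
            if (!value.isEmpty()) {
                cells.add(value);
            }
        }
        return cells;
    }

    /** Replaces every control character with a space, then collapses runs of spaces. */
    public static String clean(String raw) {
        StringBuilder out = new StringBuilder(raw.length());
        for (int i = 0; i < raw.length(); i++) {
            char c = raw.charAt(i);
            out.append(Character.isISOControl(c) ? ' ' : c);
        }
        return out.toString().trim().replaceAll("\\s+", " ");
    }

    public static void main(String[] args) {
        // Hypothetical raw text for a 2x2 table: one mark per cell, one extra mark per row.
        String raw = "Name\u0007Age\u0007\u0007Alice\u000730\u0007\u0007";
        System.out.println(splitCells(raw)); // [Name, Age, Alice, 30]
        System.out.println(clean(raw));      // Name Age Alice 30
    }
}
```

Passing the output of `clean` to the indexer instead of the raw text should keep the control characters out of stored fields, and therefore out of the excerpts. Note that `clean` also flattens line breaks and tabs into single spaces, which is usually acceptable for excerpt text.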
|Field||Original Value||New Value|
|Workflow||jira [ 12447527 ]||no-reopen-closed, patch-avail [ 12467826 ]|
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Resolution|| ||Incomplete [ 4 ]|
|Status||Resolved [ 5 ]||Closed [ 6 ]|