[ODFTOOLKIT-400] Unable to obtain the charset encoding of an odt document - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: odfdom
Labels: None
Environment: Linux - Ubuntu 14.04

Description

I'm trying to convert ODT to HTML. During the conversion I try to obtain the charset encoding of the ODT document so that I can set the matching value on the HTML side. However, I always get 'null' when reading the charset.

        // 'is' is an InputStream opened on the .odt file
        OdfTextDocument odfDoc = OdfTextDocument.loadDocument(is);
        System.out.println(odfDoc.getContentDom().getXmlEncoding());

For the attached test document I expect to get UTF-8 but always see 'null'. This happens with other documents as well.

Is there a better way to obtain the charset encoding of an odt document?
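One possible workaround, sketched here as an assumption rather than as the toolkit's own approach: an .odt file is a ZIP package, so the XML declaration of its content.xml entry can be read directly with the JDK's StAX API, without going through OdfFileDom. The class and method names below (OdtEncodingSketch, declaredEncoding, sampleOdt) are hypothetical, and the in-memory package is only a stand-in for a real document.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class OdtEncodingSketch {

    /**
     * Returns the encoding named in the XML declaration of content.xml,
     * or null if the entry is missing or declares no encoding.
     */
    public static String declaredEncoding(InputStream odt)
            throws IOException, XMLStreamException {
        try (ZipInputStream zip = new ZipInputStream(odt)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if ("content.xml".equals(entry.getName())) {
                    // The reader starts at START_DOCUMENT, where the
                    // XML declaration is available.
                    XMLStreamReader reader =
                            XMLInputFactory.newInstance().createXMLStreamReader(zip);
                    try {
                        return reader.getCharacterEncodingScheme();
                    } finally {
                        reader.close(); // does not close the underlying zip stream
                    }
                }
            }
        }
        return null;
    }

    /** Builds a minimal stand-in package holding only a content.xml entry (not a valid ODT). */
    public static byte[] sampleOdt(String contentXml) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("content.xml"));
            zip.write(contentXml.getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws Exception {
        byte[] odt = sampleOdt("<?xml version=\"1.0\" encoding=\"UTF-8\"?><root/>");
        System.out.println(declaredEncoding(new ByteArrayInputStream(odt)));
    }
}
```

With a real file, the same method could be called on a FileInputStream opened on the .odt. This reads only what the XML declaration says; it does not change what getXmlEncoding() returns on the loaded DOM.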

Attachments

- 400-part3-main-OdfFileDom_initXmlDecl.patch (09/Aug/15 13:53, 4 kB, Nimarukan)
- 400-part2-test-OdfFileDom_xmlDeclTest.patch (09/Aug/15 13:53, 6 kB, Nimarukan)
- 400-part1-pom_xml-FromJava1_5To1_6ForStAX.patch (09/Aug/15 13:53, 0.5 kB, Nimarukan)
- testOdt.odt (29/Jul/15 01:44, 53 kB, Joshua)

People

Assignee: Unassigned
Reporter: Joshua
Votes: 0
Watchers: 3

Dates

Created: 29/Jul/15 01:40
Updated: 14/Sep/15 10:16