[Bug 57008] Wrting _x0427_ to a string cell changes the string to some strange UTF-8 character

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[Bug 57008] Wrting _x0427_ to a string cell changes the string to some strange UTF-8 character

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=57008

--- Comment #23 from Vishal Jaldawar <[hidden email]> ---
I have similar issue. Please refer the excel file attached by me.
There are some control characters which are not read correctly by Apache POI.

I have written a custom Sheet Handler extending DefaultHandler.
I read the data and form a xml out of it to store into database. I have used
JAXB marshaller. When I unmarshall xml I get

org.xml.sax.SAXParseException: An invalid XML character (Unicode: 0x1) was
found in the element content of the document.

I debugged the code and found the XSSFRichTextString.utfDecode has some
decoding logic. but it seems it is not working for some characters.

I am using Apache POI  3.17 version.

Please help.

--
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]