[Bug 58247] New: UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

            Bug ID: 58247
           Summary: UTF-16 characters in .xlsx file can't write properly
           Product: POI
           Version: 3.10-FINAL
          Hardware: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: XSSF
          Assignee: [hidden email]
          Reporter: [hidden email]

After reading the .xlsx file, the UTF-16 characters were still displayed
correctly. But after the workbook was written back to disk, each of them
became "??" at the position where it had been.

I tried converting the .xlsx file to .xls with Excel, then opened and saved it
with HSSF, and the UTF-16 characters were still displayed properly. (In this
case, I used UnicodeString to set the UTF-16 characters in the cell.)

I've checked with the latest 3.12 source and the same behavior can be
reproduced. So I think it only happens in XSSF or SXSSF.
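
A character that turns into exactly two question marks is consistent with a
character outside the Basic Multilingual Plane whose two UTF-16 surrogate
halves get encoded separately. The following is a minimal sketch of that
effect using only the JDK. It does not use POI or XMLBeans, and it is an
assumption that the actual bug follows this path. The class name, the method
names and the sample character U+20BB7 are made up for illustration.

```java
import java.nio.charset.StandardCharsets;

public class SurrogateSplitDemo {

    // Encode the whole string at once: the encoder sees both halves of each
    // surrogate pair and produces a valid 4-byte UTF-8 sequence.
    public static String encodeWhole(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.UTF_8);
    }

    // Hypothetical faulty writer: encodes one UTF-16 code unit at a time.
    // A lone surrogate cannot be encoded, so String.getBytes substitutes
    // the charset's replacement byte, which is '?' for UTF-8.
    public static String encodePerChar(String s) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            byte[] utf8 = String.valueOf(s.charAt(i)).getBytes(StandardCharsets.UTF_8);
            out.append(new String(utf8, StandardCharsets.UTF_8));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+20BB7 is stored in a Java String as the surrogate pair D842 DFB7.
        String cellText = "A\uD842\uDFB7B";
        System.out.println(encodeWhole(cellText).equals(cellText)); // true
        System.out.println(encodePerChar(cellText));                // A??B
    }
}
```

If the sample input contains such characters, this would explain why only
some cells are affected while ordinary non-ASCII text survives the round trip.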

--
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

Dominik Stadler <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 OS|                            |All
             Status|NEW                         |NEEDINFO
            Version|3.10-FINAL                  |3.12-FINAL

--- Comment #1 from Dominik Stadler <[hidden email]> ---
Please provide some more details so other people can reproduce the problem,
i.e. please attach sample files and a self-contained piece of code that
reproduces the problem, ideally as a unit test so we can add it to the
test suite for POI.

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

--- Comment #2 from raveufo <[hidden email]> ---
Created attachment 33002
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33002&action=edit
Sample input

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

--- Comment #3 from raveufo <[hidden email]> ---
Created attachment 33003
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33003&action=edit
Sample output

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

--- Comment #4 from raveufo <[hidden email]> ---
Created attachment 33004
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33004&action=edit
Reproduce source code

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

--- Comment #5 from raveufo <[hidden email]> ---
(In reply to Dominik Stadler from comment #1)
> Please provide some more details so other people can reproduce the problem,
> i.e. please attach sample files and a self-contained piece of code that
> reproduces the problem, ideally as a unit test so we can add it to the
> test suite for POI.

I've attached the sample input, the sample output, and the source code I used
to reproduce this problem.

If I convert the sample input above to an .xls file and read/write it with
HSSF, the characters in the output file are the same as in the input file.

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

raveufo <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |NEW

[Bug 58247] UTF-16 characters in .xlsx file can't write properly

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

Dominik Stadler <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Depends on|                            |54084

--- Comment #7 from Dominik Stadler <[hidden email]> ---
This is likely similar to bug 54084, where we debugged the problem to some
degree; it seems the XMLBeans third-party library is involved here.

[Bug 58247] Some UTF-16 characters are not handled correctly (likely surrogate pair related)

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

Dominik Stadler <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|UTF-16 characters in .xlsx  |Some UTF-16 characters are
                   |file can't write properly   |not handled correctly
                   |                            |(likely surrogate pair
                   |                            |related)
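
To check whether a given cell value falls under the "surrogate pair" suspicion
in the new summary, a small JDK-only helper like the one below could be used
on the string read from the cell. This is only a sketch: the class and method
names are hypothetical, and nothing here is part of the POI API.

```java
public class SupplementaryCheck {

    // True if the text contains at least one code point above U+FFFF,
    // i.e. a character that Java stores as a surrogate pair.
    public static boolean hasSupplementary(String text) {
        return text.codePoints().anyMatch(Character::isSupplementaryCodePoint);
    }

    // True if the text contains a surrogate code unit without its partner.
    // String.codePoints() passes unpaired surrogates through unchanged,
    // so they show up as code points in the surrogate range.
    public static boolean hasLoneSurrogate(String text) {
        return text.codePoints().anyMatch(
                cp -> cp >= Character.MIN_SURROGATE && cp <= Character.MAX_SURROGATE);
    }

    public static void main(String[] args) {
        String paired = "A\uD842\uDFB7B"; // contains U+20BB7
        String broken = "A\uD842B";       // high surrogate with no low surrogate
        System.out.println(hasSupplementary(paired)); // true
        System.out.println(hasLoneSurrogate(paired)); // false
        System.out.println(hasLoneSurrogate(broken)); // true
    }
}
```

Running hasSupplementary over the cells of the sample input would show whether
the cells that come back as "??" are exactly the ones holding such characters.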

[Bug 58247] Some UTF-16 characters are not handled correctly (likely surrogate pair related)

Bugzilla from bugzilla@apache.org
In reply to this post by Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=58247

Dominik Stadler <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Depends on|                            |59268


Referenced Bugs:

https://bz.apache.org/bugzilla/show_bug.cgi?id=59268
[Bug 59268] Work on providing an updated version of XMLBeans