[Bug 61787] New: Text extraction omitting text incorrectly

[Bug 61787] New: Text extraction omitting text incorrectly

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=61787

            Bug ID: 61787
           Summary: Text extraction omitting text incorrectly
           Product: POI
           Version: unspecified
          Hardware: PC
                OS: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: XWPF
          Assignee: [hidden email]
          Reporter: [hidden email]
  Target Milestone: ---

Text extraction omits a run's text when the run carries an rsidDel attribute.
This is incorrect, because rsid* attributes are simply revision session
identifiers: the attribute can be present while the run text is still valid.
Instead of keying on the revision session id attributes, text extraction should
key on the specific revision tags to decide which text to omit. The appropriate
tag to omit is <delText>.
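
The behaviour requested above can be sketched roughly as follows. This is not
POI's extractor code. It is a minimal illustration using only the JDK's DOM
parser, and the class name RunTextSketch, the method extractText, and the
sample paragraph XML are all hypothetical. It collects text from <w:t> children
of each run, skips <w:delText>, and never looks at rsid* attributes.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical sketch, not part of POI: decide what to extract from the
// element name (w:t vs. w:delText), never from rsid* attributes.
final class RunTextSketch {
    static final String W_NS =
        "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    static String extractText(String paragraphXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            DocumentBuilder builder = factory.newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(paragraphXml)));

            StringBuilder out = new StringBuilder();
            NodeList runs = doc.getElementsByTagNameNS(W_NS, "r");
            for (int i = 0; i < runs.getLength(); i++) {
                // Walk the direct children of each run in document order.
                for (Node child = runs.item(i).getFirstChild();
                     child != null;
                     child = child.getNextSibling()) {
                    boolean isWordElement = child.getNodeType() == Node.ELEMENT_NODE
                        && W_NS.equals(child.getNamespaceURI());
                    // Keep <w:t>; <w:delText> (deleted text) is simply not matched.
                    if (isWordElement && "t".equals(child.getLocalName())) {
                        out.append(child.getTextContent());
                    }
                }
            }
            return out.toString();
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse paragraph XML", e);
        }
    }

    public static void main(String[] args) {
        // Assumed sample: the first run has rsidDel but its text is still valid;
        // the second run holds genuinely deleted text inside <w:del>.
        String xml = "<w:p xmlns:w=\"" + W_NS + "\">"
            + "<w:r w:rsidDel=\"00A1B2C3\"><w:t>kept </w:t></w:r>"
            + "<w:del w:id=\"1\" w:author=\"a\"><w:r><w:delText>removed </w:delText></w:r></w:del>"
            + "<w:r><w:t>text</w:t></w:r>"
            + "</w:p>";
        System.out.println(extractText(xml)); // prints "kept text"
    }
}
```

In this sample the run with rsidDel still contributes "kept ", while the
<w:delText> content is dropped, which is the distinction the report asks for.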

--
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

[Bug 61787] Text extraction omitting text incorrectly

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=61787

--- Comment #1 from Mark Murphy <[hidden email]> ---
This issue was introduced by Bug #58067

[Bug 61787] Text extraction omitting text incorrectly

Bugzilla from bugzilla@apache.org
https://bz.apache.org/bugzilla/show_bug.cgi?id=61787

Simon Gaeremynck <[hidden email]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[hidden email]

--- Comment #2 from Simon Gaeremynck <[hidden email]> ---
Created attachment 35540
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=35540&action=edit
A .docx with rsidDel attributes
