missing oleobjectelement.xsb in ooxml-lite?

classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

missing oleobjectelement.xsb in ooxml-lite?

Tim Allison
All,
  Over on Tika [1], I'm getting an exception that oleobjectelement.xsb
can't be found.  When I look in the ooxml-lite.jar, I see there's an
oleobjelement.xsb, but no oleobjectelement.xsb.
  I tried adding the triggering document (EmbeddedDocument.docx) to a
poi unit test[2] and rebuilding 5.0.1-SNAPSHOT locally, but I'm
finding the same issue.
  I do see oleobjectelemebt.xsb in ooxml-full.  Any recommendations on
how to get this into ooxml-lite?

   Thank you!

              Cheers,

                                 Tim

[1] https://issues.apache.org/jira/browse/TIKA-3164?focusedCommentId=17307099&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17307099
[2] try (XWPFDocument doc =
XWPFTestDataSamples.openSampleDocument("EmbeddedDocumentTika.docx")) {
    XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
    String txt = extractor.getText();
}

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: missing oleobjectelement.xsb in ooxml-lite?

Tim Allison
Going back to Andi's point [1]...trying this now.

[1] https://lists.apache.org/x/thread.html/ra9ff58e6af046a51ba459915fe536a2ea1fe71e85329abc4e513711e@%3Cuser.poi.apache.org%3E

On Tue, Mar 23, 2021 at 10:17 AM Tim Allison <[hidden email]> wrote:

>
> All,
>   Over on Tika [1], I'm getting an exception that oleobjectelement.xsb
> can't be found.  When I look in the ooxml-lite.jar, I see there's an
> oleobjelement.xsb, but no oleobjectelement.xsb.
>   I tried adding the triggering document (EmbeddedDocument.docx) to a
> poi unit test[2] and rebuilding 5.0.1-SNAPSHOT locally, but I'm
> finding the same issue.
>   I do see oleobjectelemebt.xsb in ooxml-full.  Any recommendations on
> how to get this into ooxml-lite?
>
>    Thank you!
>
>               Cheers,
>
>                                  Tim
>
> [1] https://issues.apache.org/jira/browse/TIKA-3164?focusedCommentId=17307099&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17307099
> [2] try (XWPFDocument doc =
> XWPFTestDataSamples.openSampleDocument("EmbeddedDocumentTika.docx")) {
>     XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
>     String txt = extractor.getText();
> }

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: missing oleobjectelement.xsb in ooxml-lite?

Tim Allison
All seems to work if I uncomment this line in build.xml:

<include name="org/apache/poi/schemas/ooxml/system/ooxml/*.xsb"/>

Any objections?

On Tue, Mar 23, 2021 at 10:24 AM Tim Allison <[hidden email]> wrote:

>
> Going back to Andi's point [1]...trying this now.
>
> [1] https://lists.apache.org/x/thread.html/ra9ff58e6af046a51ba459915fe536a2ea1fe71e85329abc4e513711e@%3Cuser.poi.apache.org%3E
>
> On Tue, Mar 23, 2021 at 10:17 AM Tim Allison <[hidden email]> wrote:
> >
> > All,
> >   Over on Tika [1], I'm getting an exception that oleobjectelement.xsb
> > can't be found.  When I look in the ooxml-lite.jar, I see there's an
> > oleobjelement.xsb, but no oleobjectelement.xsb.
> >   I tried adding the triggering document (EmbeddedDocument.docx) to a
> > poi unit test[2] and rebuilding 5.0.1-SNAPSHOT locally, but I'm
> > finding the same issue.
> >   I do see oleobjectelemebt.xsb in ooxml-full.  Any recommendations on
> > how to get this into ooxml-lite?
> >
> >    Thank you!
> >
> >               Cheers,
> >
> >                                  Tim
> >
> > [1] https://issues.apache.org/jira/browse/TIKA-3164?focusedCommentId=17307099&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17307099
> > [2] try (XWPFDocument doc =
> > XWPFTestDataSamples.openSampleDocument("EmbeddedDocumentTika.docx")) {
> >     XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
> >     String txt = extractor.getText();
> > }

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: missing oleobjectelement.xsb in ooxml-lite?

kiwiwings
ok, here is my objection (now via the mailing list).
I'll have a look at the source problem.
Uncommenting the *.xsb line makes the optimization superfluous.

Andi

On 23.03.21 15:40, Tim Allison wrote:

> All seems to work if I uncomment this line in build.xml:
>
> <include name="org/apache/poi/schemas/ooxml/system/ooxml/*.xsb"/>
>
> Any objections?
>
> On Tue, Mar 23, 2021 at 10:24 AM Tim Allison <[hidden email]> wrote:
>> Going back to Andi's point [1]...trying this now.
>>
>> [1] https://lists.apache.org/x/thread.html/ra9ff58e6af046a51ba459915fe536a2ea1fe71e85329abc4e513711e@%3Cuser.poi.apache.org%3E
>>
>> On Tue, Mar 23, 2021 at 10:17 AM Tim Allison <[hidden email]> wrote:
>>> All,
>>>    Over on Tika [1], I'm getting an exception that oleobjectelement.xsb
>>> can't be found.  When I look in the ooxml-lite.jar, I see there's an
>>> oleobjelement.xsb, but no oleobjectelement.xsb.
>>>    I tried adding the triggering document (EmbeddedDocument.docx) to a
>>> poi unit test[2] and rebuilding 5.0.1-SNAPSHOT locally, but I'm
>>> finding the same issue.
>>>    I do see oleobjectelemebt.xsb in ooxml-full.  Any recommendations on
>>> how to get this into ooxml-lite?
>>>
>>>     Thank you!
>>>
>>>                Cheers,
>>>
>>>                                   Tim
>>>
>>> [1] https://issues.apache.org/jira/browse/TIKA-3164?focusedCommentId=17307099&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17307099
>>> [2] try (XWPFDocument doc =
>>> XWPFTestDataSamples.openSampleDocument("EmbeddedDocumentTika.docx")) {
>>>      XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
>>>      String txt = extractor.getText();
>>> }
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: missing oleobjectelement.xsb in ooxml-lite?

kiwiwings
I hope it's fixed now - see also the comment in https://issues.apache.org/jira/browse/TIKA-3164

Andi

On 23.03.21 18:08, Andreas Beeker wrote:

> ok, here is my objection (now via the mailing list).
> I'll have a look at the source problem.
> Uncommenting the *.xsb line makes the optimization superfluous.
>
> Andi
>
> On 23.03.21 15:40, Tim Allison wrote:
>> All seems to work if I uncomment this line in build.xml:
>>
>> <include name="org/apache/poi/schemas/ooxml/system/ooxml/*.xsb"/>
>>
>> Any objections?
>>
>> On Tue, Mar 23, 2021 at 10:24 AM Tim Allison <[hidden email]> wrote:
>>> Going back to Andi's point [1]...trying this now.
>>>
>>> [1] https://lists.apache.org/x/thread.html/ra9ff58e6af046a51ba459915fe536a2ea1fe71e85329abc4e513711e@%3Cuser.poi.apache.org%3E
>>>
>>> On Tue, Mar 23, 2021 at 10:17 AM Tim Allison <[hidden email]> wrote:
>>>> All,
>>>>    Over on Tika [1], I'm getting an exception that oleobjectelement.xsb
>>>> can't be found.  When I look in the ooxml-lite.jar, I see there's an
>>>> oleobjelement.xsb, but no oleobjectelement.xsb.
>>>>    I tried adding the triggering document (EmbeddedDocument.docx) to a
>>>> poi unit test[2] and rebuilding 5.0.1-SNAPSHOT locally, but I'm
>>>> finding the same issue.
>>>>    I do see oleobjectelemebt.xsb in ooxml-full.  Any recommendations on
>>>> how to get this into ooxml-lite?
>>>>
>>>>     Thank you!
>>>>
>>>>                Cheers,
>>>>
>>>>                                   Tim
>>>>
>>>> [1] https://issues.apache.org/jira/browse/TIKA-3164?focusedCommentId=17307099&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17307099
>>>> [2] try (XWPFDocument doc =
>>>> XWPFTestDataSamples.openSampleDocument("EmbeddedDocumentTika.docx")) {
>>>>      XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
>>>>      String txt = extractor.getText();
>>>> }
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]