Bug 60470 - Not extracting comments anchored in runs in XWPF
Status: NEW
Alias: None
Product: POI
Classification: Unclassified
Component: XWPF
Version: 3.16-dev
Hardware: PC All
Importance: P2 normal
Target Milestone: ---
Assignee: POI Developers List
Reported: 2016-12-12 14:35 UTC by Tim Allison
Modified: 2016-12-28 19:54 UTC



Attachments
triggering file, based on Tika's testWORD_numbered_list.docx (19.11 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2016-12-12 14:35 UTC, Tim Allison

Description Tim Allison 2016-12-12 14:35:27 UTC
Created attachment 34521
triggering file, based on Tika's testWORD_numbered_list.docx

While working on a SAX parser for XWPF in Tika, I found that neither Tika nor POI extracts comments anchored in runs when using the DOM parser.

Unit test:

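    public void testCommentsAnchoredInRuns() throws IOException {
        // Open the attached triggering file from the test data directory.
        XWPFDocument doc = XWPFTestDataSamples.openSampleDocument("testCommentAnchoredInRun.docx");
        XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
        // The text of the run-anchored comment should appear in the extractor output.
        assertTrue(extractor.getText().contains("add a list"));
    }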
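
For anyone trying to reproduce this without the attachment, here is a minimal sketch of what "anchored in runs" may look like at the XML level. It assumes the anchor is a `w:commentReference` element sitting inside a `w:r` run, which the WordprocessingML schema allows; the attachment's actual markup is not shown in this report, so treat that as an assumption. The class name `RunCommentAnchors` and the method `commentIdsAnchoredInRuns` are hypothetical and are not part of POI or Tika. The sketch uses only the JDK's DOM parser to list the comment ids referenced from inside runs.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper for illustration only; not POI or Tika code.
public class RunCommentAnchors {
    // WordprocessingML main namespace (the "w:" prefix in document.xml).
    static final String W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    /** Returns the w:id of every w:commentReference whose direct parent is a w:r run. */
    static List<String> commentIdsAnchoredInRuns(String documentXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // required for the *NS lookups below
        Document dom = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(documentXml.getBytes(StandardCharsets.UTF_8)));

        List<String> ids = new ArrayList<>();
        NodeList refs = dom.getElementsByTagNameNS(W_NS, "commentReference");
        for (int i = 0; i < refs.getLength(); i++) {
            Element ref = (Element) refs.item(i);
            Node parent = ref.getParentNode();
            // Keep only references that sit directly inside a run (w:r).
            if (parent != null
                    && W_NS.equals(parent.getNamespaceURI())
                    && "r".equals(parent.getLocalName())) {
                ids.add(ref.getAttributeNS(W_NS, "id"));
            }
        }
        return ids;
    }

    public static void main(String[] args) throws Exception {
        // Assumed shape of a paragraph with a run-anchored comment.
        String xml = "<w:document xmlns:w=\"" + W_NS + "\"><w:body><w:p>"
                + "<w:r><w:t>some text</w:t></w:r>"
                + "<w:r><w:commentReference w:id=\"0\"/></w:r>"
                + "</w:p></w:body></w:document>";
        System.out.println(commentIdsAnchoredInRuns(xml)); // prints [0]
    }
}
```

Running this against `word/document.xml` unzipped from the attachment would show whether the comment is referenced from inside a run; the ids it returns correspond to `w:comment` entries in `word/comments.xml`.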