Bug 66467 - Unsupported Sprm operation: 58 (0x3A) when extracting data from a Word 97 document
Status: NEW
Alias: None
Product: POI
Classification: Unclassified
Component: HWPF
Version: 5.2.2-FINAL
Hardware: Macintosh All
Importance: P2 normal with 1 vote
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-02-06 13:44 UTC by George Gastaldi
Modified: 2023-03-11 08:05 UTC



Attachments
Word document reproducing the bug (22.00 KB, application/msword)
2023-02-06 13:44 UTC, George Gastaldi

Description George Gastaldi 2023-02-06 13:44:15 UTC
Created attachment 38497
Word document reproducing the bug

When extracting the text of a Word 97 document, I get the following INFO message in the logs (using poi-scratchpad 5.2.3):

2023-02-06 10:15:08,983 INFO  [org.apa.poi.hwp.spr.SectionSprmUncompressor] (executor-thread-0) Unsupported Sprm operation: 58 (0x3A)
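
For context, the operation number in that message is only one part of a sprm code. The sketch below splits a 16-bit sprm into its fields, assuming the bit layout described for the Sprm structure in the MS-DOC specification (9-bit operation, 1-bit fSpec, 3-bit group, 3-bit size code). The class and method names are hypothetical, and 0x703A is an assumed example value chosen because its operation is 58 and its group is "section"; it was not read from the attached file.

```java
// Hypothetical helper, not part of POI: decodes the fields of a 16-bit sprm.
final class SprmDecoder {

    // Bits 0-8: operation (ispmd), the number reported in the log message.
    static int operation(int sprm) {
        return sprm & 0x1FF;
    }

    // Bit 9: fSpec, set when the sprm needs special handling.
    static boolean special(int sprm) {
        return (sprm & 0x200) != 0;
    }

    // Bits 10-12: sgc, the property group the sprm modifies.
    static int group(int sprm) {
        return (sprm >> 10) & 0x7;
    }

    // Bits 13-15: spra, the code that determines the operand size.
    static int sizeCode(int sprm) {
        return (sprm >> 13) & 0x7;
    }

    static String groupName(int sgc) {
        switch (sgc) {
            case 1: return "paragraph";
            case 2: return "character";
            case 3: return "picture";
            case 4: return "section";
            case 5: return "table";
            default: return "unknown";
        }
    }

    static String describe(int sprm) {
        int op = operation(sprm);
        int sgc = group(sprm);
        return String.format("0x%04X: operation=%d (0x%02X), fSpec=%b, sgc=%d (%s), spra=%d",
                sprm, op, op, special(sprm), sgc, groupName(sgc), sizeCode(sprm));
    }

    public static void main(String[] args) {
        // Assumed example value; prints:
        // 0x703A: operation=58 (0x3A), fSpec=false, sgc=4 (section), spra=3
        System.out.println(describe(0x703A));
    }
}
```

Since the message comes from SectionSprmUncompressor, the unsupported sprm is presumably a section-group one (sgc = 4), which is why a section-group value is used in the example.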


The following unit test reproduces the issue (using the attached .doc file): 

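    @Test
    void testSprmOperation58() throws IOException {
        // Open the attached sample document via the test's sample-file helper
        try (HWPFDocument hwpfDocument = openSampleFile("hello_poi.doc")) {
            // Extract the plain text and compare it with the expected content
            WordExtractor extractor = new WordExtractor(hwpfDocument);
            assertEquals("Hello POI", extractor.getText().trim());
        }
    }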