Created attachment 33491 [details]
Problematic unparseable document

I get the following exception:

java.lang.IllegalArgumentException: Value for parameter 'id' was out of bounds
    at org.apache.poi.util.IdentifierManager.reserve(IdentifierManager.java:80)
    at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:111)
    at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:156)
    at org.apache.poi.xwpf.usermodel.XWPFParagraph.buildRunsInOrderFromXml(XWPFParagraph.java:134)
    at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:87)
    at org.apache.poi.xwpf.usermodel.XWPFHeader.onDocumentRead(XWPFHeader.java:116)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:183)
    at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:166)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:118)
    at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:59)
    at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:181)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86)
    ... 99 more

This happens when parsing the attached document with Tika, which (according to its POM) uses POI 3.13. The document was created using LibreOffice.
Created attachment 33492 [details]
Error when opening the file with Microsoft Office
When using the latest POI, I get a different error:

Caused by: org.apache.xmlbeans.XmlException: error: duplicate attribute 'w:eastAsia'
    at org.apache.xmlbeans.impl.store.Locale$SaxLoader.load(Locale.java:3511)

When I try to open the file in Microsoft Word, I get a similar error message, see screenshot. So it seems this file was not written correctly in the first place. Please raise this with the LibreOffice developers to find out whether there is a bug on their side.
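To confirm independently of POI that the file itself is malformed, one option is to run each XML part of the .docx package through a plain XML parser. A duplicate attribute such as 'w:eastAsia' is a well-formedness error, so any conforming parser should reject it. The following is only a rough sketch using JDK classes; the class name DocxWellFormednessCheck and the method findMalformedParts are made up for illustration and are not part of POI or Tika.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.XMLConstants;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXException;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper (not a POI or Tika class): reports the XML parts of an
// OOXML package that are not well-formed XML.
class DocxWellFormednessCheck {

    public static List<String> findMalformedParts(InputStream docx)
            throws IOException, ParserConfigurationException, SAXException {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);

        List<String> problems = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                boolean isXmlPart = name.endsWith(".xml") || name.endsWith(".rels");
                if (entry.isDirectory() || !isXmlPart) {
                    continue;
                }
                // Copy the entry first so the parser cannot close the zip stream.
                // readAllBytes() requires Java 9 or later.
                byte[] bytes = zip.readAllBytes();
                try {
                    SAXParser parser = factory.newSAXParser();
                    // DefaultHandler rethrows fatal errors such as duplicate attributes.
                    parser.parse(new ByteArrayInputStream(bytes), new DefaultHandler());
                } catch (SAXParseException e) {
                    problems.add(name + ":" + e.getLineNumber() + ":"
                            + e.getColumnNumber() + ": " + e.getMessage());
                }
            }
        }
        return problems;
    }
}
```

Calling findMalformedParts(new FileInputStream("problem.docx")) (the file name is a placeholder) returns one entry per broken part, with the part name and the position of the first error. This only checks well-formedness; it does not validate against the OOXML schemas, and only the first error in each part is reported.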