Hi,

In the method ExtractorFactory.createExtractor(File), an OPCPackage is created with OPCPackage.open(f.toString()). This opens the package with PackageAccess.READ_WRITE. IMHO, a text extractor should open it with PackageAccess.READ instead.

Opening with READ_WRITE may cause POI to modify the file when the package's close method is called, which is not something a client would expect from a text extractor. In my case, this caused my program to get stuck in an infinite loop: it kept reindexing files because it thought they had been modified.

Best regards
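To illustrate why read-only access matters here, the following is a minimal JDK-only sketch, not POI code. It assumes a hypothetical class ReadOnlyAccessSketch and uses a plain FileChannel as a stand-in for the package: a channel opened with StandardOpenOption.READ rejects writes, so reading through it cannot change the file that an indexer is watching. In POI itself, the analogous change would presumably be to call the two-argument overload OPCPackage.open(f.toString(), PackageAccess.READ).

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.channels.NonWritableChannelException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical illustration class; it does not use or reproduce any POI API.
public class ReadOnlyAccessSketch {

    /** Reads the whole file through a channel opened for reading only. */
    static String readAll(Path file) throws IOException {
        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ)) {
            ByteBuffer buf = ByteBuffer.allocate((int) ch.size());
            // Keep reading until the buffer is full or end of file is reached.
            while (buf.hasRemaining() && ch.read(buf) != -1) {
                // nothing to do, read() advances the buffer position
            }
            return new String(buf.array(), 0, buf.position(), StandardCharsets.UTF_8);
        }
    }

    /** Returns true if a write attempt on a read-only channel is rejected. */
    static boolean writeIsRejected(Path file) throws IOException {
        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ)) {
            ch.write(ByteBuffer.wrap(new byte[] {1}));
            return false; // the write went through, so the channel was writable
        } catch (NonWritableChannelException e) {
            return true;  // read-only access refused the write
        }
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("extract-me", ".txt");
        Files.write(file, "some text".getBytes(StandardCharsets.UTF_8));

        long before = Files.getLastModifiedTime(file).toMillis();
        String text = readAll(file);
        long after = Files.getLastModifiedTime(file).toMillis();

        // An indexer that compares timestamps would only reindex if these differ.
        System.out.println("text=" + text + ", timestamp changed=" + (before != after));
        System.out.println("write rejected=" + writeIsRejected(file));
        Files.delete(file);
    }
}
```

The design point is that the access mode is chosen at open time, so a component that only needs to read can rule out accidental writes on close rather than rely on never triggering them.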
This was actually fixed in r1652877 already.