Word2Text not succcess when the ms word containe hebrew letter i can't create index for doc file in hebrew
I saw that we had a fix for Arabic sometime ago. I'm not sure if it applies to word2text and Hebrew. If you still care about this functionality would you mind testing a recent POI release and then providing a simple test case doc file that we can use to assure that once any fix has been made it does not get broken later.
I believe that unicode text works fine in recent POI releases with HWPF. Please re-open the bug however if your problem still remains when you try a recent POI release (eg POI 3.8 beta 1) with HWPF