Apache OpenOffice (AOO) Bugzilla – Issue 101327
Extra spaces in PDF import of Hebrew documents
Last modified: 2010-04-30 11:15:50 UTC
When importing the attached Hebrew PDF document, I saw extra spaces in the text. The cause is that the PDF importer does not treat a non-breaking space (character code 160, U+00A0) like a breaking space (character code 32, U+0020). I changed PDFIProcessor::drawGlyphLine to treat a non-breaking space like a breaking space, which solved the problem. Note: the words are now split correctly, but the spaces between them are still too wide. That will be reported in a separate issue.
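For illustration only, the idea behind the change can be sketched as below. This is not the attached patch and not the actual OpenOffice code: isWordSeparator and splitWords are hypothetical names, and the sketch assumes the text of a glyph line is available as a sequence of Unicode code points. It only shows the two space characters being treated as the same kind of word boundary.

```cpp
#include <string>
#include <vector>

// Hypothetical helper (not part of the OpenOffice sources): returns true
// for code points that should end a word during text extraction.
bool isWordSeparator(char32_t c)
{
    return c == 0x0020   // SPACE (32)
        || c == 0x00A0;  // NO-BREAK SPACE (160)
}

// Hypothetical helper: splits a line of code points into words, treating
// both kinds of space as a boundary and dropping the separators.
std::vector<std::u32string> splitWords(const std::u32string& line)
{
    std::vector<std::u32string> words;
    std::u32string current;
    for (char32_t c : line)
    {
        if (isWordSeparator(c))
        {
            // Close the current word, if any; consecutive separators
            // do not produce empty words.
            if (!current.empty())
            {
                words.push_back(current);
                current.clear();
            }
        }
        else
        {
            current.push_back(c);
        }
    }
    if (!current.empty())
        words.push_back(current);
    return words;
}
```

With this check, two Hebrew words joined by U+00A0 are split into two words exactly as if they were joined by U+0020.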
Created attachment 61795 [details] Sample Hebrew PDF document
Created attachment 61796 [details] proposed patch
reassign
committed in CWS pdfextfix02
please verify in CWS pdfextfix02
@mru: thanks for taking over
Verified in CWS pdfextfix02.
Closed.