Issue 81854

Summary: Combine OCR with PDF import
Product: Writer Reporter: phills <phil406>
Component: open-importAssignee: AOO issues mailing list <issues>
Status: CONFIRMED --- QA Contact:
Severity: Trivial    
Priority: P3 CC: issues, oo
Version: OOo 1.0.0   
Target Milestone: ---   
Hardware: All   
OS: All   
Issue Type: ENHANCEMENT Latest Confirmation in: ---
Developer Difficulty: ---

Description phills 2007-09-22 22:31:58 UTC
Since the next new release will import PDFs (I hear), then this is a good time
to consider integrating OCR into the new feature.  People don't necessarily know
beforehand if a PDF was scanned or generated directly from Adobe.  

In law firms, attorneys frequently receive batches of PDFs by email to which
they must respond by quoting parts of other PDFs.  It's understood that OCR
isn't a perfect science, but getting 90% of the characters right is lot of time
and retyping that would be saved.

Thus integrating OCR will (1) increase functionality by making people more
efficient, (2) reduce the inevitable support questions of "why my PDF didn't
import", and (3) expand the potential market segment.  Thank you.
Comment 1 eric.savary 2007-09-25 10:51:19 UTC
Reassigned
Comment 2 cianoz 2008-09-08 23:06:46 UTC
*** Issue 81854 has been confirmed by votes. ***
Comment 3 amoose136 2008-11-01 05:37:06 UTC
This isn't getting enough attention. I really hope that someone makes this.
Linux really lacks OCR, not that it's a particularly easy thing to code...