Issue 81854 - Combine OCR with PDF import
Summary: Combine OCR with PDF import
Alias: None
Product: Writer
Classification: Application
Component: open-import (show other issues)
Version: OOo 1.0.0
Hardware: All All
: P3 Trivial with 8 votes (vote)
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
Depends on:
Reported: 2007-09-22 22:31 UTC by phills
Modified: 2013-08-07 14:38 UTC (History)
2 users (show)

See Also:
Latest Confirmation in: ---
Developer Difficulty: ---


Note You need to log in before you can comment on or make changes to this issue.
Description phills 2007-09-22 22:31:58 UTC
Since the next new release will import PDFs (I hear), then this is a good time
to consider integrating OCR into the new feature.  People don't necessarily know
beforehand if a PDF was scanned or generated directly from Adobe.  

In law firms, attorneys frequently receive batches of PDFs by email to which
they must respond by quoting parts of other PDFs.  It's understood that OCR
isn't a perfect science, but getting 90% of the characters right is lot of time
and retyping that would be saved.

Thus integrating OCR will (1) increase functionality by making people more
efficient, (2) reduce the inevitable support questions of "why my PDF didn't
import", and (3) expand the potential market segment.  Thank you.
Comment 1 eric.savary 2007-09-25 10:51:19 UTC
Comment 2 cianoz 2008-09-08 23:06:46 UTC
*** Issue 81854 has been confirmed by votes. ***
Comment 3 amoose136 2008-11-01 05:37:06 UTC
This isn't getting enough attention. I really hope that someone makes this.
Linux really lacks OCR, not that it's a particularly easy thing to code...