Issue 81854

Summary:	Combine OCR with PDF import
Product:	Writer	Reporter:	phills <phil406>
Component:	open-import	Assignee:	AOO issues mailing list <issues>
Status:	CONFIRMED ---	QA Contact:
Severity:	Trivial
Priority:	P3	CC:	issues, oo
Version:	OOo 1.0.0
Target Milestone:	---
Hardware:	All
OS:	All
Issue Type:	ENHANCEMENT	Latest Confirmation in:	---
Developer Difficulty:	---

Description phills 2007-09-22 22:31:58 UTC

Since the next new release will import PDFs (I hear), then this is a good time
to consider integrating OCR into the new feature.  People don't necessarily know
beforehand if a PDF was scanned or generated directly from Adobe.  

In law firms, attorneys frequently receive batches of PDFs by email to which
they must respond by quoting parts of other PDFs.  It's understood that OCR
isn't a perfect science, but getting 90% of the characters right is lot of time
and retyping that would be saved.

Thus integrating OCR will (1) increase functionality by making people more
efficient, (2) reduce the inevitable support questions of "why my PDF didn't
import", and (3) expand the potential market segment.  Thank you.

Comment 1 eric.savary 2007-09-25 10:51:19 UTC

Reassigned

Comment 2 cianoz 2008-09-08 23:06:46 UTC

*** Issue 81854 has been confirmed by votes. ***

Comment 3 amoose136 2008-11-01 05:37:06 UTC

This isn't getting enough attention. I really hope that someone makes this.
Linux really lacks OCR, not that it's a particularly easy thing to code...