spridgets
[Top] [All Lists]

RE: Those effing CDs that rock

To: Duncan Sinclair <duncan@pondhop.com>
Subject: RE: Those effing CDs that rock
Date: Mon, 17 Oct 2005 20:36:25 -0400 with any abuse report
Cc: "'Jay Fishbein'" <type79@ix.netcom.com>, Spridget List <spridgets@autox.team.net>
References: <003901c5d379$b65c6820$6501a8c0@Pondhop.local>
Duncan,

Having dealt with thousands of lines of Parts List descriptions, I can
tell you that current OCR technology is next to useless on these BMC
publications.

Not only is the text font very hard to OCR, but any OCR mistake will
render a search engine useless (human eyes can see that PMVV!039 is the
same as PMW1039, but a computer can't - ask me how I know!)

John
 
On Mon, 2005-10-17 at 20:20, Duncan Sinclair wrote:

> Acrobat 7.0 has a feature to use OCR to convert pdf image files to text ones
> where possible. 
> 
> Duncan




<Prev in Thread] Current Thread [Next in Thread>