Ask a Question related to Adobe Acrobat Windows, Design and Development.
-
ray_mayer@adobeforums.com #1
OCR (optical character recognition) with scan documents stored in pdf format
We have converted a substantial number of drawings and documents into pdf format via several methods;
1)large format flatbed scanner (scan to tif then convert to pdf)
2)desktop scanner found on a multifunction printer/copier/scanner (scan to tif then convert to pdf)
3)print to file using the pdf distiller (distill word, excel, autocad, etc) directly to pdf
We found that it would be very useful to "find" or "search" for words and phrases, etc. using a search command like the one in the adobe 6.0 reader.
does adobe provide a product or does a third party add-on exist that would provide this type of capability?
thanks in advance for your support,
ray
ray_mayer@adobeforums.com Guest
-
Acro Scan v. Photoshop Scan (CS3)
Peter, Acrobat is taking whatever is sent to it via the twain driver and wrapping the result in a pdf file. The twain driver could be sending a... -
Virus Scan for uploaded documents?
Platform: ASP.Net 1.1/C# Web App ========================== Folks, I am trying to build a functionality to check any documents being uploaded... -
Index scan vs. Seq scan on timestamps
On Tue, Dec 07, 2004 at 09:25:20AM +0100, Per Jensen wrote: CURRENT_TIMESTAMP is fixed to the time of transaction start, not session start; this... -
Saving InDesign documents in a previous format
The graphic community is been so loyal to Adobe by buying their products and not using piracy but I don’t thing Adobe is been fair to us. Their last... -
Line spacing, file format recognition & effects
Three problems that I'm having: 1] When I change the size of a piece of text, the leading jumps to 100+ (instead of 0,1, whatever) for the entire... -
W_T_Allen@adobeforums.com #2
Re: OCR (optical character recognition) with scan documents stored in pdf format
Are you looking for something that will perform OCR on your image documents or are you looking for something that will build indexes to allow searching across large sets of your (already OCR-ed) documents?
W_T_Allen@adobeforums.com Guest
-
Captain #3
Re: OCR (optical character recognition) with scan documents stored in pdf format
[email]ray_mayer@adobeforums.com[/email] wrote in message news:<3bb427c6.-1@webx.la2eafNXanI>...
> We have converted a substantial number of drawings and documents into pdf format via several methods;Yes, Adobe has Acrobat Capture for this very purpose. Many third>
> does adobe provide a product or does a third party add-on exist that would provide this type of capability?
>
> thanks in advance for your support,
>
> ray
party softwares exist and do a great job. I prefer Finereader. Be
prepared to review and rectify capture suspects/errors. It is
tedious.
Ravi
Captain Guest
-
ray_mayer@adobeforums.com #4
Re: OCR (optical character recognition) with scan documents stored in pdf format
presently our need is to find words in a single open document that has been scanned on a flat bed scanner that did not use any OCR software during the scan process.
ray_mayer@adobeforums.com Guest
-
W_T_Allen@adobeforums.com #5
Re: OCR (optical character recognition) with scan documents stored in pdf format
Acrobat can do this, but for large numbers of documents or for large-format images you should look into getting Capture.
W_T_Allen@adobeforums.com Guest
-
ray_mayer@adobeforums.com #6
Re: OCR (optical character recognition) with scan documents stored in pdf format
thanks for the response.
am presently looking into capture.
how can i do this with my adobe 5.0 or adobe 6.0 reader?
have tried the search tool which does not work on my scanned document.
any thoughts?
thanks
ray_mayer@adobeforums.com Guest
-
Simon_Gill@adobeforums.com #7
Re: OCR (optical character recognition) with scan documents stored in pdf format
I think that anything that has been converted to PDF from Word, Excel, etc. using acrobat distiller can be easily searched using the Acrobat Search Function. You could also build an Acrobat Catalog of all your documents and then you would be able to search across all your documents for specific words or phrases. If you plan to create a catalog, may I suggest that you use Acrobat 5. I have found Acrobat 6's Catalog function to be highly inefficient - it tends to create very large index files which far exceed the aggregate size of the cataloged files.
However, for the tif files converted to pdf, you are out of luck since these are just image files and there are no words to search. You could resolve this problem by having the documents OCRed (I use the OmniPage software to do that), or alternatively, you could enter certain key words in the document information page, but those would be the only words you will be able to search for.
Simon_Gill@adobeforums.com Guest
-
W_T_Allen@adobeforums.com #8
Re: OCR (optical character recognition) with scan documents stored in pdf format
Yes, the PDF created from Word docs can already be searched, but the scanned images need to be OCRed, which is why I recommended Capture if there were going to be lots of documents. If it's not many, Acrobat already has a built-in OCR engine.
Ray, you cannot do this with Reader of any version, you need Acrobat or Capture.
W_T_Allen@adobeforums.com Guest
-
Captain #9
Re: OCR (optical character recognition) with scan documents stored in pdf format
[email]ray_mayer@adobeforums.com[/email] wrote in message news:<3bb427c6.1@webx.la2eafNXanI>...
You cannot search for words till you cary out optical character> presently our need is to find words in a single open document that has been scanned on a flat bed scanner that did not use any OCR software during the scan process.
recognition on the document. If OCR was not carried out at the time
of scan you have to do so now. Hopefully, the scan parameters would
support a good OCR processing.
Ravi
Captain Guest



Reply With Quote

