Create searchable (Images) in PDFs via the SDK (by using .Net)


  1. #1

    Create searchable (Images) in PDFs via the SDK (by using .Net)

    Good afternoon,

    We're currently starting to roll out a PDF document generation solution based on XSL-FO transformation from XML to PDF files. It all works perfectly fine, but one current constraint is that text inside embedded images is not searchable by users.

    Now, I understand that you can in fact do this manually if you have Acrobat Professional installed, using its Text Recognition/OCR functionality, but I was wondering whether the same technology is available through the SDK.

    As I haven't worked with the SDK myself yet, I'd also like to know how well it performs in .NET/C#-based applications. Are there any specific constraints or limitations that this environment would cause?

    The way I would want to implement it is to add an optional post-processing step after the XSL-FO transformation (which takes place on a central server with a queue etc.) that would perform this text recognition. Since we create a couple hundred thousand pages each day, reliability and scalability are a big factor. One document-generation server currently creates a couple of documents in parallel on a standard multicore machine with a couple of gigabytes of RAM. Has anyone tested the SDK's functionality extensively under heavy load?

    I am just double-checking because we've hit several walls in the past with this kind of under-communicated limitation from third-party vendors.

    Cheers and thanks,
    -Jörg Battermann
    Jörg_Battermann@adobeforums.com Guest

  2. #2

    Re: Create searchable (Images) in PDFs via the SDK (by using .Net)

    > but I was wondering whether the same technology is available through the SDK.

    Yes, because the SDK only automates Acrobat. It is not a self-contained or redistributable SDK: applications developed with it require a copy of Acrobat installed on the same machine in order to run.

    > The way I would want to implement it is to add an (optional) post-processing step after the xsl-fo transformation (which takes place on a central server with a queue etc) which would perform this text recognition

    Then you will want to look at a different product, since Acrobat's EULA prohibits installing it on a server as part of a purely automated workflow.

    > but since we create a couple hundred thousand pages each day, reliability and scalability are a big factor.

    Again, Acrobat is not technically suitable for this type of environment.

    > but has anyone tested the sdk's functionality under heavy load extensively?

    No, since again the SDK only automates Acrobat, and Acrobat is neither technically suited nor licensed for this type of high-availability server environment.

    Have you contacted Adobe to find out whether any of their LiveCycle Server products will suit your needs? LiveCycle PDF Generator has OCR functionality. I'm not sure whether it will directly suit your needs (you should contact Adobe directly for that answer), but I know it can OCR file types such as TIFF, and it was specifically designed to work in a high-availability server environment.
    PDL@adobeforums.com Guest
