Ask a Question related to Adobe Acrobat SDK, Design and Development.

  1. #1

    Default PDF Parsing

    Hello,
    I'm writing a PDF Parser and I want to recognize lines or Logic units(paragraphs or sentences) in the PDF stream objects.
    I need to take a whole line as it seen in the Acrobat Reader and perform directionality transformation on it
    Can I do it, even if there is no Hard coded new line in the extracted PDF?

    Thank you, Tamir.
    Tamir_Noach@adobeforums.com Guest

  2. Similar Questions and Discussions

    1. How do i Parsing xml
      This how my customers xml is coming across and can not change. <month name="January"> <day> <date>01</date> <name>Thursday</name> <time>12:03...
    2. parsing XML
      Why won't this work? I just want to create one Node for each <placemark> tag in the xml. the xmlData trace gets: <kml...
    3. Parsing URL
      How do I go about parsing a url from the browser location. For example if I have the following url: ...
    4. Parsing PHP
      I am using PHP to develop and web app. The app also has a scripting language for the *end user*. I was thinking if I could expose a very simple...
    5. [PHP] Parsing PHP
      There is the tokenizer extension... http://www.php.net/tokenizer This might give you a good start. -- Peter James petej@phparch.com ...
  3. #2

    Default Re: PDF Parsing

    First thing you need to do is read the PDF Reference. After doing so, you will understand that there is no such "logic unit" as a paragraph or sentence in PDF but instead is a series of drawing instructions.

    Leonard
    Leonard_Rosenthol@adobeforums.com Guest

Posting Permissions

  • You may not post new threads
  • You may post replies
  • You may not post attachments
  • You may not edit your posts

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139