JustinCase #1
Extracting text from pdf
Hi,
I have to index the text of a pdf document.
Does any of you know of a PHP script/extension or a binary that is able
to extract the text?
The pdf extension mentioned in the php.net docs seems to indicate that
it's for _creation_ of documents only, is that so? Same with all the
PHP classes I have found.
Regards,
Johnny
--
Never express yourself more clearly than you are able to think.
- Niels Bohr
JustinCase Guest
-
Alvaro #2
Re: Extracting text from pdf
*** JustinCase wrote (25 Oct 2004 16:09:36 GMT):
There's a Unix program that might help you: ps2ascii
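For anyone wanting to call it from PHP, here is a minimal sketch, not a tested recipe. It assumes ps2ascii (part of Ghostscript) is installed and on the PATH, that the server is a Unix-like system, and that PHP is allowed to run shell commands. The function names are made up for illustration; only ps2ascii itself comes from this thread.

```php
<?php
// Hypothetical helper names; only ps2ascii itself comes from the thread.

// Build the command line. With a single input argument, ps2ascii
// writes the extracted text to standard output.
function build_extract_command(string $pdfPath, string $binary = 'ps2ascii'): string
{
    return escapeshellcmd($binary) . ' ' . escapeshellarg($pdfPath);
}

// Return the extracted text, or null if the file is unreadable
// or the command exits with a non-zero status.
function extract_pdf_text(string $pdfPath, string $binary = 'ps2ascii'): ?string
{
    if (!is_readable($pdfPath)) {
        return null;
    }
    $lines = [];
    $status = 1;
    exec(build_extract_command($pdfPath, $binary) . ' 2>/dev/null', $lines, $status);
    return $status === 0 ? implode("\n", $lines) : null;
}
```

Something like `$text = extract_pdf_text('/path/to/document.pdf');` would then give a string to hand to the indexer, or null if the extraction failed. The path is passed through escapeshellarg() so that file names with spaces or quotes do not break the command.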
--
-- Álvaro G. Vicario - Burgos, Spain
-- Thank you for not e-mailing me your questions
--
Alvaro Guest
-
JustinCase #3
Re: Extracting text from pdf
On 25-10-2004 Alvaro G Vicario wrote:
>
>There's a Unix program that might help you: ps2ascii
Thanks for the pointer,
I'll have a look
/Johnny
--
He's turned his life around. He used to be depressed and miserable. Now
he's miserable and depressed.
- David Frost
JustinCase Guest
-
JustinCase #4
Re: Extracting text from pdf
On 25-10-2004 Alvaro G Vicario wrote:
>
>There's a Unix program that might help you: ps2ascii
Does anyone know of any other tool for PDF text extraction?
ps2ascii cannot seem to parse all of the pdf file. I tried the pstotext
tool too, but with the same result.
I figured that it has something to do with my ghostscript version being
too old (7.05, newest is 8.14).
Unfortunately I have no experience in installing/upgrading unix stuff
(having spent half an evening trying in vain and confusion).
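In case it helps someone with the same suspicion, a rough sketch of checking the installed Ghostscript version from PHP. It assumes the gs binary is on the PATH and that `gs --version` prints only the version number. The helper names are hypothetical, and 8.14 is simply the newest release mentioned above, not a known minimum for ps2ascii.

```php
<?php
// Hypothetical helpers for illustration only.

// Ask Ghostscript for its version; returns null if gs cannot be run.
function installed_ghostscript_version(): ?string
{
    $out = [];
    $status = 1;
    exec('gs --version 2>/dev/null', $out, $status);
    return ($status === 0 && isset($out[0])) ? trim($out[0]) : null;
}

// True when $installed is strictly older than $required.
function is_older_version(string $installed, string $required): bool
{
    return version_compare($installed, $required, '<');
}
```

With the numbers from this post, `is_older_version('7.05', '8.14')` is true, which only confirms the version is older, not that the age is what breaks the extraction.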
Regards,
Johnny
--
In the beginning the Universe was created. This has made a lot of
people very angry and been widely regarded as a bad move.
- Douglas Adams
JustinCase Guest
-
Kurt #5
Re: Extracting text from pdf
Adobe's website will convert a pdf file that is on a website to html. Try
http://www.adobe.com/products/acrobat/access_onlinetools.html
Kurt Guest




