Hello everybody,

I was set a problem to figure out how to convert PDF documents to
another proprietary vector format.
The main rub I've struck against is how to extract text content (with
all metrics and position) and corresponding font information from PDF
file. The most convenient way of solving this problem I think is to
find any intermediate file format to convert PDF to instead of
parsing raw pdf stream.
Does anybody know any suitable and simple file formats (XML or smth.
else) I could convert PDF to using stable
software ( may be from Adobe, it's desirable of course ) and what kind
of software it could be ?

Best regards,
Serg.