Convert PDF to Text?

bdk at unb.ca bdk at unb.ca
Sat Apr 21 23:39:24 UTC 2007


I think pdftohtml is part of

poppler-utils


....Brian Kaye
....UNB


On Sun, 22 Apr 2007, Keith G. Robertson-Turner wrote:

> Date: Sun, 22 Apr 2007 00:26:03 +0100
> From: Keith G. Robertson-Turner <fedora-gmane.00003 at genesis-x.nildram.co.uk>
> Reply-To: For users of Fedora <fedora-list at redhat.com>
> To: fedora-list at redhat.com
> Subject: Re: Convert PDF to Text?
> 
> Verily I say unto thee, that Frank Cox spake thusly:
>> On Sat, 21 Apr 2007 22:31:51 +0100
>> "Keith G. Robertson-Turner" <fedora-gmane.00003 at genesis-x.nildram.co.uk> wrote:
>>
>>> Is there any command I can use to extract the text from these PDF
>>> documents in a batch? I have a couple of thousand documents that need
>>> converting.
>>
>> man pdftohtml
>
> ~]$ man pdftohtml
> No manual entry for pdftohtml
>
> ~]$ sudo yum install pdf2html
> Nothing to do
>
> I downloaded the tarball from SourceForge, but the build fails with:
>
> make[1]: Entering directory `/home/kgr/Desktop/pdftohtml-0.39/src'
> g++ -g -O2 -DHAVE_CONFIG_H -DHAVE_DIRENT_H=1  -I.. -DHAVE_REWINDDIR=1
> -DHAVE_POPEN=1 -I.. -I../goo -I../xpdf -I../fofi -I../splash -I
>  -I/usr/X11R6/include -c HtmlOutputDev.cc
> HtmlLinks.h:22: error: extra qualification ‘HtmlLink::’ on member
> ‘isEqualDest’
> make[1]: *** [HtmlOutputDev.o] Error 1
> make[1]: Leaving directory `/home/kgr/Desktop/pdftohtml-0.39/src'
> make: *** [all] Error 2
>
> Anything else I could try?
>
>


More information about the users mailing list