Scanning Text Doc.

Paul Smith phhs80 at gmail.com
Mon Apr 12 15:58:10 UTC 2010


On Mon, Apr 12, 2010 at 4:23 PM, Joachim Backes
<joachim.backes at rhrk.uni-kl.de> wrote:
>> FC12/KDE/X86_64
>>
>> I'm having a problem saving a scanned text doc.
>>
>> I can save the scanned doc as an image (jpg, png) and as ps and pdf, but
>> it won't save the scanned document as a text file. When it's saved as
>> out.txt, it looks like a scrambled bunch of characters. It scans the
>> doc and shows it is a text file, but saving it is the problem.
>>
>> gocr.rpm is installed
>
> Hi Jim,
>
> Initially, I used gocr too, but IMHO its text recognition is not good, so
> in the meantime I installed *tesseract*, which has rather good text
> recognition.
>
> I scan with *xsane* into .tif files, because
> if I remember correctly, tesseract needs .tif files as input:
>
> tesseract <input>.tif <output>
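
In case it helps, here is a minimal sketch of running that command over a
whole directory of scans. The function name ocr_dir and the directory
argument are my own assumptions for illustration; the only tesseract
behaviour relied on is that it appends .txt to the output base name.

```shell
#!/bin/sh
# Sketch: OCR every .tif file in a directory with tesseract.
# Assumes tesseract is installed and on PATH.
# ocr_dir is a hypothetical helper name, not part of tesseract.
ocr_dir() {
    dir=$1
    for tif in "$dir"/*.tif; do
        [ -e "$tif" ] || continue   # skip if the glob matched nothing
        base=${tif%.tif}            # strip the .tif extension
        tesseract "$tif" "$base"    # tesseract writes the text to $base.txt
    done
}
```

Calling it as "ocr_dir scans" (where scans is a hypothetical directory of
xsane output) should leave a .txt file next to each .tif file.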

A program that I like very much for scanning, and which works together
with tesseract, is gscan2pdf (available via yum).

Paul

