OCR in Fedora?

Valent Turkovic valent.turkovic at gmail.com
Mon Jul 21 14:37:28 UTC 2008


2008/7/21 joachim.backes at rhrk.uni-kl.de <joachim.backes at rhrk.uni-kl.de>:
> Valent Turkovic wrote:
>>
>> On Mon, Jul 21, 2008 at 12:13 PM, Paul Smith <phhs80 at gmail.com> wrote:
>>>
>>> 2008/7/21 joachim.backes at rhrk.uni-kl.de <joachim.backes at rhrk.uni-kl.de>:
>>>>>
>>>>> Does anybody do OCR using software available in Fedora? Which ones do
>>>>> you use? How do you use them?
>>>>> I saw an article about OCRopus [1] and how great app it is but there
>>>>> is no ocropus in fedora currently.
>>>>>
>>>>> [1]
>>>>>
>>>>> http://arstechnica.com/news.ars/post/20071024-hands-on-with-googles-ocropus-open-source-scanning-software.html
>>>>
>>>> I use gocr-0.45-2.fc9.i386
>>>>
>>>> I think it comes from the fedora repo.
>>>
>>> Tesseract is better:
>>>
>>> yum install tesseract
>>>
>>> Paul
>>>
>>> --
>>> fedora-list mailing list
>>> fedora-list at redhat.com
>>> To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list
>>>
>>
>> Hi Joachim and Paul,
>> do gocr and tesseract have GUIs? How are you using them? Do you get
>> formated text or just plain text file? Do gocr and tesseract recognise
>> colums? Is it possible to get formated OpenOffice Writer document that
>> matches the original scanned page?
>>
>> I read the article I posed the link to about OCRopus and it seams that
>> uses tesseract but it somehow improved.
>>
>> Cheers,
>> Valent.
>>
>>
> Hi Valent,
>
> gocr is ia simple CLI which reads from a file containing text as graphics
> and writes only plain text to stdout.
> It has no features such as the WIN$ ocr tools.
>
> --
> Joachim Backes <joachim.backes at rhrk.uni-kl.de>
>
>
> --
> fedora-list mailing list
> fedora-list at redhat.com
> To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list
>


I guess that it the strenght of OCRopus because it should have that
feature if I'm not mistaken. Unfortunatelly OCRopus is not in Fedore
repos so I can't test it. I hope it gets there soon.

Valent.

-- 
http://kernelreloaded.blog385.com/
linux, blog, anime, spirituality, windsurf, wireless
registered as user #367004 with the Linux Counter, http://counter.li.org.
ICQ: 2125241, Skype: valent.turkovic




More information about the users mailing list