[To the index]

Scanning with Kooka or xscanimage, Optical Character Recognition (OCR) with gocr

Scan text with  Kooka or xscanimage.

Use gocr to recognize characters and text.

Remark: When  clicking is mentioned, in general clicking with your left mouse button is meant.


Scanning with Kooka

Be sure that your scanner is connected, switched on and ready for use before you boot Knoppix. Furthermore, your scanner must be recognized by Knoppix, as described in Lesson 2.

Put something to scan on your scanner.

k_in_gear Click at the Knoppix-KDE-Desktop at the bottom left on the symbol with character "K".

k_multimedia Click in the menu that opened at the entry "Multimedia".

k_graphics Click in the sub menu that opened at the entry "Graphics".

k_kooka Click in the sub menu that opened at the entry "Kooka (Scan & OCR Program)".

kooka_selectscanner Select in the "Welcome to Kooka" window the scanner you need and click on <OK>.
Remark: When you do not see this window, then the scanner is probably not recognized. Is this so, test the scanner as showed to you in Lesson 2.

Now you see the "Kooka" window:

Kooka-Window empty

Scanning an image and saving it as an image file

(1) Select at lower left which settings you want to use for the scan: Color/Gray/Black-White, resolution, contrast, etc. The possibilities that you see  depend on the type of scanner.

(2) Click bottom right at the <Preview Scan> button and await the result of the preview scan...

Kooka-Window Preview Top right in the "Scan Preview" pane,  a preview of the image is shown. This preview is meant to be used for the setting of the scan limits and helps to determine the settings of the Gamma, Illumination and Contrast.

(1) With the left mouse button pressed and drawing a selection rectangle, you can mark in the preview picture which region actually need to be scanned,

(2) Click on <Final Scan> and wait until the scanning is finished.

You are asked in the new opening "Kooka Save Assistent" window, in which format you want to store the image. Select a format (e.g. PNG) and click on  <OK>.

Kooka-Window Scan Top right, in the pane "Image Viewer" (and below it right) the scanned image is shown.

(1) With mouse clicks on the symbols on the tools list you can now crop the image, or mirror it, rotate it, etc.

(2) With a click on the menu item "File" and then on "Save Image" you can store the image in a file.

Beware: When you click in the "Kooka" window top right at Help and then on "Kooka Handbook" you - sorry - get an old version of the manual.

Ending Kooka

window_button_x_top_right You end Kooka by clicking on the "X" symbol in the window top right.

[Back to the beginning of the page]


Scanning with xscanimage

This program is described in Lesson 2 .

[Back to the beginning of the page]


Text recognition with gocr

Take care : Starting gocr from within the program Kooka using the tool list entries "OCR Image" or "OCR on Selection" does not work.

That's why you have to do this:

Scan your text with Kooka or xscanimage and save the text in an image file (e.g. in the file bookpage_1.ppm in the directory /home/knoppix).

k_konsole Click on the console icon at the bottom left in the control list.  This opens a console (command window), type here the following command:

"gocr /home/knoppix/bookpage_1.ppm > /home/knoppix/bookpage_1.txt"

The file  bookpage_1.txt is then translated from pixels into text and stored in the directory /home/knoppix under the same name.  You can read the contents of the file by using  Konqueror.

[Back to the beginning of the page]

[To the index]


This Document describes the Knoppix Version 3.4 CeBit 2004 Edition.
Copyright (c) 2004-04-19 Karl Schock. (Thanks to all that have contributed to this document - special thanks to Pieter Jonker for translation into english.)
You have permission to copy, distribute or change this document under the conditions of the GNU Free Documentation License.
THE USE OF THIS DOCUMENT IS AT YOUR OWN RISK. The author cannot be held responsible in any way for direct or indirect damage that may come forward from using this document.