Optical Character Recognition
The best package that I have used is from a company called ABBYY. Their package called FineReader comes with the ability to scan in a large number of languages. The latest version can scan into pdf files. I have used the package with some success to scan languages that do not have a latin script. You can train the package to recognise new symbols and even symbols that are very close together. The work I carried out was with the old Irish script and the old German script called Fraktur. You can train the package on a sample page then let it of by itself to do the OCR.