Tesseract
FineReader
XML
Europeana
HOCR
Microsoft Office Document Imaging
Apache Ant
Microsoft Word
Software
Optical character recognition
Computing