The talk will cover the background issues, challenges and opportunities in image processing and analysis of historical documents in the context of large-scale digitisation initiatives. It starts by examining the different factors that influence technical decisions in document digitisation. The types of documents typically encountered are discussed next, along with the challenges and possibilities they offer for digitisation and full-text conversion. Focussing on the needs and expectations of major libraries, the talk then examines the different stages in full-text conversion (image acquisition, enhancement, segmentation, OCR and post-processing), along with the corresponding challenges and possibilities for improvement. Major past and current initiatives for the processing, analysis and recognition of historical documents are also mentioned.
The seminar will be given in Polhemsalen at the Ångström Laboratory.