ICDAR 2015 MultiSpectral Text Extraction Contest (MS-TEx 2015)
In recent years, Multispectral (MS) imaging becomes a very important tool for historical document analysis. This technique is widely known as a non-invasive method of investigation thanks to its simultaneous use of ultraviolet, infrared and visible light. It enables conservators and art historians to obtain valuable information on ancient documents without causing any physical damages to the materials, and makes it possible to reveal the newly overwritten text, to distinguish and recognize the chemical material composing the ink, to detect signs of degradation in historical documents. It can also help to extract information from cultural heritage patterns which cannot be extracted using conventional color photography. Extracting (segmenting) the original text (old writing) from MS document image is a very important step for subsequent document image analysis and investigation. In order to facilitate comparison of the results of different algorithms and track their progress over time towards a level of human performance, it would be of great interest to obtain a standard benchmark and accurate ground-truth with the most representative information about the targeted samples. To this end, we have generated a dataset of 21 MS document images for training purposes and another dataset of ten MS document images for testing purposes.
(1): Synchromedia laboratory for multimedia and communication in telepresence, ÉTS, University of Québec. 1100, Notre-Dame Street West, Montreal, Quebec H3C 1K3, Canada
(2): Laboratory of Remote Sensing, Department of Geography, McGill University, 805, Sherbrook Street West, Montreal, QC H3A 2K6, Canada
Database and Contest Rules
All the description of the contest and the dataset can be found here.
This is the link to the S-MS dataset for the training and test phases.
In order to register for the contest, please send an email to the organizers using this email.
Registration Date: March 20th, 2015
Submission Date: April 10th, 2015