Latest Publications
Visual language processing (VLP) of ancient manuscripts: Converting collections to windows on the past
Keywords: Adaptation models, ancient manuscript, computational pattern analysis, cultural heritage, data mining, data-driven mining, Degradation, directed graphical model, Feature extraction, feature vector extraction, graph-based representation, Hidden Markov models, history, HMM, image document restoration, image restoration, natural language processing, social network, […]
W-TSV: Weighted topological signature vector for lexicon reduction in handwritten Arabic documents
Abstract
This paper proposes a holistic lexicon-reduction method for ancient and modern handwritten Arabic documents. The word shape is represented by the weighted topological signature vector (W-TSV), which encodes graph data into a low-dimensional vector […]
Sparse Descriptor for Lexicon Reduction in Handwritten Arabic Documents
Abstract Arabic words have a rich structure. They are made of subwords (groups of connected letters) and diacritical marks (dots). This paper proposes a sparse descriptor specifically designed for lexicon reduction in handwritten Arabic documents. The topological and geometrical features […]
A Robust Word Spotting System for Historical Arabic Manuscripts
Abstract A novel system for word spotting in old Arabic manuscripts is developed. The system has a complete chain of operations and consists of three major steps: pre-processing, data preparation, and word spotting. In the pre-processing step, using multi-level classifiers, […]











