Revolutionizing Historical Manuscript Analysis: A Deep Learning Approach with Intelligent Feature Extraction for Script Classification

doi:10.18267/j.aip.239

Acta Informatica Pragensia 2024, 13(2), 251-272 | DOI: 10.18267/j.aip.2392716

Revolutionizing Historical Manuscript Analysis: A Deep Learning Approach with Intelligent Feature Extraction for Script Classification

Merouane Boudraa ORCID...¹, Akram Bennour ORCID...¹, Tahar Mekhaznia ORCID...¹, Abdulrahman Alqarafi², Rashiq Rafiq Marie², Mohammed Al-Sarem ORCID...², Ayush Dogra ORCID...³: ¹ Laboratory of Mathematics, Informatics and Systems, Larbi Tebessi University – Tebessa, Algeria; ² College of Computer Science and Engineering, Taibah University, Madinah, Saudi Arabia; ³ Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India

The automated classification of historical document scripts holds profound implications for historians, providing unprecedented insights into the contexts of ancient manuscripts. This study introduces a robust deep learning system integrating an intelligent feature selection method, elevating the script classification process. Our methodology, applied to the CLaMM dataset, involves preprocessing steps such as advanced denoising through non-local means and binarization using Canny edge detection. These steps, pivotal for image cleaning and segmentation, set the stage for subsequent in-depth analysis. To enhance feature detection, we employ the Harris corner detector, followed by a k-means clustering process to eliminate redundancy and outliers. This process facilitates the extraction of consistently sized patches, capturing distinctive features of various scripts in historical manuscripts. The dataset undergoes rigorous training using precise convolutional neural network (CNN) models, empowering our system to discern intricate patterns and features for informed decision-making during the classification process. Ultimately, for the definitive script classification of an entire document, we employ a majority voting mechanism on the patches. The results highlight the effectiveness of this comprehensive approach, with the system achieving an impressive accuracy rate of 89.2%. This underscores the system proficiency in accurately classifying historical document scripts, offering a reliable and efficient solution for historians and researchers. The robustness of our methodology positions it as a compelling tool for meticulous analysis of historical manuscripts, contributing significantly to the field of historical document research and preservation.

Keywords: Script classification; Historical manuscripts; Intelligent features; Feature extraction; CNNs; Transfer learning.

Received: February 5, 2024; Revised: May 30, 2024; Accepted: June 3, 2024; Prepublished online: July 22, 2024; Published: August 4, 2024 Show citation

Boudraa, M., Bennour, A., Mekhaznia, T., Alqarafi, A., Marie, R.R., Al-Sarem, M., & Dogra, A. (2024). Revolutionizing Historical Manuscript Analysis: A Deep Learning Approach with Intelligent Feature Extraction for Script Classification. Acta Informatica Pragensia, 13(2), 251-272. doi: 10.18267/j.aip.239

Download citation

References

Arandjelovic, R., & Zisserman, A. (2012). Three things everyone should know to improve object retrieval. In 2012 IEEE conference on computer vision and pattern recognition (pp. 2911-2918). IEEE. https://doi.org/10.1109/cvpr.2012.6248018 Go to original source...
Bennour, A., Boudraa, M., Siddiqi, I., Al-Sarem, M., Al-Shabi, M., & Ghabban, F. (2024). A deep learning framework for historical manuscripts writer identification using data-driven features. Multimedia Tools and Applications, (in press). https://doi.org/10.1007/s11042-024-18187-y Go to original source...
Bennour, A., Djeddi, C., Gattal, A., Siddiqi, I., & Mekhaznia, T. (2019). Handwriting based writer recognition using implicit shape codebook. Forensic Science International, 301, 91-100. https://doi.org/10.1016/j.forsciint.2019.05.014 Go to original source...
Boudraa, M., Bennour, A., Al-Sarem, M., Ghabban, F., & Bakhsh, O. A. (2024). Contribution to Historical Manuscript Dating: A Hybrid Approach Employing Hand-Crafted Features with Vision Transformers. Digital Signal Processing, 149, 104477. https://doi.org/10.1016/j.dsp.2024.104477 Go to original source...
Boudraa, M., & Bennour, A. (2023). Combination of local features and deep learning to historical manuscripts dating. In Intelligent Systems and Pattern Recognition - ISPR 2023, (pp. 129-143). Springer. https://doi.org/10.1007/978-3-031-46335-8_11 Go to original source...
Buades, A., Coll, B., & Morel, J. (2011). Non-Local means denoising. Image Processing Online, 1, 208-212. https://doi.org/10.5201/ipol.2011.bcm_nlm Go to original source...
Canny, J. (1986). A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(6), 679-698. https://doi.org/10.1109/tpami.1986.4767851 Go to original source...
Christlein, V., Bernecker, D., & Angelopoulou, E. (2015). Writer identification using VLAD encoded contour-Zernike moments. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (pp. 906-910). IEEE. https://doi.org/10.1109/ICDAR.2015.7333893 Go to original source...
Christlein, V., Bernecker, D., Hönig, F., Maier, A., & Angelopoulou, E. (2017). Writer identification using GMM Supervectors and Exemplar-SVMs. Pattern Recognition, 63, 258-267. https://doi.org/10.1016/j.patcog.2016.10.005 Go to original source...
Cloppet, F., Eglin, V., Kieu, V.C., Stutzmann, D., & Vincent, N. (2016). ICFHR2016 Competition on the classification of medieval handwritings in latin script. In Proceedings of the International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE. https://doi.org/10.1109/ICFHR.2016.0113 Go to original source...
Cloppet, F., Eglin, V., Helias-Baron, M., Kieu, C., Vincent, N., & Stutzmann, D. (2017). ICDAR2017 Competition on the Classification of Medieval Handwritings in Latin Script. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), (pp. 1371-1376). IEEE. https://doi.org/10.1109/ICDAR.2017.224 Go to original source...
Dehak, N., Torres-Carrasquillo, P. A., Reynolds, D. A., & Dehak, R. (2011). Language recognition via i-Vectors and dimensionality reduction. In Interspeech 2011, (pp. 857-860). ISCA. Go to original source...
Demilew, F. A., & Sekeroglu, B. (2019). Ancient Geez script recognition using deep learning. SN Applied Sciences, 1(11). https://doi.org/10.1007/s42452-019-1340-4 Go to original source...
El Bahi, H., & Zatni, A. (2019). Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network. Multimedia Tools and Applications, 78, 26453-26481. https://doi.org/10.1007/s11042-019-07855-z Go to original source...
Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., & Stolz, M. (2010). Ground truth creation for handwriting recognition in historical documents. In Proceedings of the 9th IAPR International Workshop on Document Analysis Systems (pp. 3-10). ACM. https://doi.org/10.1145/1815330.1815331 Go to original source...
Harris, C., & Stephens, M. (1988). A combined corner and edge detector. In Alvey vision conference (pp. 147-152). BMVA. https://doi.org/10.5244/C.2.23 Go to original source...
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (pp. 770-778). IEEE. https://doi.org/10.1109/cvpr.2016.90 Go to original source...
Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely Connected Convolutional Networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (pp. 2261-2269). IEEE. https://doi.org/10.1109/cvpr.2017.243 Go to original source...
Jin, X., & Han, J. (2011). K-Means Clustering. In Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer. https://doi.org/10.1007/978-1-4899-7687-1_431 Go to original source...
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436-444. https://doi.org/10.1038/nature14539 Go to original source...
Lombardi, F., & Marinai, S. (2020). Deep Learning for Historical Document Analysis and Recognition-A survey. Journal of Imaging, 6(10), 110. https://doi.org/10.3390/jimaging6100110 Go to original source...
Lowe, D. G. (2004). Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60(2), 91-110. https://doi.org/10.1023/b:visi.0000029664.99615.94 Go to original source...
Meraoumia, A., Bendjenna, H., Amroune, M., & Dris, Y. (2018). Towards a Secure Online E-voting Protocol Based on Palmprint Features. In 2018 3rd International Conference on Pattern Analysis and Intelligent Systems. IEEE. https://doi.org/10.1109/pais.2018.8598520 Go to original source...
Paris, S., Kornprobst, P., Tumblin, J., & Durand, F. (2009). Bilateral Filtering: Theory and applications. Foundations and Trends in Computer Graphics and Vision, 4(1), 1-75. https://doi.org/10.1561/0600000020 Go to original source...
Parkes, M. B. (2016). Pause and effect: An introduction to the history of punctuation in the West. Routledge. Go to original source...
Peake, G. S., & Tan, T. N. (1997). Script and language identification from document images. In Proceedings Workshop on Document Image Analysis (DIA'97), (pp. 10-17). IEEE. https://doi.org/10.1109/DIA.1997.627086 Go to original source...
Rosten, E., Porter, R., & Drummond, T. (2010). Faster and Better: A machine learning approach to corner detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(1), 105-119. https://doi.org/10.1109/tpami.2008.275 Go to original source...
Samai, D., Meraoumia, A., Bendjenna, H., & Laimeche, L. (2017). Oriented Local Binary Pattern (LBPθ): A new scheme for an efficient feature extraction technique. In International Conference on Mathematics and Information Technology (ICMIT). IEEE. https://doi.org/10.1109/MATHIT.2017.8259710 Go to original source...
Seuret, M., Nicolaou, A., Rodríguez-Salas, D., Weichselbaumer, N., Stutzmann, D., Mayr, M., Maier, A., & Christlein, V. (2021). ICDAR 2021 Competition on Historical Document Classification. In Document Analysis and Recognition - ICDAR 2021 (pp. 618-634). Springer. https://doi.org/10.1007/978-3-030-86337-1_41 Go to original source...
Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. In 3rd International Conference on Learning Representations (ICLR 2015). ICLR.
Singh, A. K., Mishra, A., Dabral, P., & Jawahar, C. V. (2016). A simple and effective solution for script identification in the wild. In 2016 12th IAPR Workshop on Document Analysis Systems (DAS), (pp. 428-433). IEEE. https://doi.org/10.1109/DAS.2016.57 Go to original source...
Stutzmann, D. (2016). Clustering of medieval scripts through computer image analysis: Towards an evaluation protocol. Digital Medievalist, 10. https://doi.org/10.16995/dm.61 Go to original source...
Tan, T. (1998). Rotation invariant texture features and their use in automatic script identification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(7), 751-756. https://doi.org/10.1109/34.689305 Go to original source...

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.

Return to the content