Facebook Instagram Twitter RSS Feed PodBean Back to top on side

Automated transcription of historical encrypted manuscripts

In: Tatra Mountains Mathematical Publications, vol. 82, no. 2
Eugen Antal - Pavol Marák

Details:

Year, pages: 2023, 65 - 86
Language: eng
Keywords:
historical ciphers, nomenclator, manuscript, transcription, machine learning, deep convolutional neural networks, Mask R-CNN
Article type: Mathematics
Document type: scientific paper, pdf
About article:
This paper deals with historical encrypted manuscripts and introduces an automated method for the detection and transcription of ciphertext symbols for subsequent cryptanalysis. Our database contains documents used in the past by aristocratic families living in the territory of Slovakia. They are encrypted using a nomenclator which is a specific type of substitution cipher. In our case, the nomenclator uses digits as ciphertext symbols. We have proposed a method for the detection, classification, and transcription of handwritten digits from the original documents. Our method is based on Mask R-CNN which is a deep convolutional neural network for instance segmentation. Mask R-CNN was trained on a manually collected database of digit annotations. We employ a specific strategy where the input image is first divided into small blocks. The image blocks are then passed to Mask R-CNN to obtain detections. This way we avoid problems related to the detection of a large number of small dense objects in a high-resolution image. Experiments have shown promising detection performance for all digit types with minimum false detections.
How to cite:
ISO 690:
Antal, E., Marák, P. 2023. Automated transcription of historical encrypted manuscripts. In Tatra Mountains Mathematical Publications, vol. 82, no.2, pp. 65-86. 1210-3195. DOI: https://doi.org/10.2478/tmmp-2022-0019

APA:
Antal, E., Marák, P. (2023). Automated transcription of historical encrypted manuscripts. Tatra Mountains Mathematical Publications, 82(2), 65-86. 1210-3195. DOI: https://doi.org/10.2478/tmmp-2022-0019
About edition:
Publisher: Mathematical Institute, Slovak Academy of Sciences, Bratislava
Published: 20. 1. 2023
Rights:
The Creative Commons Attribution-NC-ND 4.0 International Public License