skip to main content
10.1145/3352631.3352646acmotherconferencesArticle/Chapter ViewAbstractPublication PageshipConference Proceedingsconference-collections
research-article

Papy-S-Net: A Siamese Network to match papyrus fragments

Published: 20 September 2019 Publication History

Abstract

Like all heritage documents, papyri are the subject of an in-depth study by scientists. While large volumes of papyri have been digitized and indexed, many are still waiting to be so. It takes time to study a papyrus mainly because they are rarely available in one piece. Papyrologists must review a large number of fragments, find those that go together and then assemble them to finally analyze the text. Unfortunately, some fragments no longer exist. It is then a time consuming puzzle to solve, where not all the pieces are available and where fragments boundaries are not perfectly matching.AB@This article describes a method to help Papyrologists save time by helping them to solve this complex puzzle. We provide a solution where an expert use a fragment as a request element and get fragments that belong to the same papyrus. The main contribution is the proposal of a deep siamese network architecture, called Papy-S-Net for Papyrus-Siamese-Network, designed for papyri fragment matching. This network is trained and validated on 500 papyrus fragments approx. We compare the results of Papy-S-Net with a previous work of Koch et al. [14] which proposes a siamese network to match written symbols. In order to train and validate the network, we proceed to the extraction of patches from the papyrus fragments to create our ground truth. Papy-S-Net outperforms Koch et al.'s network. We also evaluate our approach on a real use case on which Papy-S-Net achieves 79% of correct matches.

References

[1]
Kian Ahrabian and Bagher BabaAli. 2017. On Usage of Autoencoders and Siamese Networks for Online Handwritten Signature Verification. CoRR abs/1712.02781 (2017). arXiv:1712.02781
[2]
Athina A Alexopoulou, Agathi-Anthoula Kaminari, Athanasios Panagopoulos, and Egert Pöhlmann. 2013. Multispectral documentation and image processing analysis of the papyrus of tomb II at Daphne, Greece. Journal of Archaeological Science 40, 2 (2013), 1242--1249.
[3]
Ahmed S Atallah, E Emary, and Mohamed S El-Mahallawy. 2015. A step toward speeding up cross-cut shredded document reconstruction. In 2015 Fifth International Conference on Communication Systems and Network Technologies. IEEE, 345--349.
[4]
Luca Bertinetto, Jack Valmadre, João F. Henriques, Andrea Vedaldi, and Philip H. S. Torr. 2016. Fully-Convolutional Siamese Networks for Object Tracking. CoRR abs/1606.09549 (2016). arXiv:1606.09549
[5]
J. Bromley, I. Guyon, Y. LeCun, and R. Shah E. Sackinger. 1994. Signature verification using a "siamese" time delay neural network. NIPS (1994), 737âĂŞ744.
[6]
Patrick Butler, Prithwish Chakraborty, and Naren Ramakrishan. 2012. The deshredder: A visual analytic approach to reconstructing shredded documents. In 2012 IEEE Conference on Visual Analytics Science and Technology (VAST). IEEE, 113--122.
[7]
Junhua Chen, Miao Tian, Xingming Qi, Wenxing Wang, and Youjun Liu. 2019. A Solution to Reconstruct Cross-Cut Shredded Text Documents Based on Constrained Seed K-means Algorithm and Ant Colony Algorithm. Expert Systems with Applications (2019).
[8]
Nikhil Bhagwat et al. 2018. Modeling and prediction of clinical symptom trajectories in AlzheimerâĂŹs disease using longitudinal data. PLOS, Computational Biology 14, 9 (2018), 1--25.
[9]
Marco Gargano, Duilio Bertani, Marinella Greco, John Cupitt, Davide Gadia, and Alessandro Rizzi. 2015. A perceptual approach to the fusion of visible and NIR images in the examination of ancient documents. Journal of Cultural Heritage 16, 4 (2015), 518--525.
[10]
Tobias Grüning, Gundram Leifert, Tobias Strauß, and Roger Labahn. 2018. A two-stage method for text line detection in historical documents. arXiv preprint arXiv:1802.03345 (2018).
[11]
Edson Justino, Luiz S Oliveira, and Cinthia Freitas. 2006. Reconstructing shredded documents through feature matching. Forensic science international 160, 2-3 (2006), 140--147.
[12]
M. Kassis, J. Nassour, and J. El-Sana. 2017. Alignment of Historical Handwritten Manuscripts Using Siamese Neural Network. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Vol. 01. 293--298. https://doi.org/10.1109/ICDAR.2017.56
[13]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. (2014). http://arxiv.org/abs/1412.6980 cite arxiv:1412.6980Comment: Published as a conference paper at the 3rd International Conference for Learning Representations, San Diego, 2015.
[14]
Gregory R. Koch. 2015. Siamese Neural Networks for One-Shot Image Recognition.
[15]
Brenden M. Lake, Ruslan Salakhutdinov, and Joshua B. Tenenbaum. 2015. Humanlevel concept learning through probabilistic program induction. Science 350, 6266 (2015), 1332--1338. https://doi.org/10.1126/science.aab3050
[16]
Gil Levi, Pinhas Nisnevich, Adiel Ben-Shalom, Nachum Dershowitz, and Lior Wolf. 2018. A Method for Segmentation, Matching and Alignment of Dead Sea Scrolls. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 208--217.
[17]
Hairong Liu, Shengjiao Cao, and Shuicheng Yan. 2011. Automated assembly of shredded pieces from multiple photos. IEEE Transactions on Multimedia 13, 5 (2011), 1154--1162.
[18]
Fabian Richter, Christian X Ries, Nicolas Cebron, and Rainer Lienhart. 2012. Learning to reassemble shredded documents. IEEE Transactions on multimedia 15, 3 (2012), 582--593.
[19]
Fabian Richter, Christian X Ries, Stefan Romberg, and Rainer Lienhart. 2014. Partial contour matching for document pieces with content-based prior. In 2014 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6.
[20]
Dror Sholomon, Omid E David, and Nathan S Netanyahu. 2016. DNN-Buddies: A deep neural network-based estimation metric for the jigsaw puzzle problem. In International Conference on Artificial Neural Networks. Springer, 170--178.
[21]
Haoran Wu, Zhiyong Xu, Jianlin Zhang, Wei Yan, and Xiao Ma. 2017. Face Recognition based on Convolution Siamese Networks. In 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI 2017).
[22]
Sergey Zagoruyko and Nikos Komodakis. 2015. Learning to Compare Image Patches via Convolutional Neural Networks. CoRR abs/1504.03641 (2015). arXiv:1504.03641
[23]
Z. Zhong, W. Pan, L. Jin, H. Mouchere, and C. Viard-Gaudin. 2016. SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). 295--300. https://doi.org/10.1109/ICFHR.2016.0063

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
HIP '19: Proceedings of the 5th International Workshop on Historical Document Imaging and Processing
September 2019
98 pages
ISBN:9781450376686
DOI:10.1145/3352631
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

  • FamilySearch: FamilySearch

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 September 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep Learning
  2. Historical documents
  3. Siamese network

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HIP '19

Acceptance Rates

HIP '19 Paper Acceptance Rate 15 of 26 submissions, 58%;
Overall Acceptance Rate 52 of 90 submissions, 58%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)2
Reflects downloads up to 29 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media