research-article

Papy-S-Net: A Siamese Network to match papyrus fragments

Authors:

Antoine Pirrone,

Marie Beurton Aimar,

Nicholas JournetAuthors Info & Claims

HIP '19: Proceedings of the 5th International Workshop on Historical Document Imaging and Processing

Pages 78 - 83

https://doi.org/10.1145/3352631.3352646

Published: 20 September 2019 Publication History

Abstract

Like all heritage documents, papyri are the subject of an in-depth study by scientists. While large volumes of papyri have been digitized and indexed, many are still waiting to be so. It takes time to study a papyrus mainly because they are rarely available in one piece. Papyrologists must review a large number of fragments, find those that go together and then assemble them to finally analyze the text. Unfortunately, some fragments no longer exist. It is then a time consuming puzzle to solve, where not all the pieces are available and where fragments boundaries are not perfectly matching.AB@This article describes a method to help Papyrologists save time by helping them to solve this complex puzzle. We provide a solution where an expert use a fragment as a request element and get fragments that belong to the same papyrus. The main contribution is the proposal of a deep siamese network architecture, called Papy-S-Net for Papyrus-Siamese-Network, designed for papyri fragment matching. This network is trained and validated on 500 papyrus fragments approx. We compare the results of Papy-S-Net with a previous work of Koch et al. [14] which proposes a siamese network to match written symbols. In order to train and validate the network, we proceed to the extraction of patches from the papyrus fragments to create our ground truth. Papy-S-Net outperforms Koch et al.'s network. We also evaluate our approach on a real use case on which Papy-S-Net achieves 79% of correct matches.

References

[1]

Kian Ahrabian and Bagher BabaAli. 2017. On Usage of Autoencoders and Siamese Networks for Online Handwritten Signature Verification. CoRR abs/1712.02781 (2017). arXiv:1712.02781

[2]

Athina A Alexopoulou, Agathi-Anthoula Kaminari, Athanasios Panagopoulos, and Egert Pöhlmann. 2013. Multispectral documentation and image processing analysis of the papyrus of tomb II at Daphne, Greece. Journal of Archaeological Science 40, 2 (2013), 1242--1249.

[3]

Ahmed S Atallah, E Emary, and Mohamed S El-Mahallawy. 2015. A step toward speeding up cross-cut shredded document reconstruction. In 2015 Fifth International Conference on Communication Systems and Network Technologies. IEEE, 345--349.

[4]

Luca Bertinetto, Jack Valmadre, João F. Henriques, Andrea Vedaldi, and Philip H. S. Torr. 2016. Fully-Convolutional Siamese Networks for Object Tracking. CoRR abs/1606.09549 (2016). arXiv:1606.09549

[5]

J. Bromley, I. Guyon, Y. LeCun, and R. Shah E. Sackinger. 1994. Signature verification using a "siamese" time delay neural network. NIPS (1994), 737âĂŞ744.

[6]

Patrick Butler, Prithwish Chakraborty, and Naren Ramakrishan. 2012. The deshredder: A visual analytic approach to reconstructing shredded documents. In 2012 IEEE Conference on Visual Analytics Science and Technology (VAST). IEEE, 113--122.

Digital Library

[7]

Junhua Chen, Miao Tian, Xingming Qi, Wenxing Wang, and Youjun Liu. 2019. A Solution to Reconstruct Cross-Cut Shredded Text Documents Based on Constrained Seed K-means Algorithm and Ant Colony Algorithm. Expert Systems with Applications (2019).

[8]

Nikhil Bhagwat et al. 2018. Modeling and prediction of clinical symptom trajectories in AlzheimerâĂ&Zacute;s disease using longitudinal data. PLOS, Computational Biology 14, 9 (2018), 1--25.

[9]

Marco Gargano, Duilio Bertani, Marinella Greco, John Cupitt, Davide Gadia, and Alessandro Rizzi. 2015. A perceptual approach to the fusion of visible and NIR images in the examination of ancient documents. Journal of Cultural Heritage 16, 4 (2015), 518--525.

[10]

Tobias Grüning, Gundram Leifert, Tobias Strauß, and Roger Labahn. 2018. A two-stage method for text line detection in historical documents. arXiv preprint arXiv:1802.03345 (2018).

[11]

Edson Justino, Luiz S Oliveira, and Cinthia Freitas. 2006. Reconstructing shredded documents through feature matching. Forensic science international 160, 2-3 (2006), 140--147.

[12]

M. Kassis, J. Nassour, and J. El-Sana. 2017. Alignment of Historical Handwritten Manuscripts Using Siamese Neural Network. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Vol. 01. 293--298. https://doi.org/10.1109/ICDAR.2017.56

[13]

Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. (2014). http://arxiv.org/abs/1412.6980 cite arxiv:1412.6980Comment: Published as a conference paper at the 3rd International Conference for Learning Representations, San Diego, 2015.

[14]

Gregory R. Koch. 2015. Siamese Neural Networks for One-Shot Image Recognition.

[15]

Brenden M. Lake, Ruslan Salakhutdinov, and Joshua B. Tenenbaum. 2015. Humanlevel concept learning through probabilistic program induction. Science 350, 6266 (2015), 1332--1338. https://doi.org/10.1126/science.aab3050

[16]

Gil Levi, Pinhas Nisnevich, Adiel Ben-Shalom, Nachum Dershowitz, and Lior Wolf. 2018. A Method for Segmentation, Matching and Alignment of Dead Sea Scrolls. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 208--217.

[17]

Hairong Liu, Shengjiao Cao, and Shuicheng Yan. 2011. Automated assembly of shredded pieces from multiple photos. IEEE Transactions on Multimedia 13, 5 (2011), 1154--1162.

Digital Library

[18]

Fabian Richter, Christian X Ries, Nicolas Cebron, and Rainer Lienhart. 2012. Learning to reassemble shredded documents. IEEE Transactions on multimedia 15, 3 (2012), 582--593.

Digital Library

[19]

Fabian Richter, Christian X Ries, Stefan Romberg, and Rainer Lienhart. 2014. Partial contour matching for document pieces with content-based prior. In 2014 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6.

[20]

Dror Sholomon, Omid E David, and Nathan S Netanyahu. 2016. DNN-Buddies: A deep neural network-based estimation metric for the jigsaw puzzle problem. In International Conference on Artificial Neural Networks. Springer, 170--178.

[21]

Haoran Wu, Zhiyong Xu, Jianlin Zhang, Wei Yan, and Xiao Ma. 2017. Face Recognition based on Convolution Siamese Networks. In 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI 2017).

[22]

Sergey Zagoruyko and Nikos Komodakis. 2015. Learning to Compare Image Patches via Convolutional Neural Networks. CoRR abs/1504.03641 (2015). arXiv:1504.03641

[23]

Z. Zhong, W. Pan, L. Jin, H. Mouchere, and C. Viard-Gaudin. 2016. SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). 295--300. https://doi.org/10.1109/ICFHR.2016.0063

Cited By

Chiș R(2024)Matching Apictorial Puzzle Pieces Using Deep LearningStudia Universitatis Babeș-Bolyai Informatica10.24193/subbi.2024.1.0169:1(5-20)Online publication date: 10-Jun-2024
https://doi.org/10.24193/subbi.2024.1.01
Han YZhu SZhang YLi HHe BZhang SDou HYan JLiu ZZhang YWu F(2024)DHelper: A Collaborative Toolkit for Manuscript RestorationProceedings of the 6th ACM International Conference on Multimedia in Asia Workshops10.1145/3700410.3702125(1-7)Online publication date: 3-Dec-2024
https://dl.acm.org/doi/10.1145/3700410.3702125
Zhang YLi HZhang SWang RHe BDou HYan JZhang YWu F(2024)LLMCO4MR: LLMs-Aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on DunhuangComputer Vision – ECCV 202410.1007/978-3-031-73226-3_15(253-269)Online publication date: 1-Nov-2024
https://doi.org/10.1007/978-3-031-73226-3_15
Show More Cited By

Index Terms

Papy-S-Net: A Siamese Network to match papyrus fragments
1. Applied computing
  1. Document management and text processing
    1. Document capture
      1. Document analysis
2. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Method of tire pattern image retrieval based on wavelet transform and Siamese network
ICASIT 2020: Proceedings of the 2020 International Conference on Aviation Safety and Information Technology

Image retrieval of tire tread pattern plays a vital role in traffic accidents and criminal investigation. We propose a tire pattern retrieval method based on wavelet transform and Siamese network, which combine traditional techniques with deep learning. ...
Apple leaf disease recognition method based on Siamese dilated Inception network with less training samples
Highlights
- Dilated Inception convolution module is introduced into AlexNet to construct two subnetworks of Siamese network.
- SDINet is exploited to improve apple disease recognition results using a small number of training images.
- A large ...
Abstract
The application of existing deep learning networks may prove difficult when datasets are small. Siamese network can achieve better accuracy on small dataset. A recognition method of apple leaf disease based on Siamese dilated Inception network (...
Relation-aware Siamese region proposal network for visual object tracking
Abstract
The backbone networks used in Siamese trackers are relatively shallow, such as AlexNet and VGGNet, resulting in insufficient features for tracking task. Therefore, this paper focuses on extracting more discriminative features to improve the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

HIP '19: Proceedings of the 5th International Workshop on Historical Document Imaging and Processing

September 2019

98 pages

ISBN:9781450376686

DOI:10.1145/3352631

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

FamilySearch: FamilySearch

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 September 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

HIP '19

HIP '19: The 5th International Workshop on Historical Document Imaging and Processing

September 20 - 21, 2019

NSW, Sydney, Australia

Acceptance Rates

HIP '19 Paper Acceptance Rate 15 of 26 submissions, 58%;

Overall Acceptance Rate 52 of 90 submissions, 58%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

15
Total Citations
View Citations
176
Total Downloads

Downloads (Last 12 months)18
Downloads (Last 6 weeks)2

Reflects downloads up to 29 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Chiș R(2024)Matching Apictorial Puzzle Pieces Using Deep LearningStudia Universitatis Babeș-Bolyai Informatica10.24193/subbi.2024.1.0169:1(5-20)Online publication date: 10-Jun-2024
https://doi.org/10.24193/subbi.2024.1.01
Han YZhu SZhang YLi HHe BZhang SDou HYan JLiu ZZhang YWu F(2024)DHelper: A Collaborative Toolkit for Manuscript RestorationProceedings of the 6th ACM International Conference on Multimedia in Asia Workshops10.1145/3700410.3702125(1-7)Online publication date: 3-Dec-2024
https://dl.acm.org/doi/10.1145/3700410.3702125
Zhang YLi HZhang SWang RHe BDou HYan JZhang YWu F(2024)LLMCO4MR: LLMs-Aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on DunhuangComputer Vision – ECCV 202410.1007/978-3-031-73226-3_15(253-269)Online publication date: 1-Nov-2024
https://doi.org/10.1007/978-3-031-73226-3_15
De Gregorio GPerrin SPena RMarthot-Santaniello IMouchère H(2024)NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri RetrievalDocument Analysis and Recognition – ICDAR 2024 Workshops10.1007/978-3-031-70642-4_5(71-86)Online publication date: 30-Aug-2024
https://dl.acm.org/doi/10.1007/978-3-031-70642-4_5
Gattal ADjeddi CAbbas FSiddiqi IBouderah B(2023)A new method for writer identification based on historical documentsJournal of Intelligent Systems10.1515/jisys-2022-024432:1Online publication date: 24-Feb-2023
https://doi.org/10.1515/jisys-2022-0244
Gigilashvili DLukesova HGulbrandsen CHarijan AHardeberg J(2023)Computational techniques for virtual reconstruction of fragmented archaeological textilesHeritage Science10.1186/s40494-023-01102-311:1Online publication date: 13-Dec-2023
https://doi.org/10.1186/s40494-023-01102-3
Sommerschield TAssael YPavlopoulos JStefanak VSenior ADyer CBodel JPrag JAndroutsopoulos Ide Freitas N(2023)Machine Learning for Ancient Languages: A SurveyComputational Linguistics10.1162/coli_a_0048149:3(703-747)Online publication date: 1-Sep-2023
https://doi.org/10.1162/coli_a_00481
Vocaturo EZumpano E(2023)Assembling Fragments of Ancient Papyrus via Artificial IntelligencePervasive Knowledge and Collective Intelligence on Web and Social Media10.1007/978-3-031-31469-8_1(3-13)Online publication date: 28-Apr-2023
https://doi.org/10.1007/978-3-031-31469-8_1
Lastilla LAmmirati SFirmani DKomodakis NMerialdo PScardapane S(2022)Self-supervised learning for medieval handwriting identification: A case study from the Vatican Apostolic LibraryInformation Processing & Management10.1016/j.ipm.2022.10287559:3(102875)Online publication date: May-2022
https://doi.org/10.1016/j.ipm.2022.102875
Lu YZhou JMcDorman SZhang CScott DBukuts JWilder CSmith KWang S(2022)Snowvision: Segmenting, Identifying, and Discovering Stamped Curve Patterns from Fragments of PotteryInternational Journal of Computer Vision10.1007/s11263-022-01669-7130:11(2707-2732)Online publication date: 27-Aug-2022
https://doi.org/10.1007/s11263-022-01669-7
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten