Keywords
3D shape matching and retrieval, content-based shape engine, archaeological shape recognition
This article is included in the New Digital Archaeologies collection.
As cultural institutions embark on projects to digitise art and archaeological collections in three dimensions, the need to develop effective means of accessing the resulting 3D models has become imperative. Shape recognition techniques developed in the field of computer vision can help in this task.
This paper describes the implementation of three shape descriptors, specifically shape distributions, reflective symmetry and spherical harmonics, as part of the development of a search engine that retrieves 3D models from an archaeological database without the need to use keywords as query criteria.
The usefulness of this system is obvious in the context of cultural heritage museums, where it is essential to provide automatic access to archaeological and art collections. The prototype described in this paper uses, as a case study, 3D models of archaeological objects belonging to the Museo del Templo Mayor, a Mexican institution that preserves one of the largest collections of Aztec cultural heritage.
This work is part of an ongoing project focused on creating generic methodologies and user-friendly computational tools for shape analysis for the benefit of scholars and students interested in describing, interpreting and disseminating new knowledge about the morphology of cultural objects.
This revised version takes into account the recommendations of the reviewers by briefly describing digitisation techniques (i.e. Photo-modelling and Structure from Motion) and discussing Semantic Web advances aimed at facilitating queries of cultural heritage multimedia information. In particular, references on the use of ontologies such as CIDOC-CRM, as well as on segmentation and annotation techniques, are included in the introduction. We also included references on deep-learning techniques for shape analysis.
Additionally, we included more details and a new figure illustrating the architecture of the MeshAnalizer module, as well as a step-by-step explanation of how the software works during a matching and retrieval operation. This explanation is supported by new figures that illustrate the different windows that constitute the MeshAnalizer interface.
We believe that the paper provides enough detail about the methods and techniques used to implement the three shape recognition algorithms. Because the implementation of the whole system is still a work in progress and likely to change over time, readers who want more details - or who want to replicate the software - are better served by commentaries on functionality embedded directly in the source code. The code is available in the GitHub repository, from which anyone can download it to compile it as is, adapt it, or reuse it freely for their own purposes. Documenting the software in this manner guarantees that, as the module changes, users can always find an explanation of the latest progress.
See the authors' detailed response to the review by Edgar Roman-Rangel
See the authors' detailed response to the review by Federica Maietti
Around the world, many professionals face the challenge of disseminating information about cultural heritage collections in such a way that objects can be known and studied anywhere in the world, preferably without the need for physical contact, so as to guarantee their long-term preservation (Ekengren et al., 2021). To achieve that goal, cultural institutions have embarked on ambitious 3D digitisation projects and researchers have been looking for better means to improve access to the resulting 3D models (Clark et al., 2002; Ekengren et al., 2021).
Digitisation has indeed been very successful thanks to the rapid evolution of photogrammetry and laser scanning, which make it possible to model the surface of objects with little effort and in a relatively short period of time (Pieraccini et al., 2001). Photo-modelling, for example, uses principles of projective geometry to calculate 3D coordinates from overlapping areas of two or more photographs taken from different perspectives. Points representing an object's features in one image are matched with homologous points in other images. This allows the acquisition of a point cloud that represents the surface of the object together with its external appearance (texture). In this way, the technique integrates surveying, modelling and representation into a single workflow (Bianchini et al., 2015). Structure from Motion (SfM), another common technique, also detects matching features in overlapping images to create a point cloud, but, as its name suggests, the images are acquired using one or several sensors moving around the target object. While photo-modelling is typically applied to objects, SfM is more common for modelling architectural structures or for creating digital elevation models (DEMs), in which case Unmanned Aerial Vehicles (UAVs) are frequently used to acquire images from videos. SfM has also been used to digitise historical maps and documents (Brandolini & Patrucco, 2019).
The continued adoption of these techniques has generated thousands, if not millions, of 3D digital models valuable for research and conservation. The geometric and morphological analysis of such models, for example, is now common in the cultural heritage field, as the bibliographic survey by Pintus et al. (2016) demonstrates.
Unfortunately, the search for better means to access collections has not achieved the same level of success. Many times, digital models are produced and then stored in databases without implementing appropriate means to retrieve the 3D information (Ekengren et al., 2021; Koller et al., 2009). In the case of entity-relationship databases, the simplest way to locate models consists of formulating queries by using keywords that describe the objects’ features. During a search operation, the system is instructed to retrieve all 3D models corresponding, for example, to “tripod vessels” or “anthropomorphic figures”. However, this strategy works only if the categories used in the query coincide with those defined for that particular repository. For instance, if the term bowl does not exist in the database thesaurus, the search engine won’t find vessels that are similar but have been registered with another name. Another limiting factor is the language in which the objects are described, because the system might recognize “bowl”, but not terrine (French), cuenco or cajete (Spanish). Additional problems may arise if the most relevant keywords to describe an object are unknown at the time of cataloguing the objects, or if important keywords to identify the objects are unknown to the final users of the system.
Of course, some of these limitations can be overcome by developing multilingual ontologies, an effort that implies agreement, among many expert scholars, on the categories and concepts relevant to describing an object collection (Almeida & Costa, 2021; Benjamins et al., 2004; Uschold & Grüninger, 1996). This has been a research subject in the fields of the Semantic Web and Linked Open Data. One of the most notable results is the development of CIDOC-CRM, a multilingual conceptual reference model specifically developed for the galleries, libraries, archives and museums (GLAM) sector. This has become a standard model that many institutions extend and adapt to facilitate the description and exchange of information among their repositories. Within this environment, the exchange of multimedia information is based on the implementation of web service protocols that make the contents and semantics of the data sources machine-operable and interoperable, although no standard exists yet in this regard (Bikakis et al., 2021; Crofts et al., 2011). A system that uses the CIDOC-CRM ontology is SCULPTEUR, developed for searching and retrieving digital images, 3D models, and free-text documents using a combination of content-based examples, textual metadata and ontological concepts. Query operations are supported by a web service protocol called Z39.50, which allows remote applications to access multimedia data from several museums (Addis et al., 2003, 2005; Goodall et al., 2004; Sinclair et al., 2024).
These advances have brought benefits such as the possibility of making data interactive, integrated, and contextualized, of recording provenance and facilitating logical inferencing, and of achieving Web persistence, machine readability, and content repurposing (Isaksen, 2011:10). Furthermore, semantic web and linked data technologies also allow customizing the retrieval of information based on user profiles and preferences, which results in personalised experiences (Bikakis et al., 2021).
However, to make these benefits a reality it is also necessary to develop an annotation system that links the ontology to corresponding parts of objects (Benjamins et al., 2004). This in turn involves implementing segmentation tools for tagging features in images or on 3D surfaces that correspond to meaningful elements of cultural heritage entities.
Segmentation and annotation have been major challenges for some time (Benjamins & Fensel, 1998; Hayes-Roth et al., 1983) and still are, especially when dealing with 3D models. These tasks can be performed either through manual procedures - a time-consuming effort - or with semi-automatic tools (Grilli & Remondino, 2019). Some successful projects have been reported in the field of architecture (Croce et al., 2021; Grilli et al., 2019; Grilli & Remondino, 2019; Roussel & De Luca, 2023; Teruggi et al., 2020). For example, manual segmentation and annotation through interactive software has been developed for managing the thousands of condition reports, architectural descriptions, and chemical and physical analyses generated during the restoration of Notre Dame Cathedral (Roussel & De Luca, 2023). The system operates through a platform called Aïoli that allows tracing the contour of an architectural feature (column, floor, window, etc.) on an image to establish a link between the 2D and 3D representations of the target section. In this and other applications, queries are still based on textual input (Roussel & De Luca, 2023). Another example is the system developed by Croce et al. (2021), in which a training set of shape features is manually labelled and a Random Forest Classifier (RFC) is then applied to semantically annotate the different parts of a 3D model of the Grand Ducal Cloister in Pisa. Another example using an RFC is the multi-level, multi-resolution approach to classifying 3D point clouds from Milan Cathedral and Pomposa Abbey in Ferrara (Teruggi et al., 2020).
As for semi-automatic or automatic segmentation, recent advances in deep-learning methods are promising, but no ideal solution exists yet (see surveys of techniques in Attene et al., 2006; Croce et al., 2020; García-García et al., 2017; Grilli et al., 2017; Shamir, 2006). Deep-learning algorithms, however, focus on segmenting objects according to their geometric properties, regardless of whether these have a semantic meaning for the final user (Attene et al., 2009). Cultural heritage objects are particularly challenging for this kind of approach because, contrary to architecture, in which the parts of a building (columns, porticos, doors, windows, etc.) have fairly standard structures, artefacts and objects in general present an enormous diversity of features, which complicates the recognition task. The techniques also have to account for the fuzziness of feature boundaries. As Attene et al. (2009) point out: "… in a human body model the neck may be considered part of both the head and the torso". To complicate matters further, the training of deep-learning models relies on the availability of massive amounts of data, a condition rarely met in cultural heritage applications, although this limitation can be reduced with transfer learning, an approach in which the recognition capabilities of a model trained on massive generic data are extrapolated to analyse a smaller sample.
But whatever the means to describe collections and share information, it would be appropriate to have a system that analyses the intrinsic visual characteristics of the objects, specifically a system that processes queries by recognising the shape of objects without relying exclusively on keywords as search criteria (i.e. content-based information retrieval, or CBIR). One step in the development of such a system is computing a numerical representation of shape for each object (i.e. its shape descriptor) and implementing algorithms that compare all the shape descriptors stored in a database (i.e. a matching operation) to facilitate the retrieval of 3D models (Funkhouser et al., 2003; Tangelder & Veltkamp, 2008).
This paper describes the first stage of an ongoing project oriented towards that goal: the implementation of the first module (MeshAnalizer) of a system called ArcheoShape, based on three types of shape descriptors and four dissimilarity measures that support the matching and retrieval operations. MeshAnalizer will function as a kind of search engine that, instead of using keywords, recognises objects automatically by comparing their numerical shape descriptions. Future work will integrate deep-learning techniques for annotation and segmentation. The benefits of such a system are conspicuous in the context of museums, where it is necessary to find and retrieve 3D models from large collections.
A system of shape recognition must be able to discover shape similarities between partially isometric objects, that is, between objects that share shape characteristics even if they are not identical. For example, a researcher might need 3D models of all the anthropomorphic figurines in a museum. In this case, the query should not be affected by the fact that one object lacks a head, while another is missing a leg. Also, it must retrieve several complete models of the same class, regardless of whether they differ in certain details (Gal & Cohen-Or, 2006). The three objects shown in Figure 1 illustrate this situation; they belong to the same class of anthropomorphic figures, but their heads and arms show some morphological differences.
The second requirement is that the system be able to detect similarities without being affected by affine variations such as rotation, translation, reflection and scale of the 3D models. Ideally, different combinations of translation, rotation, and scale applied to equal objects during the digitisation process should not affect the system’s capacity to recognise their similarity.
To fulfil those requirements, it is necessary to apply computer vision methods specifically designed to identify similarities between objects efficiently, within the processing limits of today's computers, and that are sufficiently discriminating to meet the needs of cultural heritage institutions.
The development of shape recognition systems has been the subject of research since the 1980s, especially in the fields of computer vision, geometric modelling, and machine learning (Besl & Jain, 1985; Bustos et al., 2004, 2005; Campbell & Flynn, 2001; Jain & Mishra, 2014; Lara López et al., 2017; Loncaric, 1998; Tangelder & Veltkamp, 2008; Theologou et al., 2014; Veltkamp & Hagedoorn, 1999). In the early 2000s, Funkhouser et al. (2003) developed a system for retrieving 3D models by applying the spherical harmonics shape descriptor, implementing an interface that allowed queries by example as well as sketch-based searches. Another early system was developed by Paquet and Rioux (2000) with shape descriptors based on tensors of inertia, distributions of normal vectors, distributions of cords and multiresolution analysis. Its interface allows entering a combination of parameters such as scale, shape or colour to search the database. Other innovations were due to Suzuki (2001), who included the processing of material data (colour and texture) as search criteria, although his descriptors require normalization of the 3D models via PCA.
In the field of cultural heritage, Rowe et al. (2001) and Rowe and Razdan (2002) implemented a shape-based search engine for the analysis and retrieval of Native American ceramic vessels. Objects were modelled as parametric surfaces, and the interface allows queries by example, sketch-based queries, and links to descriptive data. Another interesting system was designed by Schurmans et al. (2002) according to specific archaeological research objectives; for example, indicators of craft specialization can be gathered from the morphology of ceramic vessels. This involves matching shapes, as well as text, numeric, and vessel data calculated with the system tools.
Those projects demonstrate that the effectiveness of a recognition system depends above all on efficiently implementing two basic procedures. The first consists in calculating a "shape descriptor", that is, a numerical representation of the form of each 3D model. Such a descriptor can represent the global geometry of the object or a sample of its local features. The computation of shape descriptors involves a combination of mathematical, statistical, and, more recently, Machine Learning methods to represent shape in a numerical array or feature vector (Tangelder & Veltkamp, 2008). Some examples of characteristics encoded by shape descriptors are the curvature or orientation of a certain quantity of patches drawn around randomly chosen points (Jiménez-Badillo et al., 2013), or, alternatively, signals calculated with spherical functions (Kazhdan & Funkhouser, 2002; Kazhdan et al., 2003), reflective symmetry (Kazhdan et al., 2002, 2004a, 2004b), spin-images (Johnson & Hebert, 1999), shape-contexts (Mori et al., 2001), histograms of spherical orientations (Roman-Rangel et al., 2016), and many others described in the bibliographic surveys by Bustos et al. (2004, 2005), Lara López et al. (2017), and Rostami et al. (2019).
More recently, deep-learning models have been developed to compute shape descriptors for the recognition of generic objects. The spectrum of techniques includes supervised and self-supervised approaches (Herrewegen et al., 2023), such as Neural Networks (Krestenitis et al., 2020; Xie et al., 2017; Yang et al., 2021), Convolutional Neural Networks (Feng et al., 2018; Kim & Chae, 2022), Autoencoders (Furuya & Ohbuchi, 2020), Point Cloud Networks (Wang et al., 2020), Graph Neural Networks (He et al., 2020), and Geometric Deep Learning techniques (Bronstein et al., 2017). These methods, however, require large data sets for training the models, a condition rarely met by cultural heritage collections, including the one used for this project.
In any case, the final objective is that the shape of the object is characterized in the best possible manner, so as to constitute a "signature" (numerical representation) of the object readable by a computer. As mentioned above, the numerical descriptor must represent the shape regardless of the object's position, orientation and scale.
The second procedure consists in creating an index of the numerical representations of all the objects (i.e. their shape descriptors) to facilitate the matching operation. The comparison between 3D models is done with a mathematical function, such as the Euclidean distance, that measures their degree of resemblance, so that when a query is issued the system can rank objects from most to least similar (Bustos et al., 2004; Tangelder & Veltkamp, 2008).
The search-engine module developed over the course of this project is based on the implementation of three different global descriptors, namely shape distributions (Osada et al., 2002), reflective symmetry (Kazhdan et al., 2002, 2004a, 2004b) and spherical harmonic functions (Kazhdan and Funkhouser, 2002; Kazhdan et al., 2003).
As for determining the degree of dissimilarity between objects, four measures have been implemented: Euclidean distance, City block (Manhattan) distance, Chebychev distance,1 and Minimum Coordinate distance (Table 1). In the case of shape distributions, each dissimilarity measure has been implemented in two norms: the probability density function (pdf), and the cumulative distribution function.
Dissimilarity measure | Definition
---|---
Euclidean distance | $d(x, y) = \sqrt{\sum_{i}(x_i - y_i)^2}$
"City block" (Manhattan) distance | $d(x, y) = \sum_{i}\lvert x_i - y_i\rvert$
Chebychev distance | $d(x, y) = \max_{i}\lvert x_i - y_i\rvert$
Minimum coordinate distance | $d(x, y) = \min_{i}\lvert x_i - y_i\rvert$
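To make the comparison step concrete, the following sketch shows how the four measures in Table 1 can be computed over two descriptor vectors of equal length. The function names and the example vectors are illustrative assumptions; they are not taken from the MeshAnalizer source code.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdio>
#include <vector>

// Four dissimilarity measures over two descriptor vectors of equal length.
// Smaller values mean the two shape descriptors are more alike.
double euclidean(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double sum = 0.0;
    for (size_t i = 0; i < a.size(); ++i) sum += (a[i] - b[i]) * (a[i] - b[i]);
    return std::sqrt(sum);
}

double cityBlock(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double sum = 0.0;
    for (size_t i = 0; i < a.size(); ++i) sum += std::fabs(a[i] - b[i]);
    return sum;
}

double chebychev(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double best = 0.0;
    for (size_t i = 0; i < a.size(); ++i) best = std::max(best, std::fabs(a[i] - b[i]));
    return best;
}

double minCoordinate(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size() && !a.empty());
    double best = std::fabs(a[0] - b[0]);
    for (size_t i = 1; i < a.size(); ++i) best = std::min(best, std::fabs(a[i] - b[i]));
    return best;
}

int main() {
    std::vector<double> d1 = {0.10, 0.25, 0.40, 0.25};  // e.g. a normalised shape histogram
    std::vector<double> d2 = {0.12, 0.20, 0.45, 0.23};
    std::printf("Euclidean  %.4f\n", euclidean(d1, d2));
    std::printf("City block %.4f\n", cityBlock(d1, d2));
    std::printf("Chebychev  %.4f\n", chebychev(d1, d2));
    std::printf("Min coord  %.4f\n", minCoordinate(d1, d2));
}
```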
The following sections describe, in layman’s terms, the methods to compute the three shape descriptors selected for the implementation of the search-engine module.
The simplest shape descriptors included in the search-engine module are five probability distributions proposed by Osada et al. (2002). The names given to the descriptors depend on the type of calculation ("A" stands for angle and "D" for distance):
• A3: The angle between three random points on the surface of the 3D model.
• D1: The distance between the centroid of the model and one random point on its surface.
• D2: The distance between two random points on the surface.
• D3: The square root of the area of the triangle formed by three random points sampled on the surface.
• D4: The cube root of the volume of the tetrahedron formed by four random points sampled on the surface.
As Osada et al. (2002) recommend, we compute these variables for a very large sample of points selected from the surface mesh of each 3D model, specifically 1,048,576 points (i.e. 1024 × 1024 points). The measurements are then transformed into a frequency histogram (probability distribution), which can be used as the global signature of the object's shape. Once the shape histograms for all the objects have been computed, a normalization step is necessary to standardize the scales of all the histograms in order to avoid matching errors due to variations in the size of the objects. The objective is to find the scale that produces the minimal dissimilarity measure during the comparison of two objects' histograms. To achieve this, one of the methods proposed by Osada et al. (2002) involves the following steps: align both shape distributions (i.e. histograms) so that the mean sample in each distribution equals 1; then compute $D(f(x),\, s\,g(sx))$ for values of $\log s$ from $-10$ to $10$ in 100 equally spaced intervals, where $f$ and $g$ represent the shape distributions of two models, $x$ corresponds to the measured shape variable, and $D$ is the dissimilarity function chosen to compare the distributions of the selected shape function (A3, D1, D2, D3, or D4). Finally, select the minimum value among the results and use it as the dissimilarity measure for the two normalised shape distributions. This guarantees that two objects of the same shape but different sizes are recognized as similar and, conversely, that two objects of the same size but different shapes are recognised as different.
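The following sketch illustrates the core of the D2 computation: area-weighted sampling of random surface points, pairwise distances, and binning into a normalised histogram. It is a simplified illustration, not the MeshAnalizer implementation; the mesh, the sample count and the bin count are toy values, and the scale-normalisation search described above is omitted.

```cpp
#include <algorithm>
#include <array>
#include <cmath>
#include <cstdio>
#include <random>
#include <vector>

struct Vec3 { double x, y, z; };
using Triangle = std::array<Vec3, 3>;

static Vec3 sub(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
static Vec3 cross(Vec3 a, Vec3 b) {
    return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
}
static double norm(Vec3 a) { return std::sqrt(a.x * a.x + a.y * a.y + a.z * a.z); }

// Uniform random point on a triangle via square-root barycentric sampling.
static Vec3 randomPointOnTriangle(const Triangle& t, std::mt19937& rng) {
    std::uniform_real_distribution<double> u(0.0, 1.0);
    double r1 = std::sqrt(u(rng)), r2 = u(rng);
    double a = 1.0 - r1, b = r1 * (1.0 - r2), c = r1 * r2;
    return {a * t[0].x + b * t[1].x + c * t[2].x,
            a * t[0].y + b * t[1].y + c * t[2].y,
            a * t[0].z + b * t[1].z + c * t[2].z};
}

// D2 shape distribution: histogram of distances between random surface points.
std::vector<double> d2Histogram(const std::vector<Triangle>& triangles, int samples, int bins) {
    std::mt19937 rng(42);
    // Area-weighted triangle selection so points are uniform over the surface.
    std::vector<double> areas;
    for (const auto& t : triangles)
        areas.push_back(0.5 * norm(cross(sub(t[1], t[0]), sub(t[2], t[0]))));
    std::discrete_distribution<int> pick(areas.begin(), areas.end());

    std::vector<double> dists(samples);
    double maxDist = 0.0;
    for (int i = 0; i < samples; ++i) {
        Vec3 p = randomPointOnTriangle(triangles[pick(rng)], rng);
        Vec3 q = randomPointOnTriangle(triangles[pick(rng)], rng);
        dists[i] = norm(sub(p, q));
        maxDist = std::max(maxDist, dists[i]);
    }
    // Bin the distances into a normalised histogram (the shape "signature").
    std::vector<double> hist(bins, 0.0);
    for (double d : dists) {
        int b = maxDist > 0.0 ? std::min(bins - 1, static_cast<int>(bins * d / maxDist)) : 0;
        hist[b] += 1.0 / samples;
    }
    return hist;
}

int main() {
    // A unit tetrahedron stands in for a decimated archaeological mesh.
    Vec3 v0{0,0,0}, v1{1,0,0}, v2{0,1,0}, v3{0,0,1};
    std::vector<Triangle> tris = {Triangle{v0,v1,v2}, Triangle{v0,v1,v3},
                                  Triangle{v0,v2,v3}, Triangle{v1,v2,v3}};
    // Far fewer samples/bins than the 1,048,576 / 1,024 used in the paper.
    auto hist = d2Histogram(tris, 100000, 32);
    for (int i = 0; i < 32; ++i) std::printf("bin %2d: %.4f\n", i, hist[i]);
}
```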
The resemblance between any pair of objects can be determined by applying a function that measures dissimilarity between distributions (i.e. histograms), for example Euclidean distance or any of the other measures mentioned in Table 1.
Figure 2 shows histograms resulting from the descriptor A3 (angle between three random points), representing the shape of four archaeological objects. Notice the probability distributions of the two models on the left, reflecting the differences between the long, wavy form of the serpentiform sceptre and the flat, wide anthropomorphic figurine. For an elongated figure, the angles between vectors tend to concentrate around the mode, while for flat objects the histogram would have a flat distribution because there would not be a predominant value of angle between vectors. In contrast, the images on the right correspond to two vessels whose histograms are quite similar because their shapes are also alike. Through this kind of comparison, the recognition system manages to identify similarities or differences between archaeological objects.
The second representation of shape, more complex but at the same time more effective, is the reflective symmetry descriptor proposed by Kazhdan et al. (2002, 2004a, 2004b). As these authors point out, symmetry —or the lack of it — is one of the most distinctive characteristics of any object.
Given a 3D model, denoted by a function $g$, the concept of reflective symmetry implies that there is a reflection function $\gamma$ such that $\gamma(g) = g$. This means that the pointwise distance between the points of surface $g$ and the points of surface $\gamma(g)$ is zero.

Kazhdan et al. (2002) propose quantifying reflective symmetry with respect to several cutting planes, oriented on perpendicular axes that pass through the centroid of the model. For any given plane $P$ cutting the shape $f$, the method consists in finding the function $g$ such that $\gamma_P(g) = g$, with $\lVert f - g \rVert$ as small as possible. Mathematically this is expressed as:

$$SD(f, P) = \min_{g:\,\gamma_P(g) = g} \lVert f - g \rVert \qquad (1)$$

where SD stands for Symmetry Descriptor and $\gamma_P$ denotes reflection with respect to the plane $P$. The more symmetric the shape $f$ with respect to plane $P$, the smaller the value of $SD(f, P)$. Large values of $SD(f, P)$ indicate that the surface is less symmetric.
The calculation of reflective symmetry can be performed more quickly and efficiently by transforming the description of the surface mesh (3D model) into a discrete volumetric representation (i.e. a voxel grid). The process starts by immersing the triangular surface mesh inside a regular 3D grid. When a triangle of the mesh intersects a voxel of the 3D grid, that voxel is assigned a value of 1. This rasterization process makes it possible to determine where the points and triangles of the mesh are located within the 3D grid. Notice that there are voxels in the grid that do not intersect the 3D mesh and therefore lack any information. For calculating the reflective symmetry descriptor, it is convenient to add to these voxels information about how far they are from the surface of the model, for which the distance transform is used. This transform consists of assigning to each voxel the distance to the nearest voxel belonging to the model. The distance is then converted into a measure of similarity with a Gaussian function. Additional voxelization methods can be found in Aleksandrov et al. (2021) and Huang et al. (1998). The resulting discrete representation consists of a 3D set of voxels, which appears like the archaeological model shown in Figure 3.2
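As a rough illustration of this rasterisation step, the sketch below marks the voxels hit by a set of surface sample points, approximates the distance transform with a breadth-first sweep over the grid (instead of an exact Euclidean distance transform), and converts the distances to similarities with a Gaussian. The names and the point-sampling strategy are assumptions made for the example, not the code used in MeshAnalizer.

```cpp
#include <algorithm>
#include <array>
#include <cmath>
#include <cstdio>
#include <queue>
#include <vector>

// A toy rasteriser over an n x n x n grid holding similarity values in [0,1].
struct Grid {
    int n;
    std::vector<float> value;
    int idx(int x, int y, int z) const { return (z * n + y) * n + x; }
};

Grid voxelize(const std::vector<std::array<float, 3>>& surfacePoints, int n, float sigma) {
    Grid g{n, std::vector<float>(n * n * n, 0.0f)};
    std::vector<int> dist(n * n * n, -1);
    std::queue<int> frontier;
    // 1. Occupancy: points are assumed to be already scaled into [0,1]^3.
    for (const auto& p : surfacePoints) {
        int x = std::min(n - 1, static_cast<int>(p[0] * n));
        int y = std::min(n - 1, static_cast<int>(p[1] * n));
        int z = std::min(n - 1, static_cast<int>(p[2] * n));
        int i = g.idx(x, y, z);
        if (dist[i] != 0) { dist[i] = 0; frontier.push(i); }
    }
    // 2. Approximate distance transform: BFS in 6-connected voxel steps.
    const int dx[6] = {1,-1,0,0,0,0}, dy[6] = {0,0,1,-1,0,0}, dz[6] = {0,0,0,0,1,-1};
    while (!frontier.empty()) {
        int i = frontier.front(); frontier.pop();
        int x = i % n, y = (i / n) % n, z = i / (n * n);
        for (int k = 0; k < 6; ++k) {
            int xx = x + dx[k], yy = y + dy[k], zz = z + dz[k];
            if (xx < 0 || yy < 0 || zz < 0 || xx >= n || yy >= n || zz >= n) continue;
            int j = g.idx(xx, yy, zz);
            if (dist[j] == -1) { dist[j] = dist[i] + 1; frontier.push(j); }
        }
    }
    // 3. Gaussian of the distance: voxels far from the surface tend to 0.
    for (int i = 0; i < n * n * n; ++i)
        g.value[i] = std::exp(-(dist[i] * dist[i]) / (2.0f * sigma * sigma));
    return g;
}

int main() {
    // Points on the plane x = 0.5 stand in for a sampled mesh surface.
    std::vector<std::array<float, 3>> pts;
    for (float y = 0.f; y < 1.f; y += 0.05f)
        for (float z = 0.f; z < 1.f; z += 0.05f) pts.push_back({0.5f, y, z});
    Grid g = voxelize(pts, 32, 2.0f);
    std::printf("similarity at centre voxel: %.3f\n", g.value[g.idx(16, 16, 16)]);
}
```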
Once the voxel grid has been labeled in this way, the descriptor can be calculated. Broadly speaking, what is done is to assume that, when a cutting plane passes through the voxel grid, there is perfect reflective symmetry between the two halves, so that one half could replace the other if that property held. The reflective symmetry distance is then calculated on the real model and compared with this assumed model to measure any difference. If both representations are equal (zero distance), there is perfect symmetry and a radius of 1 is assigned to the corresponding plane. If not, a value less than 1 is assigned according to how different the two halves are.
Finally, the measures of symmetry obtained from a number of cutting planes (i.e. axes of symmetry) are concatenated to generate a 3D graph describing the model's global symmetries. Values near 1 indicate perfect symmetry, while those near zero indicate that the two halves of the model are highly asymmetrical. This graph is used to compare an object with any other during the matching and retrieval operation. Visual representations of the descriptors obtained from three archaeological models are shown in Figure 4.
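A minimal sketch of a per-plane symmetry measure on such a voxel grid is given below, assuming a single fixed mirror plane through the middle of the grid. It reports the norm of the symmetric part of the grid relative to the norm of the grid, which is 1 for a perfectly mirror-symmetric grid and decreases as the halves diverge; the full descriptor repeats a measurement of this kind for many rotated planes, as described above.

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Symmetry of an n x n x n voxel grid with respect to the plane x = n/2,
// computed as || (f + mirror(f)) / 2 || divided by || f ||.
double planeSymmetry(const std::vector<float>& f, int n) {
    double symNorm = 0.0, norm = 0.0;
    for (int z = 0; z < n; ++z)
        for (int y = 0; y < n; ++y)
            for (int x = 0; x < n; ++x) {
                float v  = f[(z * n + y) * n + x];
                float vm = f[(z * n + y) * n + (n - 1 - x)];   // mirrored voxel
                double s = 0.5 * (v + vm);                     // symmetric part
                symNorm += s * s;
                norm    += v * v;
            }
    return norm > 0.0 ? std::sqrt(symNorm / norm) : 1.0;
}

int main() {
    const int n = 8;
    std::vector<float> grid(n * n * n, 0.0f);
    grid[(4 * n + 4) * n + 1] = 1.0f;            // a single voxel near one side...
    std::printf("asymmetric grid: %.3f\n", planeSymmetry(grid, n));
    grid[(4 * n + 4) * n + (n - 2)] = 1.0f;      // ...and its mirror image
    std::printf("symmetric grid:  %.3f\n", planeSymmetry(grid, n));
}
```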
As with the histograms shown in Figure 2, reflective symmetry descriptors offer another method for calculating dissimilarity between 3D models.
A third way of describing shapes on a computer is to consider them as outcomes of mathematical functions. Each stroke of a drawing, for example, can be regarded as a mix of 2D functions. The numerical representation of the whole drawing would be the sum of many functions. In practice, the functions are unknown, but they can be calculated by applying standard mathematical procedures such as the Fourier Transform, which would find the specific mix of simple functions that represent the complete drawing.
Something similar happens with 3D objects, but in that case the function describing the shape is defined on the surface of the sphere. One way of describing the shape of a surface is to calculate the so-called spherical harmonic functions (Figure 5). Intuitively, we can think of the spherical harmonic functions as "Lego" pieces that, together, help build the shape of complex 3D objects. This is possible because, mathematically speaking, spherical harmonics constitute a complete set of orthogonal functions and therefore form an orthonormal basis upon which any function defined on the sphere (like the shape of a 3D model) can be expressed as a sum of spherical harmonics.
The exact combination of spherical harmonic functions needed to describe a particular object can be found through harmonic analysis. This method divides the complex surface of a 3D model into sums of relatively simple components.
Harmonic analysis is the branch of mathematics that deals with the problem of representing functions as the combination of basic elements called waves or harmonics. Typically, the term harmonic refers to functions with sinusoidal variations, but more strictly, it indicates any solution of Laplace’s Equation. The Fourier series is an example of a complete set of harmonics, which consists of sine and cosine waves of different frequencies.
In this work, we adopted the Spherical Harmonic Transform (SHT) to obtain a reduced representation of a 3D mesh. This method is a powerful tool for describing data on a sphere using spherical harmonics as basis functions. Given a function $f(\theta, \varphi)$ in the spherical coordinates $\theta$ and $\varphi$, the decomposition of $f$ in spherical harmonics is written as:

$$f(\theta, \varphi) = \sum_{l=0}^{\infty} \sum_{m=-l}^{l} a_l^m \, Y_l^m(\theta, \varphi) \qquad (2)$$

Here, $l \geq 0$ and $m$ are integers such that $|m| \leq l$, $a_l^m$ is the coefficient of the harmonic $Y_l^m$, and the general form of $Y_l^m$ is:

$$Y_l^m(\theta, \varphi) = \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}}\; P_l^m(\cos\theta)\, e^{i m \varphi} \qquad (3)$$

where $P_l^m$ is the associated Legendre polynomial:

$$P_l^m(x) = \frac{(-1)^m}{2^l\, l!}\,(1 - x^2)^{m/2}\, \frac{d^{\,l+m}}{dx^{\,l+m}}\,(x^2 - 1)^l \qquad (4)$$
The problem in the Spherical Harmonic Transform is to calculate the coefficients $a_l^m$.

In practice, it is not possible to calculate the coefficients of all the spherical harmonic functions. For this reason, we limit the order of the harmonics to a fixed value $b$ (for instance 16 or 32) so that:

$$f(\theta, \varphi) \approx \sum_{l=0}^{b-1} \sum_{m=-l}^{l} a_l^m \, Y_l^m(\theta, \varphi) \qquad (5)$$

Finally, the coefficients are estimated by finding the least-squares solution to equation (5). That is to say, for a set of $n$ points $(\theta_i, \varphi_i)$ where $f$ is evaluated, we calculate the values of the coefficients $a_l^m$ that minimize:

$$\sum_{i=1}^{n} \left( f(\theta_i, \varphi_i) - \sum_{l=0}^{b-1} \sum_{m=-l}^{l} a_l^m \, Y_l^m(\theta_i, \varphi_i) \right)^{2} \qquad (6)$$
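For illustration, the sketch below evaluates real-valued spherical harmonics $Y_l^m$ with the standard associated-Legendre recurrences; these are the basis values that would populate the least-squares system above. It is a self-contained example with illustrative names, not the routine used in the module, which could equally rely on a numerical library.

```cpp
#include <cmath>
#include <cstdio>

const double PI = 3.14159265358979323846;

// Associated Legendre polynomial P_l^m(x) for m >= 0, by the usual recurrences.
double legendreP(int l, int m, double x) {
    double pmm = 1.0;                                  // P_m^m
    if (m > 0) {
        double somx2 = std::sqrt((1.0 - x) * (1.0 + x));
        double fact = 1.0;
        for (int i = 1; i <= m; ++i) { pmm *= -fact * somx2; fact += 2.0; }
    }
    if (l == m) return pmm;
    double pmmp1 = x * (2.0 * m + 1.0) * pmm;          // P_{m+1}^m
    if (l == m + 1) return pmmp1;
    double pll = 0.0;
    for (int ll = m + 2; ll <= l; ++ll) {              // upward recurrence in l
        pll = ((2.0 * ll - 1.0) * x * pmmp1 - (ll + m - 1.0) * pmm) / (ll - m);
        pmm = pmmp1;
        pmmp1 = pll;
    }
    return pll;
}

// Normalisation constant K_l^m = sqrt((2l+1)/(4*pi) * (l-m)!/(l+m)!), m >= 0.
double K(int l, int m) {
    double r = (2.0 * l + 1.0) / (4.0 * PI);
    for (int i = l - m + 1; i <= l + m; ++i) r /= i;
    return std::sqrt(r);
}

// Real-valued spherical harmonic Y_l^m(theta, phi), with -l <= m <= l.
double realSH(int l, int m, double theta, double phi) {
    if (m == 0) return K(l, 0) * legendreP(l, 0, std::cos(theta));
    if (m > 0)  return std::sqrt(2.0) * K(l, m) * std::cos(m * phi) * legendreP(l, m, std::cos(theta));
    return std::sqrt(2.0) * K(l, -m) * std::sin(-m * phi) * legendreP(l, -m, std::cos(theta));
}

int main() {
    // Y_0^0 is constant: 1/sqrt(4*pi) ~ 0.2821, independent of direction.
    std::printf("Y_0^0 = %.4f\n", realSH(0, 0, 1.0, 2.0));
    std::printf("Y_2^1(pi/3, pi/4) = %.4f\n", realSH(2, 1, PI / 3.0, PI / 4.0));
}
```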
To describe a 3D mesh using the Spherical Harmonic Transform, we define the function $f_r(\theta, \varphi)$ as the intersection between the voxelized version of the 3D mesh and the sphere of radius $r$, both centred at the origin. The function $f_r(\theta, \varphi)$ takes the value 1 only if the sphere intersects a voxel of the mesh at the point $(r, \theta, \varphi)$; otherwise, the function is 0. The Spherical Harmonic Transform is applied to several different radii, so that the functions $f_r$ are characterized by their corresponding harmonic coefficients.
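A simple way to obtain such a function from an occupancy grid is sketched below: the sphere of radius $r$ (in voxel units) is sampled at regular angles and each sample reads the nearest voxel. This is an assumption-laden illustration; the grid, the sampling resolution and the nearest-voxel lookup are simplifications, not the exact procedure used in MeshAnalizer.

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

const double PI = 3.14159265358979323846;

// Sample the binary spherical function f_r(theta, phi): 1 if the sphere of
// radius r (in voxel units, centred on the grid) passes through an occupied
// voxel, 0 otherwise. The grid is n x n x n with 0/1 occupancy values.
std::vector<double> sphericalFunction(const std::vector<int>& grid, int n, double r,
                                      int thetaSteps, int phiSteps) {
    std::vector<double> f(thetaSteps * phiSteps, 0.0);
    double c = 0.5 * (n - 1);                        // grid centre
    for (int i = 0; i < thetaSteps; ++i)
        for (int j = 0; j < phiSteps; ++j) {
            double theta = PI * (i + 0.5) / thetaSteps;
            double phi = 2.0 * PI * j / phiSteps;
            int x = static_cast<int>(std::lround(c + r * std::sin(theta) * std::cos(phi)));
            int y = static_cast<int>(std::lround(c + r * std::sin(theta) * std::sin(phi)));
            int z = static_cast<int>(std::lround(c + r * std::cos(theta)));
            bool inside = x >= 0 && y >= 0 && z >= 0 && x < n && y < n && z < n;
            f[i * phiSteps + j] = (inside && grid[(z * n + y) * n + x]) ? 1.0 : 0.0;
        }
    return f;
}

int main() {
    const int n = 16;
    std::vector<int> grid(n * n * n, 0);
    grid[(8 * n + 8) * n + 12] = 1;                  // one occupied voxel off-centre
    auto f = sphericalFunction(grid, n, 4.5, 32, 64);
    double hits = 0; for (double v : f) hits += v;
    std::printf("samples on the r = 4.5 sphere that hit the surface: %.0f\n", hits);
}
```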
The simplest harmonic function is the sphere, so if the object resembles a balloon, only one harmonic component of degree zero is enough for describing it. However, if the model has a more complex shape, then it is essential to calculate several higher order harmonic functions.
There are several methods to compute shape descriptors based on spherical harmonics. Some require a priori registration of the model along its principal axes (Saupe & Vranic, 2001; Vranic & Saupe, 2001; Vranic et al., 2001), but these do not cope well with 3D models of the same class digitised with different orientations (Funkhouser et al., 2003). A method that overcomes this limitation is the one proposed by Kazhdan and Funkhouser (2002) and Kazhdan et al. (2003), and it is the one implemented in this project. In practice, the descriptor is computed as follows:
1. The 3D model is subjected to a voxelization process, like the one applied in the case of reflective symmetry (c.f. Huang et al. 1998). The size of the voxel grid is 64 × 64 × 64.
2. The 3D model is aligned with its voxel representation in such a way that its centre of mass coincides with the centre of the voxel grid.
3. A voxel is assigned a value of 1 if it contains any point on the surface of the 3D model, and 0 otherwise.
4. The voxel grid is decomposed into 32 spheres of radii 1 to 32, which produces 32 spherical functions.
5. Each sphere is decomposed as a sum of its first 16 spherical harmonics.
6. Finally, these different signatures are combined to obtain a 32 × 16 signature for the 3D model, as sketched below. The result is a 2D image that represents the decomposition coefficients for each harmonic function and each radius (Figure 6).
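The last step can be sketched as follows: for every radius and every frequency $l$, the $2l+1$ coefficients of that band are collapsed into a single energy value, yielding the rotation-invariant 32 × 16 signature. The data layout and names below are illustrative assumptions, not the module's actual data structures.

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Collapse spherical-harmonic coefficients into the rotation-invariant
// 32 x 16 signature: for each radius r and frequency l, keep only the energy
// sqrt(sum_m a_{r,l,m}^2). coeffs[r][l] holds the 2l+1 coefficients of band l.
std::vector<std::vector<double>>
energySignature(const std::vector<std::vector<std::vector<double>>>& coeffs) {
    std::vector<std::vector<double>> sig(coeffs.size());
    for (size_t r = 0; r < coeffs.size(); ++r) {
        sig[r].resize(coeffs[r].size());
        for (size_t l = 0; l < coeffs[r].size(); ++l) {
            double e = 0.0;
            for (double a : coeffs[r][l]) e += a * a;
            sig[r][l] = std::sqrt(e);
        }
    }
    return sig;
}

int main() {
    // Dummy coefficients: 32 radii, 16 bands, 2l+1 coefficients per band.
    std::vector<std::vector<std::vector<double>>> coeffs(32);
    for (int r = 0; r < 32; ++r) {
        coeffs[r].resize(16);
        for (int l = 0; l < 16; ++l) coeffs[r][l].assign(2 * l + 1, 0.01 * (r + 1));
    }
    auto sig = energySignature(coeffs);
    std::printf("signature size: %zu x %zu, sig[0][3] = %.4f\n",
                sig.size(), sig[0].size(), sig[0][3]);
}
```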
To compare two objects using their harmonic representations, it is simply necessary to compute the Euclidean distance between them: “Thus, finding the K closest models to a query is equivalent to solving the nearest-neighbour problem” (Kazhdan & Funkhouser, 2002). Figure 6 shows the spherical harmonics descriptor for three objects.
Some early systems were tested with collections of 3D models produced with computer-aided design software (parametric models) and acquired on the internet. In contrast, we have developed the first module (MeshAnalizer) of a search engine called ArcheoShape that uses real archaeological objects. At this stage, the objective is to assess how well the shape descriptors described in the previous sections match and retrieve real archaeological objects before continuing the development of the entire system and deploying it within a museum environment. The module developed over the course of this project is freely available in a GitHub repository (Mendoza-Montoya, 2023b), which contains the source code, written in C++ and ready to be compiled on Windows and Linux, as well as an executable file. Instructions to compile are included in the GitHub repository. A sample of ten 3D models of archaeological artefacts is also provided under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International licence. These resources allow any user to test the MeshAnalizer module on a local computer, as well as to replicate or customise the module for their own purposes.
To test the implementation of the shape descriptors, we used a sample of nearly 500 archaeological artefacts from the Museo del Templo Mayor. The collection is available for research purposes through specific agreements with Instituto Nacional de Antropología e Historia.3
The Museo del Templo Mayor preserves objects discovered between 1978 and 1982 within the area occupied by the Sacred Precinct of Tenochtitlan, the most important religious centre of the Aztecs and nowadays a famous archaeological site adjacent to Zócalo square in Mexico City (Matos Moctezuma, 1988). The core collection includes more than 8000 objects from ritual offerings found in the main pyramid temple (i.e. Templo Mayor) of the site and its surroundings, including ritual artefacts, flora, fauna, and human remains (López Luján, 1994; Nagao, 1985). The collection has increased considerably in recent years thanks to the excavations conducted at the same site by different research teams led by archaeologists Barrera Rivera & Islas Domínguez (2018) and Barrera Rodríguez & López Luján (2012, 2019). Indeed, between 2012 and 2019, 43 new offerings containing 13,925 artefacts and 35,648 samples of organic material have been reported. Digitization of this collection is still at a very early stage, but we have been able to acquire a sample of 495 objects, including stone masks, anthropomorphic and zoomorphic figures, clay vessels (bowls, pots, jars, braziers), religious paraphernalia such as sceptres, earplugs and ritual pendants, as well as flint sacrificial knives, flutes, and models of drums made of clay or stone.
The implementation of the prototype was divided into two independent jobs, one offline and one online. The main offline task consists in computing the shape distributions, reflective symmetry and spherical harmonics descriptors of all the archaeological 3D models available ("Database of descriptors" in Figure 7). The actual matching and retrieval operations are done online.
The module performs four basic operations as follows:
• Mesh processing and modeling. Each mesh entered by the user is down-sampled before its shape descriptors are obtained. In the case of the reflective symmetry and spherical harmonics descriptors, the module also performs a preliminary step consisting in computing a voxelized representation of the mesh.
• Descriptor database management. The system has a collection of descriptors that up to now correspond to around 500 3D models of archaeological artefacts. This process is done offline. The resulting repository is the source against which any query object is compared during a search and retrieval operation and therefore underpins the artefact-matching framework.
• Descriptor comparison. At the core of the system lies the comparison of the query model input by the final user against the corpus of descriptors stored in the internal database. This process yields a list of objects ordered according to their similarity to the newly analyzed 3D mesh.
• Results visualization. The interface presents the outcome of the comparative analysis, showcasing images of the archaeological artifacts arrayed in the order determined by the descriptor-matching procedure. This sequential display facilitates an intuitive assessment of similarity.
The system's architecture is designed to reflect the sequential execution of these operations (Figure 7). As mentioned above, we have made an executable file of the module, as well as the source code, available through the link provided below in the Software Availability section. Every section of the source code includes detailed comments that will be updated to reflect changes in its functionality, which will facilitate replication of the software by others as it evolves.
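The skeleton below mirrors these four operations in a few dozen lines: a placeholder mesh type, an abstract descriptor algorithm, a descriptor database that ranks entries by squared Euclidean distance, and a printed result list. It is a conceptual sketch of the architecture, not an excerpt of the MeshAnalizer source; all class and variable names are invented for the example.

```cpp
#include <algorithm>
#include <cstdio>
#include <string>
#include <utility>
#include <vector>

struct Mesh { std::vector<double> vertices; };          // placeholder geometry
using Descriptor = std::vector<double>;

struct DescriptorAlgorithm {                            // shape distributions, symmetry, harmonics...
    virtual ~DescriptorAlgorithm() = default;
    virtual Descriptor compute(const Mesh& m) const = 0;
};

struct ToyAlgorithm : DescriptorAlgorithm {             // stands in for a real descriptor
    Descriptor compute(const Mesh& m) const override {
        double s = 0; for (double v : m.vertices) s += v;
        return {s, static_cast<double>(m.vertices.size())};
    }
};

struct Entry { std::string name; Descriptor d; };

struct DescriptorDatabase {                             // built offline, queried online
    std::vector<Entry> entries;
    std::vector<std::pair<double, std::string>> query(const Descriptor& q) const {
        std::vector<std::pair<double, std::string>> ranked;
        for (const auto& e : entries) {
            double dist = 0;
            for (size_t i = 0; i < q.size(); ++i) dist += (q[i] - e.d[i]) * (q[i] - e.d[i]);
            ranked.push_back({dist, e.name});
        }
        std::sort(ranked.begin(), ranked.end());        // most similar first
        return ranked;
    }
};

int main() {
    ToyAlgorithm algo;
    DescriptorDatabase db;
    db.entries.push_back({"vessel_01", algo.compute(Mesh{{1, 2, 3}})});
    db.entries.push_back({"figurine_07", algo.compute(Mesh{{9, 9}})});
    for (const auto& r : db.query(algo.compute(Mesh{{1, 2, 2}})))   // result visualisation
        std::printf("%-12s distance %.2f\n", r.second.c_str(), r.first);
}
```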
Next, we explain how these operations interact when a new 3D mesh is introduced into the system.
• A typical operation starts when the user opens a file corresponding to a query 3D model. This is a point cloud or surface mesh that corresponds to the class of object that the user wants to use as an example to retrieve all similar objects from the database (Figure 8). This type of content-based query makes it unnecessary to input text for the search and matching operations.
• Once the model has been loaded, it is possible to open a window to display model properties such as number of vertices and faces (Figure 9). The user can choose to render the query model as a triangular mesh or as a rasterized (i.e. voxelized) model (Figure 10). Additional options include displaying the query model as a solid surface, a triangular mesh, or as a point cloud. From the same window, the colour and level of shininess can also be adjusted.
• Once the model is loaded in the MeshAnalizer module, the user can choose any of the three algorithms available (i.e. Shape Histograms, Reflective Symmetry or Spherical Harmonics) to obtain the descriptor for that particular query model. There are no fixed rules for selecting an algorithm; the user is expected to try different options to see which one works best for retrieving a specific set of archaeological objects.
• For each descriptor, the user is presented with some parameters that can be adjusted to obtain more or less precision in the computation of the shape descriptor. In the case of Shape Distributions (Figure 11), the user can define how many samples and bins are used to build the five shape histograms (i.e. distances to the centroid, distances between points, areas of triangles, volumes of tetrahedra, or angles between vectors) for that particular query model. The default is 1,048,576 samples and 1,024 bins. The more samples, the more accurate the shape representation will be. These histograms are later used to compare the query model with the collection of descriptors of the 3D models stored in the database during the matching and retrieval operation.
• In the case of Reflective Symmetry (Figure 12) and Spherical Harmonics decomposition (Figure 13), the user can define the number of divisions - in the X, Y, and Z axes - that will be considered during the voxelization of the model. As mentioned above, both descriptors are computed from this voxelized representation. For example, if this parameter is set to the default value of 32, the algorithm will divide the mesh into 32 × 32 × 32 parts. The more divisions, the more accurate the computation of the descriptor.
• An additional parameter, the number of rotations, is available for the computation of reflective symmetry. This refers to how many times the model is rotated to test its symmetry. For every rotation, a plane cuts the model into two halves that are compared by the algorithm to assess how similar (i.e. symmetrical) they are. The descriptor for the object is obtained by concatenating the symmetry measures of all rotations. A large number of rotations improves the accuracy of the shape descriptor but involves higher computational cost. The default value of 8 is considered appropriate for all models. These defaults are summarised in the configuration sketch after this list.
• The next step is to compare the descriptor of the query model with the descriptors of all the objects stored in the database (i.e. the matching operation). A parameter allows selecting one of the four distance measures implemented, namely Euclidean, City block, Chebychev or Minimum Coordinate.
• Finally, the user presses “Compare descriptor with collection” and the search module proceeds to compare the query model with the descriptors of the models stored in the database, retrieving the results, which are shown in a new window. There is no limit on the number of results displayed by the module. The interface allows saving the results for future reference. Figures 14, 15 and 16 illustrate the user interface during three query examples.
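The default settings mentioned in the steps above can be summarised in a small configuration sketch. The struct and field names are illustrative; the actual values are set through the MeshAnalizer dialogs.

```cpp
#include <cstdio>

// Default query parameters as described in the steps above (illustrative names).
struct QueryParameters {
    // Shape distributions
    int samples = 1048576;        // random samples per histogram (1024 x 1024)
    int bins = 1024;              // histogram resolution
    // Reflective symmetry and spherical harmonics
    int voxelDivisions = 32;      // grid of voxelDivisions^3 cells
    int rotations = 8;            // cutting planes tested for reflective symmetry
    // Matching
    enum class Distance { Euclidean, CityBlock, Chebychev, MinimumCoordinate };
    Distance distance = Distance::Euclidean;
};

int main() {
    QueryParameters p;            // defaults; raise voxelDivisions or samples
    p.voxelDivisions = 64;        // for a more accurate (but slower) descriptor
    std::printf("voxel grid: %d x %d x %d, rotations: %d\n",
                p.voxelDivisions, p.voxelDivisions, p.voxelDivisions, p.rotations);
}
```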
Computer tools for shape matching and retrieval designed specifically for archaeological research could improve access to collections in museum institutions. The development of the module presented here is a step forward in this direction.
The search-engine module developed over the course of this project is generic, so we expect it will prove helpful in other contexts. The capacity of our system to perform matching and retrieval of real archaeological objects through the application of shape distributions, reflective symmetry, and spherical harmonics descriptors is significant. However, an extra module providing full database capabilities to store, update and edit the 3D models is still under construction. We also expect to perform a benchmark analysis, whose results will be published shortly.
Particularly important for further development is the implementation of additional shape descriptors that target local features, since these would help refine queries to specific details of the objects' geometry.
In this endeavour we intend to take advantage of the experience of colleagues in Computer Vision. Attene et al. (2009), for example, have developed a pipeline - and a software tool called ShapeAnnotator - which segments 3D meshes into parts that are then combined to form meaningful features. The system annotates the resulting features according to an ontology. Concepts in the ontology are entities with meaning that final users can identify and select in an intuitive interface. Furthermore, by analysing the topology and geometry of the segmented parts, the system can relate the features of one type of object to similar instances stored in a knowledge base. This pipeline and software have been applied to recognise parts of virtual avatars and manufactured parts, but the framework can be used in other domains thanks to its independence from the geometry of the models and from the domain ontology. We will therefore consider this work in the future development of ArcheoShape.
We plan to embed the search-engine module described here into a web platform which will be organized around three main application channels:
1. The first channel would be a service platform for the automatic recognition, analysis, and classification of cultural heritage objects based on morphology. Any user can upload a collection of 3D models to have it analysed with the software tools developed throughout the project. For this operation, the user will not need any knowledge of Computer Vision or Machine Learning because all necessary software will be accessible through a very easy-to-use interface.
2. The second channel, called Research, will be designed to encourage specialized collaboration between experts in Computer Vision, Machine Learning, and shape analysis interested in developing new algorithms, applications, and tools for the morphological analysis of cultural heritage, including our current deep-learning applications for shape analysis and retrieval. This collaboration will facilitate access to papers, project proposals, discussion forums, and source code. New solutions to technical problems are expected to evolve from this site. For example, one pervasive challenge when applying machine learning to archaeology is the lack of enough data to train automatic learning models. This channel could provide a forum for discussing new solutions, such as conditions for applying transfer-learning techniques to train models with external knowledge.
3. The third channel will be named People Interaction. Through this channel, scholars, students, and anyone interested in the project can establish collaboration for future projects and share data and resources from all over the world. The main objective is to create synergy to facilitate access to new 3D digital collections and to define new initiatives of morphological analysis with applications to archaeology and the Humanities.
Zenodo: omendoza83/ArcheoShape-Data: ArcheoShape 0.2. https://doi.org/10.5281/zenodo.7591490 (Mendoza-Montoya, 2023a).
This project contains the following underlying data:
• Models. (10 triangular meshes of Aztec objects).
• Resources. (6983 numerical shape descriptors, computed from 495 archaeological objects).
• Icons. (Images for the user interface).
• Screenshots. (Images of 495 archaeological objects, used to present results at the end of a search and matching operation).
Data are available under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC-BY-NC-ND 4.0).
Source code available from: https://github.com/omendoza83/ArcheoShape/tree/v0.3.0-alpha
Archived source code at the time of publication: https://doi.org/10.5281/zenodo.7583722 (Mendoza-Montoya, 2023b)
License: MIT
1 Also known as the L∞ metric, the Chebychev distance between two vectors is the greatest of their differences along any coordinate dimension.
2 The voxel grid preserves the topological characteristics of the original 3D model. These include connectivity, separation, coverage, and tunnelling. Connectivity registers the way voxels must be linked to each other in order to preserve the specific shape of the object. Separation registers empty spaces and how these interact in the voxelised object. Coverage registers how "thin" or "thick" the voxelised object is. Tunnelling registers the penetration of two voxelised elements of the object, that is, how two or more parts of the object overlap and/or penetrate each other (Aleksandrov et al., 2021).