1. Introduction
The
Lichenotheca Veneta is one of the most relevant historic collections of lichen
exsiccatae in Italy. Lichens have been collected and studied in the country since the Renaissance. Federico Cesi (1585–1630) [
1], using the microscope developed by Galileo Galilei, paved the way for the fundamental work of Pier Antonio Micheli (1679–1737), who should probably be considered the “father” of lichenological studies in Italy. After Micheli, the first significant contribution is recorded in 1846, the year of publication of the
Frammenti Lichenografici [
2] by Giuseppe De Notaris (1805–1877). De Notaris was one of the first to take advantage of the advancements in microscopy, and, in particular, the invention of a microscope with achromatic lenses by Giovanni Battista Amici (1786–1862) [
3,
4,
5]. During the nineteenth century, several researchers worked on lichens: Martino Anzi (1812–1881), Vittore Trevisan (1818–1897), Abramo Bartolomeo Massalongo (1824–1860), and Francesco Baglietto (1826–1916). The international relevance of their studies was remarkable, and several taxa described by these authors are still valid today [
4].
These researchers produced a relevant number of collections [
6]. Most of the specimens collected by Abramo Massalongo (including type specimens) are preserved at the Natural History Museum of Verona, and some are preserved at the Natural History Museum of Venice (Veneto, North-Eastern Italy). The herbarium of Francesco Baglietto is preserved at the University of Modena (Emilia Romagna, Central Italy) and, in small part, at the University of Genova (Liguria, North-Western Italy), which also hosts the collection of Camillo Sbarbaro (1888–1967), poet, writer, and one of the most relevant Italian lichenologists of the 1900s. The herbarium of Martino Anzi (1812–1883) is preserved at the University of Torino (Piemonte, North-Western Italy), while the University of Rome (Lazio, Central Italy) hosts that of Giuseppe De Notaris (1805–1877).
During the nineteenth century, it was quite common to publish and distribute
exsiccatae. These collections, composed of duplicated sets of specimens, were produced by one or several authors, and were meant as a form of scholar communication. They contain specimens which are normally associated with typewritten, printed labels, reporting far more information than simply the scientific name and gathering data. This information could range from complete diagnoses to taxonomic and systematic considerations. A relevant online inventory of
exsiccatae from all over the world is IndExs [
7].
In Italy, as far as cryptogams are concerned, Abramo Massalongo published his
Lichenes Italici Exsiccati (10 volumes, 1855–56 [
8]), which is still preserved in at least two complete copies, as they were originally published, at the Natural History Museums of Verona and Venice. In the same period, several researchers contributed to the
Erbario Crittogamico Italiano (2 series, 30 and 10 volumes, 1858–1885 [
9]), which contains specimens of lichens, mosses, liverworts, algae, and ferns. All these collections where published in a limited number of copies, and distributed among scholars.
Vittore Benedetto Antonio Trevisan, earl of Saint-Léon (Padua, 1818–Milan, 1897), was a botanist, specializing in cryptogamic flora. During his career, he published 141 works, and was a professor of natural history at the University of Padua between 1851 and 1853. He was a member of several scientific societies, and, in 1882, was appointed president of the physio-medical–statistical Academy of Milan. Trevisan focused on the study of lichens mostly between 1853 and 1869, and, in 1860, published what is probably his most relevant contribution in the field, a general conspectus of pyrenocarpic lichens, the
Conspectus Verrucarinarum [
10]. Trevisan’s system constitutes one of the last examples of a taxonomic arrangement of lichens on the basis of microscopical characters in the nineteenth century [
4]. He described 75 new genera of lichens, and built one of the most relevant private Italian herbaria of the time, which, at his death, included over a million specimens. Unfortunately, it was completely destroyed during the Second World War [
4,
11]. In 1869 he published the
Lichenotheca Veneta [
12] which, as with other collections of
exsiccatae, was built as a reference herbarium for scholars. The exact number of copies which were published by the Sante Pozzato Typography (Bassano, Veneto, NE Italy) is unknown, but certainly extremely limited [
11]. To our knowledge, the copy preserved at the Museum of Natural History “Giancarlo Ligabue” of Venice is probably the only one which is certainly complete and preserved in its original form as it was conceived by the author. As evidenced by the handwritten notes added on the covers of the volumes, this copy was given by Trevisan to the
Istituto Veneto di Scienze, Lettere ed Arti, presumably in 1877. From here it was moved in 1923 to the Museum of Natural History of Venice, together with other scientific collections.
Given the scientific, historical, and cultural relevance of the
Lichenotheca Veneta, it was decided to mobilize its specimens as well as the data stored in their labels by means of digitization. This contribution details the digitization process, its issues and challenges, and the publication of the digital specimens online at the URL
https://dryades.units.it/lichenothecaveneta (accessed on 20 December 2024) [
13].
2. Data and Methods
The data and digital images originated from the digitization of the
Lichenotheca Veneta, which was carried out between 2020 and 2023 at the Natural History Museum “Giancarlo Ligabue” of Venice. The data and images are stored on the servers of Project
Dryades [
14], at the Dept. of Life Sciences of the University of Trieste. They are accessible at the URL
https://dryades.units.it/lichenothecaveneta, accessed on 18 December 2024 [
13], in a web portal enriched by a query system written in PHP and operating on a MySQL database.
2.1. The Collection
The
Lichenotheca Veneta [
12] is a collection of
exsiccatae composed of eight files of herbarium sheets, each hosting a specimen with a label, organized into four volumes. The cover of each of the four volumes is made of cardboard with printed title pages. The specimen sheets, which measure 30 × 21 cm, are not bounded, so that they can be taken out without damaging the volume. The sheets were originally collected one by one in paper pockets, which are only partially preserved. The whole collection contains a total of 268 specimens for 74 genera (see
Supplementary Materials S1 and S2). The specimens belong to 197 species and (in terms of varieties and forms) to 119 infraspecific taxa. The author probably intended for this collection to be the first of several, since “Series I” is specified in the title of the volumes. However, no further series was ever published.
The specimens are divided into volumes according to the date of publication: Vol. I, April 1869 (78 specimens); Vol. II, June 1869 (68 specimens); Vol. III, August 1869 (61 specimens); Vol. IV, October 1869 (61 specimens). The specimens are normally glued to the sheets (a typical example is the specimen of
Lobaria macrophylla, a taxon described by Trevisan, shown in
Figure 1), while in some cases they are preserved inside paper envelopes. This is a common practice for taxa growing on a fragile substratum, such as soil, in order to prevent the loss of parts of the specimens because of the fragmentation of the substratum. Each specimen is associated with a printed, typewritten label reporting its scientific name, synonyms, habitat, and—almost always—one or more localities or a general area of collection, as well as notes written by the author.
Given that most of the specimens were collected with their substratum (soil, bark, rock), and thus they are at least partly three-dimensional, their arrangement on the sheets can be very peculiar. Since each volume contains several sheets stacked one over the other, in order to achieve an even height of all the parts of each volume, specimens are fixed on different portions of their sheets, some in the center, others in the corners. Thus, each sheet can host a label and a specimen in different positions, as shown in
Figure 2. This was a common practice in the publication of collections of
exsiccatae. A similar arrangement of the specimens can be found in the contemporaneous
Lichenes Italici Exsiccati by Abramo Massalongo, while in other cases, the issue of three-dimensionality was solved by storing the specimens in volumes divided in small cardboard boxes, one per specimen.
The specimens in the
Lichenotheca Veneta were mostly collected by Trevisan himself in the North-Eastern Italian area, mainly in the modern Veneto and Friuli Venezia Giulia administrative regions and in the province of Mantua. The collection also hosts one specimen from mt. Etna (Sicily, Italy), while four were collected abroad, on the island of Mauritius and in Mexico. However, most of the labels do not report detailed gathering localities, but quite generic information. Twenty-five specimens have no detailed indication of the collection site at all, while the labels of most of the others report generic indications only, often listing more than one geographic area. As an example, for the specimen of
Acrorixis actinostoma Trevis.
trachyticola, the label reports
ad rupes et saxa trachytica in Euganeis, et ad saxa trachytica in moeniis, urbis Patavii, which means
on the trachytic cliffs and rocks in the Euganean Hills, and on the trachytic rocks in the walls of the city of Padua. As a consequence, any effort to geo-reference these localities can be a futile exercise, at least if a minimum degree of precision is required. The gathering localities—as they are described by Trevisan in the
Lichenotheca Veneta—generally have a wide extent. Even if it is possible to place a point in the center of the extent of a locality, whatever its extent is, the deriving uncertainty would be massive, thus making the point unusable for practically any spatial inference. As explained in
Georeferencing Best Practice published in the GBIF [
15], calculating the uncertainty of a geo-reference is a particularly relevant task, as well as a challenging one, for improving the fitness of occurrence data for use, especially if they are used for applications like Species Distribution Modeling (SDM). Thus, in the framework of this digitization effort, geo-referencig was not carried out, and gathering localities are published as they were originally reported by Trevisan. However, the scientific value of
Lichenotheca Veneta is not in the details of the gathering localities and the date of collection of the specimens, but in the rich amount of information, especially as far as nomenclature is concerned, and in the notes of the author, reported on the labels [
12] of several specimens, as shown in
Figure 3.
2.2. Digitization Workflow
Digitization was carried out following a workflow divided into three phases, the first and the third devoted to digital imaging, and the second devoted to the extraction of the metadata from the labels. As in any other digitization workflow of natural history collection specimens [
16], pre-digitization decisions [
17] and curatorial steps were carried out, in order to pave the way for successful digitization work.
The first phase, devoted to the acquisition of panoramic digital images of the whole herbarium sheets, with both the specimen and its label, made use of a Canon 600D full-frame sensor camera (18 Megapixels, APS-C sensor; Canon Inc., Tokyo, Japan), with tethered remote shutter release, equipped with a 17–50 mm Tamron lens (Tamron Co., Ltd., Saitama, Japan), and a macro 105 mm Sigma lens (Sigma Corporation, Kanuma, Japan). The light source for the camera was a custom-built copy stand Kaiser Repro RS 1 with a 1 m columnand a 45 × 50 cm base with RB 218N HF led lighting (Kaiser Fototechnik GmbH, Bielefeld, Germany), arranged at both sides of the specimens with an angle of ca 30°, in order to avoid refraction. A ruler bar and a standard color checker were applied to each specimen before taking the image.
For each specimen, 2 images were taken. Other than the panoramic image of the whole sheet (as in
Figure 1), an image of the label alone was also taken (
Figure 3), which was processed in the second phase of the workflow for metadata extraction.
Images were stored at a resolution of 300 dpi in JPEG format. The digitization guidelines of the Global Plants Initiative (GPI) [
18] recommend that images should be stored in TIFF format. However, given that TIFF files call for a relevant amount of storage, it was decided to use the JPEG format, even if it is a lossy one [
19]. Since this choice could limit the possibility of magnifying some details of a specimen, and especially the reproductive structures, which are quite relevant for lichen identification, it was decided to create a gallery of details, taken with the same camera equipped with a macro 105 mm Sigma lens in the third phase of the digitization workflow.
In the second phase, the operator did not use the specimens, but images of the labels, which are all typewritten. This allowed for the use of any common OCR software for extracting digital text from the images. In this study, the OCR function of the Images App of the MacOS 13 Ventura and MacOS 14 Sonoma was used. After the extraction, texts were reviewed by the operator for thorough quality control, with particular care given to the scientific names. Given the possibility that some misspellings could have arisen, the digital images of the labels were published together with the specimens’ metadata, and are available to the users of the system, thus providing a visual voucher for data quality.
During the second phase, in parallel with the extraction of metadata from the labels, the panoramic images of the specimens were reviewed by a lichenologist. This action aimed to select which details of each specimen would be relevant to be magnified and captured. Furthermore, during this action, a quality check of the panoramic images was carried out as well, in order to decide whether some of them should be taken again because of poor quality or errors in the pre-digitization curation phase.
On the basis of the review action, the third phase of the workflow was carried out, and images of details of the specimens were taken. As a result, after the third phase, a total of 3 to 6 images for each specimens were produced: a panoramic image of the whole sheet, an image of the label alone, and 1 to 4 detail images of the thallus and/or of the reproductive structures. The relationship between specimen metadata and images is thus one to many.
Specimen and image metadata were organized in a MySQL database. The data were arranged in a custom structure. However, each concept corresponds to a concept in the Darwin Core (dwc) standard [
20]. This will allow the data to be interoperable with any platform which adopts this standard. Specimen metadata were organized into the following concepts (corresponding dwc concepts are reported in parentheses):
The UID of the specimen in the catalog of the Museum of Venice (dwc:catalogNumber);
The taxon name (dwc:verbatimIdentification);
Synonyms, which often are quite numerous (dwc:catalogNumber);
The locality of collection, with the date, if present (dwc:verbatimLocality):
Literature (dwc:associatedReferences);
Notes from the author, which can be written either in Italian or in Latin (dwc:occurenceRemarks).
All the metadata transcribed from the labels were reported verbatim as they were originally written by Trevisan.
The images were organized in a different table, in order to allow for the one–many relationship between each specimens and its images. In this case, the correspondence was not only to the simple Darwin Core, but to its Simple Multimedia extension. The concepts are as follows:
The image’s public URL (dc:identifier);
The UID of the specimen in the catalog of the Museum of Venice (dwc:catalogNumber);
The image name (dc:title)
The image type (dc:type).
One–many relationships between specimens and images were granted by the specimen UIDs, which serve as a foreign key in the image table.
3. Results
The digitization of the
Lichenotheca Veneta led to the production of a rich set of metadata and 1173 digital images. The metadata and images are available on an online web portal [
13] accessible by any common web browser. The portal is also fully responsive, in order to be usable on mobile devices. All the contents of the portal are published as open data with a Creative Commons 4.0 CC BY license. Access to the data and images is provided by means of three interfaces: a list of taxa, image galleries, and a query system.
In the first case, users can access a list of all the taxa of the collection ordered alphabetically by scientific name (as it was originally written by Trevisan on the labels). By selecting a taxon from the list, users will load the specimen page, which hosts all the metadata as well as a gallery of digital images (panoramic and details, as in
Figure 4). The owner and license of the images are always stated at the beginning of the gallery.
In the gallery section, users can display either a gallery of the panoramic images of the specimens, or a gallery of the labels. In both cases, the images are ordered alphabetically by taxon name. From each gallery, it is possible to click on the taxon name to access a specimen page (
Figure 4).
The third interface is a query system which operates dynamically on the database. The interface has a single field input form, and allows users to perform queries by entering a text string, which can be made of part of a word, a single word, or two or more words. The query operates on the taxon name, as well as on other concepts among the metadata (synonyms, locality of collection, and notes). Given that the metadata were transcribed verbatim, as they were originally written on the labels by Trevisan, they are rich in Latin terms, as was common in nineteenth century Italian academic language, as well as several archaic Italian words. Thus, to query for “hills”, users must use the Latin word
collibus. It is possible to use blanks inside the query string: as an example, by querying for “ria pulmo”, the system will return the taxon
Lobaria pulmonaria. However, the query allows for exact matches only, since no near-match algorithms are currently implemented in the system. The query system is case-insensitive, and no special characters are allowed (i.e., if the user terminates a string with an “*”, which is commonly used as a wildcard character, the system will use it as a standard character, and no result will be returned). The result of a query is a list of no (in cased of no match) to one-to-many (in cases where some matches are achieved) taxa. The list provides access to specimen pages (
Figure 4), as seen in the first interface.
As further enrichment for the user experience, the web portal provides historical information on the
Lichenotheca Veneta, a biographic note on Vittore Trevisan, and an overview of lichenological research in Italy, extracted from the essay originally published by Nimis in his recent
Annotated Checklist on Italian Lichens [
4].
The web portal is available in Italian and English.
4. Discussion
The mobilization of natural history specimen data by means of digitization is seen as pivotal for research and dissemination in the field of natural sciences [
21,
22]. In fact, making digital specimens available online, by means of web portals and/or global aggregators, such as the Global Biodiversity Information Facility (GBIF, [
23]), has potentially relevant medium- and long-term benefits for the research community and for institutions investing time and money in digitization efforts [
24]. Additionally, since natural history specimens are an extremely relevant source of falsifiable biodiversity data, their mobilization will open up novel perspectives and roles for natural history collections (and for the institutions which host them) now and in the future [
25]. Several digitization efforts are currently being carried out all over the world, aiming at the mobilization of a large amount of data. Italy just started a relevant effort, which involves several major herbaria, starting from the
Herbarium Centrale Italicum (HCI) hosted in Florence [
26]. This is the first relevant (in terms of the number of specimens, ca. 4.2 million) digitization effort in the country, which was lagging behind in terms of digital specimens compared to other European countries [
27].
Collections of
exsiccatae such as the
Lichenotheca Veneta, however, are not only a source of valuable falsifiable data for researchers in the field of natural history. They can also be a relevant resource of information and data in other research fields, such as history of science. Furthermore, they are also a fundamental part of our historical and cultural heritage. Historical natural history collections are in fact not only a memory of our natural heritage, but part of our cultural heritage in its broader sense [
28]. For this reason, their digitization and valorization are challenges that natural history museums (as well as the national and local institutions that fund them) should address as soon as possible, given the ephemeral nature of any biological collection [
29].
In this sense, the digitization and the exhibition of specimens’ metadata and images online also has the positive effect of decreasing the risk of their deterioration [
30]. Since researchers can access the digital specimens, they can avoid (in many cases) the need to physically access the collection. Thus, the risks of the loss or deterioration of portions of or whole specimens is strongly lowered. In fact, physical access to biological specimens always comes with a certain risk of damaging the specimens, which—mostly comprising dried specimens in the case of herbaria—are intrinsically fragile.
Of the few copies of the
Lichenotheca Veneta which were originally published by Trevisan, several were dismembered to store the specimens in other collections. As an example, in the Botanische Staatssammlung München and in the Wisconsin State Herbarium, there exist some specimens which have been removed from their original sheets, together with the labels, stored in envelopes, and accessed together with other specimens from other sources in general collections [
31]. Thus, this digitization effort aimed not only at enhancing the accessibility of what is certainly one of the few—if not the only—complete copy of the
Lichenotheca Veneta, but also at exposing the specimens as they ware originally arranged by the author. According to Lynge [
32], in Italy, at least 12 collections of
exsiccata were produced in the nineteenth century by authors other than V. Trevisan, including M. Anzi (the most active, with several collections), G. De Notaris, F. Baglietto, S. Garovaglio, and A. Massalongo. Most of these collections are still available (complete or dismembered) in Italian Museums or abroad. The most similar to the
Lichenotheca Veneta is probably the
Lichenes Italici Exsiccati by A. Massalongo [
8], published a few years before. The two collections are quite similar in terms of the arrangement of specimens on the sheets (see
Figure 2), but the Masssalongian collection is made of bounded volumes, and thus, it is slightly difficult to page through. Another difference is in the position of the labels, which are one per sheet in the
Lichenotheca Veneta, while they are grouped at the beginning of each volume in the
Lichenes Italici Exsiccati. However, both the collections, as well as all other collections of
exsiccata of the period, had the same role in scholar communication. Neither Trevisan nor Massalongo, however, had the opportunity to contribute to the largest collection of
exsiccata of cryptogams in Italy, the
Erbario Crittogamico Italiano, published in two series [
33,
34] between 1858 and 1885.
The mobilization of data and images of the
Lichenotheca Veneta by means of a web portal [
13], which is accessible by any common web browsers and mobile devices, was carried out in order to potentially reach the wider general public, even if they are interested not specifically in lichens and lichenology, but in the historical and cultural value of this collection, or of natural history collections in general. However, the primary targets of this effort are obviously researchers, who can now access the collection and its specimens, even if in a digital form, thus allowing for inferences which were previously impossible without having physical access to the specimens. As an example, the exhibition of the collection’s data has already generated a short publication [
35], since one of the taxa in the collection had never been reported before for the region of Veneto (
Opegrapha scripta Ach. var.
recta Schaer.). Physical access to the collection is obviously still possible, but on motivated request only, given its historical and cultural relevance, while shipment is not foreseen.
A critical point of the digitization process was the alignment of the scientific names adopted by Trevisan to currently accepted names, such as those in the modern checklist of Italian lichens by Nimis [
4], which is available in ITALIC, the portal to the lichens of Italy [
36], together with a tool for automatically aligning names to their nomenclatural backbones [
37]. Even with the opportunities offered by this novel digital tool, after a first screening, it was decided to avoid carrying out such a challenging effort during the digitization of the
Lichenotheca Veneta. This decision was taken since several of the names used by Trevisan cannot be synonymized to currently accepted names without at least careful morphological and anatomical revision of the specimens. Trevisan had his own taxonomic concept for several taxa, which often conflicted with those of some other contemporaneous lichenologists, such as Abramo Massalongo. The proper delimitation of his taxonomic concepts can thus be clarified in many cases only by means of a thorough revision of the specimens. Furthermore, several scientific names in the
Lichenotheca Veneta are either invalid (as in the case of
Arthonia quercus Trevis., which was previously used by Johann Adam Philipp Hepp in 1862 for another taxon), or, when valid, they are Trevisan’s new combinations proposed for Massalongo’s taxa (as in the case of
Arthoniopsis ruana Trevis., the name Trevisan issued for
Arthonia ruana A. Massal.). However, a complete taxonomic revision of Trevisan’s scientific names was far out of the scope of this work, and it will be the focus of future research. The decision of not aligning taxon names to a taxonomic backbone, together with the difficulty of geo-referencing the localities of collection as they were reported on the labels by Trevisan, hinder the possibility of aggregating the metadata in international aggregators such as the GBIF [
23]; in any case, aggregation is a future objective. However, as discussed before, the relevant value of this collection, other than its historical value, is not in the information on the gathering sites, but rather, in the nomenclatural data and in the observations of the author, as they are reported on the labels. This information is now available online, and researchers worldwide can easily access them.
This effort is part of the activities of the Natural Museum of Venice “Giancarlo Ligabue” to digitize natural history collections and, in particular, historical collections, which led to the recent online publication of another interesting collection, the
Algarium Vatova-Schiffner [
17], and will produce further digital specimens in the future.