Mobilising marine biodiversity data: a new malacological dataset of Italian records (Mollusca)

Occurrence
Dernière version Publié par Museo di Zoologia (MZUR) - Sapienza University of Rome le déc. 6, 2024 Museo di Zoologia (MZUR) - Sapienza University of Rome

Téléchargez la dernière version de la ressource en tant qu'Archive Darwin Core (DwC-A), ou les métadonnées de la ressource au format EML ou RTF :

Données sous forme de fichier DwC-A (zip) télécharger 44 096 enregistrements dans Anglais (1 MB) - Fréquence de mise à jour: inconnue
Métadonnées sous forme de fichier EML télécharger dans Anglais (24 KB)
Métadonnées sous forme de fichier RTF télécharger dans Anglais (14 KB)

Description

The location and palaeoceanographic history of the Mediterranean Sea make it a biodiversity hotspot, prompting extensive studies in this region. However, despite the marine biodiversity of this area is apparently widely studied, a large amount of distributional data for Mediterranean taxa is still unpublished or scattered in various sources and formats, causing severe limitations to their potential reuse. This emerges as a particularly thorny issue for highly biodiverse and neglected taxa, such as invertebrates. The mobilisation of these frozen data through a process of standardisation and georeferencing could potentially support biodiversity research and conservation. The aim of this work is to provide a standardised pipeline to integrate these dispersed data, focusing on the Italian waters of the Mediterranean Sea and using molluscs as target taxa. Data were gathered from two main sources: published literature and Natural History Collections. The harmonisation process involved three key steps: 1) terminology and structure standardisation, 2) taxonomy updating and 3) georeferencing. Our efforts yielded over 44000 standardised records of mollusc species from Italian seawaters. These records encompassed primary biodiversity data from newly digitised specimens owned by 11 different institutions and private collectors, as well as secondary biodiversity data extracted from 311 published studies.

Enregistrements de données

Les données de cette ressource occurrence ont été publiées sous forme d'une Archive Darwin Core (Darwin Core Archive ou DwC-A), le format standard pour partager des données de biodiversité en tant qu'ensemble d'un ou plusieurs tableurs de données. Le tableur de données du cœur de standard (core) contient 44 096 enregistrements.

Cet IPT archive les données et sert donc de dépôt de données. Les données et métadonnées de la ressource sont disponibles pour téléchargement dans la section téléchargements. Le tableau des versions liste les autres versions de chaque ressource rendues disponibles de façon publique et permet de tracer les modifications apportées à la ressource au fil du temps.

Versions

Le tableau ci-dessous n'affiche que les versions publiées de la ressource accessibles publiquement.

Comment citer

Les chercheurs doivent citer cette ressource comme suit:

Giannini A (2024). Mobilising marine biodiversity data: a new malacological dataset of Italian records (Mollusca). Version 1.0. Museo di Zoologia (MZUR) - Sapienza University of Rome. Occurrence dataset. https://cloud.gbif.org/eca/resource?r=mzur_sap_zoo_01&v=1.0

Droits

Les chercheurs doivent respecter la déclaration de droits suivante:

L’éditeur et détenteur des droits de cette ressource est Museo di Zoologia (MZUR) - Sapienza University of Rome. Ce travail est sous licence Creative Commons Attribution Non Commercial (CC-BY-NC) 4.0.

Enregistrement GBIF

Cette ressource a été enregistrée sur le portail GBIF, et possède l'UUID GBIF suivante : e0370da7-b32b-414c-8a61-1d86125075f3.  Museo di Zoologia (MZUR) - Sapienza University of Rome publie cette ressource, et est enregistré dans le GBIF comme éditeur de données avec l'approbation du Participant Node Managers Committee.

Mots-clé

Occurrence; Marine; Mollusca; Italy

Contacts

Arianna Giannini
  • Fournisseur Des Métadonnées
  • Créateur
  • Personne De Contact
  • PhD Student
Sapienza University of Rome
00185 Roma
RM
IT
Caterina Giovinazzo
  • Conservateur
Polo Museale Sapienza
00185 Roma
RM
IT

Couverture géographique

Collected data occurred within the Italian Exclusive Economic Zone in the Mediterranean Sea.

Enveloppe géographique Sud Ouest [35,35, 7,605], Nord Est [45,784, 18,795]

Couverture taxonomique

The dataset includes 44096 occurrences of 1513 Italian marine mollusc species.

Class Bivalvia, Monoplacophora, Gastropoda, Scaphopoda, Polyplacophora, Cephalopoda
Order Nudibranchia, Cardiida, Ellobiida, Pectinida, Gadilida, Neopilinida, Chitonida, Carditida, Aplysiida, Lepidopleurida, Sepiida, Dentaliida, Oegopsida, Callochitonida, Myida, Solemyida, Lepetellida, Systellommatophora, Cocculinida, Caenogastropoda incertae sedis, Arcida, Mytilida, Trochida, Nuculanida, Siphonariida, Myopsida, Ostreida, Neogastropoda, Cycloneritida, Umbraculida, Limida, Cephalaspidea, Galeommatida, Venerida, Littorinimorpha, Pleurobranchida, Seguenziida, Runcinida, Adapedonta, Pteropoda, Gastrochaenida, Nuculida, Lucinida, Octopoda

Données sur le projet

Pas de description disponible

Titre THE NATIONAL CHECKLIST OF ITALIAN FAUNA - DEVELOPMENT OF A MODERN DATABASE

Les personnes impliquées dans le projet:

Marco Oliverio

Méthodes d'échantillonnage

Data were gathered from two main sources: literature and Natural History Collections (NHCs). To collect literature data, a comprehensive search was performed on the public databases Scopus and Web of Science. In addition to this, we also searched data from journals specialised on mediterranean marine fauna, namely Iberus and all the volumes of both journals of the Italian Society of Malacology (Società Italiana di Malacologia, SIM): Bollettino Malacologico and Alleryana. Since until the publication of the Checklist of the Italian Fauna no unified standard existed for Italian molluscan taxonomy and nomenclature - and verifying the accuracy of identifications reported in literature would have been difficult without direct check of the actual specimens - the literature search was restricted to publications issued after the first edition of the Checklist of the Italian Fauna. Species distribution information published in various formats (e.g. data tables in supplementary materials or within the paper, species lists, statements reporting the species occurrence) were considered as potential raw data. Paper with data already published in public databases were excluded. In order to avoid collecting the same record several times, only papers with new data were considered (i.e. new data derived from a dedicated sampling or non-new data published for the first time). Data from NHCs were collected by direct request to private collectors and institutions. From both sources, records were included in the dataset if at least the occurrence locality and a taxonomic identification at the genus level (or more specific) were stated.

Etendue de l'étude The present work aims at collecting and making usable in the form of point-occurrences the distributional data of marine mollusc species reported in Italy, by integrating via harmonisation and georeferencing processes both primary (i.e. newly digitised specimen from public and private Natural History Collections) and secondary biodiversity data (i.e. non databased spatial information of species reported in publicly-accessible papers).

Description des étapes de la méthode:

  1. 1. Firstly, data were merged and formatted in a Darwin Core scheme, using Biodiversity Data Cleaning toolkit package in R. 2. With the same package, a first filter was performed to clean the dataset from duplicates and records lacking essential information (i.e. identification or locality/coordinates). Then, data were manually filtered to retrieve records that were: out of scope (i.e. occurrences outside Italian Marine Exclusive Economic Zone, fossils, non-marine species), too vague (i.e. broad locality, specimens with a higher level of identification than the genus), or dubious (dubious locality, ambiguous and/or unclear identification). 3. Taxonomy was aligned to the one proposed by the World Register of Marine Species (WoRMS Editorial Board 2024) using the taxon-match Life Watch webservice, also extracting the WoRMS Life Science Identifiers for each valid scientific name to trace as far as possible rehashes of taxonomy, which in marine molluscs are quite common, especially through molecular evidence. 4. The remaining dubious taxonomy that was not automatically validated was checked manually and then submitted to experts, which resulted in the removal of other records with dubious identification. 5. Open Nomenclature qualifiers were used to set uncertainty and provisional statuses for taxonomic identifications. 6. Subsequently, records were classified in 7 different groups based on the type of the geographic information they had, in order to georeference them by the most appropriate method. Georeferencing was performed following the point-radius method, using GEOLocate web-based collaborative client and QGIS. Each final processed record has associated coordinates expressed in WGS84 decimal degrees and an uncertainty measure in metres. GEBCO_2022 global terrain model was used to georeference depth data correctly. 7. During the georeferencing process it was possible to remove other data occurred outside study boundaries. We then excluded records with >5000 m of uncertainty radius. 8. As raw temporal data from NHCs arrived in various formats, this information was handled with the R package lubridate and converted to ISO 8601 format. The temporal information collected spans various degrees of resolution, from the exact date to time ranges between years. We decided to mantain also records without temporal information. No hourly data were collected.

Métadonnées additionnelles