Mobilising marine biodiversity data: a new malacological dataset of Italian records (Mollusca)

Occurrence
Latest version published by Museo di Zoologia (MZUR) - Sapienza University of Rome on Dec 6, 2024

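Download the latest version of the resource data as a Darwin Core Archive (DwC-A), or the resource metadata as EML or RTF:

Data as a DwC-A file: download 44,096 records in English (1 MB) - Update frequency: unknown
Metadata as an EML file: download in English (24 KB)
Metadata as an RTF file: download in English (14 KB)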

Description

The location and palaeoceanographic history of the Mediterranean Sea make it a biodiversity hotspot, prompting extensive studies in this region. However, although the marine biodiversity of this area appears to be widely studied, a large amount of distributional data for Mediterranean taxa is still unpublished or scattered across various sources and formats, which severely limits its potential reuse. This is a particularly thorny issue for highly biodiverse and neglected taxa, such as invertebrates. Mobilising these frozen data through standardisation and georeferencing could support biodiversity research and conservation. The aim of this work is to provide a standardised pipeline to integrate these dispersed data, focusing on the Italian waters of the Mediterranean Sea and using molluscs as target taxa. Data were gathered from two main sources: published literature and Natural History Collections. The harmonisation process involved three key steps: 1) terminology and structure standardisation, 2) taxonomy updating and 3) georeferencing. Our efforts yielded over 44,000 standardised records of mollusc species from Italian seawaters. These records encompass primary biodiversity data from newly digitised specimens owned by 11 different institutions and private collectors, as well as secondary biodiversity data extracted from 311 published studies.

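Data Records

The data in this occurrence (observations and specimens) resource have been published as a Darwin Core Archive (DwC-A), a standardised format for sharing biodiversity data as a set of one or more data tables. The core data table contains 44,096 records.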
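As a minimal sketch of how the core data table might be read once the archive has been downloaded, the following uses only the Python standard library. It assumes that the core table is a tab-delimited, UTF-8 file named `occurrence.txt` inside the ZIP; this is a common layout for occurrence archives, but the actual file name and delimiter are declared in the archive's `meta.xml` and should be checked there. The function name `read_core_records` is hypothetical.

```python
import csv
import io
import zipfile


def read_core_records(archive, core_name="occurrence.txt"):
    """Return the core table of a Darwin Core Archive as a list of dicts.

    `archive` is a path or file-like object for the DwC-A ZIP file.
    `core_name` is an assumption: check meta.xml for the real core file name.
    """
    with zipfile.ZipFile(archive) as zf:
        with zf.open(core_name) as raw:
            # Decode the member as text and parse it as tab-separated values.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            reader = csv.DictReader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            return list(reader)
```

Each returned dictionary maps the column headers of the core file to the values of one record, so the length of the list can be compared with the record count stated above.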

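This IPT archives the data and thus serves as the data repository. The data and resource metadata can be downloaded from the download section. The versions table lists the other publicly available versions of the resource and shows the changes made to it over time.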

Versions

The table below shows only the published versions of the resource that are publicly accessible.

How to cite

Researchers should cite this work as follows:

Giannini A (2024). Mobilising marine biodiversity data: a new malacological dataset of Italian records (Mollusca). Version 1.0. Museo di Zoologia (MZUR) - Sapienza University of Rome. Occurrence dataset. https://cloud.gbif.org/eca/resource?r=mzur_sap_zoo_01&v=1.0

Rights

Researchers should respect the following rights statement:

The publisher and rights holder of this work is Museo di Zoologia (MZUR) - Sapienza University of Rome. This work is licensed under a Creative Commons Attribution Non Commercial (CC-BY-NC 4.0) License.

GBIF Registration

This resource has been registered with GBIF and assigned the following GBIF UUID: e0370da7-b32b-414c-8a61-1d86125075f3. Museo di Zoologia (MZUR) - Sapienza University of Rome publishes this resource and is itself registered in GBIF as a data publisher endorsed by the Participant Node Managers Committee.

Keywords

Occurrence; Marine; Mollusca; Italy

Contacts

Arianna Giannini
  • Metadata Provider
  • Originator
  • Point Of Contact
  • PhD Student
Sapienza University of Rome
00185 Roma
RM
IT
Caterina Giovinazzo
  • Curator
Polo Museale Sapienza
00185 Roma
RM
IT

Geographic Coverage

All collected records fall within the Italian Exclusive Economic Zone in the Mediterranean Sea.

Bounding Coordinates South West [35.35, 7.605], North East [45.784, 18.795]
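As a rough illustration of how the bounding coordinates could be used for a quick sanity check, the sketch below tests whether a point in decimal degrees falls inside the stated box. The function name `in_bounding_box` is hypothetical. Note that the box is only a coarse envelope: a point can lie inside it and still be outside the Italian Exclusive Economic Zone, so this does not replace a check against the actual zone boundary.

```python
# Bounding box copied from the coverage statement (decimal degrees).
SOUTH, WEST = 35.35, 7.605
NORTH, EAST = 45.784, 18.795


def in_bounding_box(lat, lon):
    """Return True if the point lies inside or on the edge of the bounding box."""
    return SOUTH <= lat <= NORTH and WEST <= lon <= EAST
```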

Taxonomic Coverage

The dataset includes 44,096 occurrences of 1,513 Italian marine mollusc species.

Class Bivalvia, Monoplacophora, Gastropoda, Scaphopoda, Polyplacophora, Cephalopoda
Order Nudibranchia, Cardiida, Ellobiida, Pectinida, Gadilida, Neopilinida, Chitonida, Carditida, Aplysiida, Lepidopleurida, Sepiida, Dentaliida, Oegopsida, Callochitonida, Myida, Solemyida, Lepetellida, Systellommatophora, Cocculinida, Caenogastropoda incertae sedis, Arcida, Mytilida, Trochida, Nuculanida, Siphonariida, Myopsida, Ostreida, Neogastropoda, Cycloneritida, Umbraculida, Limida, Cephalaspidea, Galeommatida, Venerida, Littorinimorpha, Pleurobranchida, Seguenziida, Runcinida, Adapedonta, Pteropoda, Gastrochaenida, Nuculida, Lucinida, Octopoda

Project Data

No description available

Title THE NATIONAL CHECKLIST OF ITALIAN FAUNA - DEVELOPMENT OF A MODERN DATABASE

Personnel involved in the project:

Marco Oliverio

Sampling Methods

Data were gathered from two main sources: literature and Natural History Collections (NHCs). To collect literature data, a comprehensive search was performed on the public databases Scopus and Web of Science. In addition, we searched journals specialising in Mediterranean marine fauna, namely Iberus and all the volumes of both journals of the Italian Society of Malacology (Società Italiana di Malacologia, SIM): Bollettino Malacologico and Alleryana. No unified standard for Italian molluscan taxonomy and nomenclature existed before the publication of the Checklist of the Italian Fauna, and verifying the accuracy of identifications reported in the literature would have been difficult without directly checking the actual specimens. The literature search was therefore restricted to publications issued after the first edition of the Checklist of the Italian Fauna. Species distribution information published in various formats (e.g. data tables in supplementary materials or within the paper, species lists, statements reporting the occurrence of a species) was considered as potential raw data. Papers with data already published in public databases were excluded. To avoid collecting the same record several times, only papers with new data were considered (i.e. new data derived from dedicated sampling, or older data published for the first time). Data from NHCs were collected by direct request to private collectors and institutions. From both sources, records were included in the dataset if at least the occurrence locality and a taxonomic identification at the genus level (or more specific) were stated.
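The inclusion rule stated at the end of this paragraph can be sketched as a simple predicate. This is only an illustration, not the workflow actually used: it assumes that each record is a dictionary keyed by the Darwin Core terms `locality` and `genus`, and the function name `meets_inclusion_criteria` is hypothetical.

```python
def meets_inclusion_criteria(record):
    """Return True if the record states a locality and at least a genus.

    Assumes Darwin Core style keys; missing, None, or blank values
    are all treated as "not stated".
    """
    locality = (record.get("locality") or "").strip()
    genus = (record.get("genus") or "").strip()
    return bool(locality) and bool(genus)
```

A record identified only to family level, or one with an empty locality, would be rejected by this check.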

Study Extent The present work aims to collect the distributional data of marine mollusc species reported in Italy and to make them usable as point occurrences. To this end, it integrates, through harmonisation and georeferencing, both primary biodiversity data (i.e. newly digitised specimens from public and private Natural History Collections) and secondary biodiversity data (i.e. spatial information on species that is reported in publicly accessible papers but not yet held in databases).

Method step description:

  1. Data were first merged and formatted according to the Darwin Core schema, using the Biodiversity Data Cleaning toolkit package in R.
  2. With the same package, a first filter was applied to remove duplicates and records lacking essential information (i.e. identification or locality/coordinates). The data were then manually filtered to identify and remove records that were out of scope (i.e. occurrences outside the Italian Marine Exclusive Economic Zone, fossils, non-marine species), too vague (i.e. broad locality, specimens identified only above genus level), or dubious (dubious locality, ambiguous and/or unclear identification).
  3. Taxonomy was aligned with that of the World Register of Marine Species (WoRMS Editorial Board 2024) using the LifeWatch taxon-match web service. The WoRMS Life Science Identifier of each valid scientific name was also extracted, so that taxonomic revisions can be traced as far as possible; such revisions are quite common in marine molluscs, especially as a result of molecular evidence.
  4. The remaining dubious taxonomy that was not automatically validated was checked manually and then submitted to experts, which resulted in the removal of further records with dubious identification.
  5. Open Nomenclature qualifiers were used to flag uncertain and provisional taxonomic identifications.
  6. Records were then classified into 7 groups based on the type of geographic information they carried, in order to georeference each group by the most appropriate method. Georeferencing followed the point-radius method, using the GEOLocate web-based collaborative client and QGIS. Each final processed record has associated coordinates expressed in WGS84 decimal degrees and an uncertainty measure in metres. The GEBCO_2022 global terrain model was used to georeference depth data correctly.
  7. During georeferencing, further records falling outside the study boundaries were removed. Records with an uncertainty radius >5000 m were then excluded.
  8. As raw temporal data from NHCs arrived in various formats, this information was handled with the R package lubridate and converted to ISO 8601 format. The temporal information collected spans various degrees of resolution, from exact dates to ranges between years. Records without temporal information were also retained. No hourly data were collected.
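Two of the steps above, the 5000 m uncertainty cut-off and the conversion of dates to ISO 8601, can be sketched in a few lines. The original processing was done in R with lubridate; the Python version below is a stand-in for illustration only. It assumes records are dictionaries using the Darwin Core term `coordinateUncertaintyInMeters`, and the list of input date formats is hypothetical, since the formats actually received from the collections are not listed. The function names are likewise hypothetical.

```python
from datetime import datetime

MAX_UNCERTAINTY_M = 5000  # threshold stated in the method steps

# Hypothetical input formats; the real collection data may use others.
ASSUMED_DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d.%m.%Y")


def within_uncertainty(record):
    """Keep records whose uncertainty radius is at most 5000 m."""
    return float(record["coordinateUncertaintyInMeters"]) <= MAX_UNCERTAINTY_M


def to_iso_date(raw):
    """Return an ISO 8601 date string, or None when no date is given.

    Raises ValueError if the text matches none of the assumed formats.
    """
    if raw is None or not raw.strip():
        return None  # records without temporal information are retained
    for fmt in ASSUMED_DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognised date format: {raw!r}")
```

This sketch handles exact dates only. Coarser resolutions mentioned in the text, such as a single year or a range between years, would need additional handling.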

Additional Metadata