Dataverse provides provenance facilities on dataset level and file level. A Dataverse installation accepts provenance information in two forms: a Provenance File or a free-text Provenance Description. Specification of the service can be found at: https://guides.dataverse.org/en/latest/user/dataset-management.html#dataset-versions Dataverse Support for Dataset Versions This annotation schema is used to annotate data as part of the PINK project. It is based on DCAT for basic Accessibility and enhanced with domain and application specific terms. PINK Annotation Schema RO-Crate is a community effort to establish a lightweight approach to packaging research data with their metadata. It is based on schema.org annotations in JSON-LD. An RO-Crate is a structured archive of all the items that contributed to the research outcome, including their identifiers, provenance, relations and annotations. RO-Crate | Research Object Crate ToxTemp is a web-accessible template and database designed for documenting in vitro test methods while integrating different readiness levels and a structured review workflow. It was developed initially under the EU-ToxRisk and developed further under RISK-HUNT3R (both EC-funded Horizon 2020 projects). The test method template (ToxTemp) provides a standardized framework for documenting test methods and assessing their readiness status, guided by a structured questionnaire. By offering a user-friendly approach, ToxTemp helps test developers, especially those less familiar with regulatory requirements to describe their assays in sufficient detail. It also facilitates the comparison of different testing methods, including those targeting the same endpoint. Widespread adoption of ToxTemp could not only support test developers by providing clear documentation guidance but also drive a cultural shift toward more consistent and transparent method documentation. ToxTemp: A Standardized Framework and Database for Documenting and Assessing Readiness of In Vitro Test Methods The OECD Omics Reporting Framework (OORF) was developed to increase transparency in the reporting of Omics data used in regulatory toxicology, and aid regulatory uptake while remaining sufficiently flexible to allow technologies to evolve. It is a harmonized combination of the Transcriptomics Reporting Framework (TRF) and a Metabolomics Reporting Framework (MRF), with four types of reporting modules: (i) The Study Summary Reporting Module (SSRM) provides a high-level overview of the regulatory toxicology and Omics experiment, (ii) The Toxicology Experiment Reporting Module (TERM) reports the key descriptors of the in vivo or in vitro toxicology study, (iii) Data Acquisition and Processing Reporting Modules (DAPRM) report descriptions of the Omics assays, data acquisition and associated data, (iv) Data Analysis Reporting Modules (DARM) describe the statistical analysis that has been undertaken in the Omics study. Guidance documents describe each of the reporting modules in detail alongside associated templates are available via the website. OECD Omics Reporting Framework | OORF NetCDF compliant with Climate and Forecasts (CF) Metadata Convention v1.10 NetCDF CF-1.10 2.3.0 PAV is a lightweight ontology for tracking Provenance, Authoring and Versioning. PAV specializes the W3C provenance ontology PROV-O in order to describe authorship, curation and digital creation of online resources. This ontology describes the defined PAV properties and their usage. Note that PAV does not define any explicit classes or domain/ranges, as every property is meant to be used directly on the described online resource. Provenance, Authoring and Versioning Ontology | PAV NetCDF compliant with Climate and Forecasts (CF) Metadata Convention v1.6 NetCDF CF-1.6 1.6.0 The FDSN StationXML schema defines an XML representation of the most important and commonly-used structures of SEED 2.4 metadata with enhancements. This format is intended primarily for the archival and exchange of seismological time series data and related metadata. The goal is to allow mapping between SEED 2.4 dataless SEED volumes and this schema with as little transformation or loss of information as possible, while at the same time simplifying station metadata representation when possible. Also, content and clarification has been added where lacking in the SEED standard. When definitions and usage are under-defined the SEED manual should be referred to for clarification. International Federation of Digital Seismograph Networks StationXML | FDSN StationXML Metadata schema used for measurements or modelling of air pollutant concentrations. EEA assessment methods metadata schema for air quality Basic annotation schema used in Tripper for handling data and metadata in a knowledge base. Based on DCAT-AP, so compatible with all DCAT-aware services. Tripper Annotation Schema Bioschemas aims to improve data interoperability in the life sciences. It does this by encouraging the use of Schema.org markup within web pages, so that their websites and services contain machine-processable metadata that is consistently structured. This structured information makes it easier to discover, collate and analyse distributed data. As well as extending Schema.org with life sciences specific types, the Bioschemas community has developed a group of specifications that provide guidelines to facilitate a more consistent adoption of Schema.org markup within the life sciences. Bioschemas Deploys This resource describes the flowchart developed within the ITINERIS Project by the DiSSCo community of the Institute of Marine Sciences (CNR-ISMAR) for the management of physical and digital samples. Natural Physical and Digital Specimens Flowchart PDBx/mmCIF is a dictionary of data archiving macromolecule crystallographic experiments and their results. macromolecular Crystallographic Information File | PDBx/mmCIF NetCDF compliant with Climate and Forecasts (CF) Metadata Convention v1.11 NetCDF CF-1.11 NetCDF compliant with Climate and Forecasts (CF) Metadata Convention v1.7 NetCDF CF-1.7 The specification defines the architecture, metadata structure and schema, API and validation rules for a FAIR Data Point. FDPspec | FAIR Data Point specification FAIR-EuMon metadata schema was built to FAIRly describe European biodiversity monitoring schemes across marine, terrestrial and freshwater ecosystems FAIR-EuMon metadata schema for biodiversity monitoring FAIR-EuMon metadata schema was built to FAIRly describe European biodiversity monitoring schemes across marine, terrestrial and freshwater ecosystems FAIR-EuMon metadata schema for biodiversity monitoring DDi Lifecycle is a metadata standard. DDI Lifecycle 3.1 ODM2 is an information model and supporting software ecosystem for feature-based earth observations, designed to facilitate interoperability across scientific disciplines and domain cyberinfrastructures. Observations Data Model 2 | ODM2 DDI-Lifecycle is designed to document and manage data across the entire life cycle, from conceptualization to data publication, analysis and beyond. It encompasses all of the DDI-Codebook specification and extends it. Based on XML Schemas, DDI-Lifecycle is modular and extensible. This version also supports improvements in Classification management (based on GSIM / Neuchatel), non-survey data collection (Measurements), sampling, weighting, questionnaire Design and support for DDI as a Property Graph. DDI Lifecycle 3.3 Metadata Schema Schema.org is a collaborative, community activity with a mission to create, maintain, and promote schemas for structured data on the Internet, on web pages, in email messages, and beyond. Schema.org Schema.org is a collaborative, community activity with a mission to create, maintain, and promote schemas for structured data on the Internet, on web pages, in email messages, and beyond. Schema.org The PREMIS Data Dictionary for Preservation Metadata is the international standard for metadata to support the preservation of digital objects and ensure their long-term usability. Developed by an international team of experts, PREMIS is implemented in digital preservation projects around the world, and support for PREMIS is incorporated into a number of commercial and open-source digital preservation tools and systems. The PREMIS Editorial Committee coordinates revisions and implementation of the standard, which consists of the Data Dictionary, an XML schema, and supporting documentation. PREservation Metadata Implementation Strategies (PREMIS) v3.0 Darwin Core (DwC) is a vocabulary that includes a glossary of terms (in other contexts these might be called properties, elements, fields, columns, attributes, or concepts) intended to facilitate the sharing of information about biological diversity by providing identifiers, labels, and definitions. The Darwin Core is primarily based on the observation, specimen and samples of taxa. The Darwin Core standard is comprised of one vocabulary (the Darwin Core vocabulary), and six associated documents. The vocabulary itself is composed of four term lists: Darwin Core terms borrowed from the Dublin Core legacy and terms namespace, Darwin Core IRI-value Term Analogs, and the core terms defined by Darwin Core. DwC | Darwin Core EML 2.2.0 introduces support for semantic annotations. EML >=2.2.0 | Ecological Metadata Language >=2.2.0 MOD is conceived as an OWL ontology and application profile to capture metadata information for ontologies, vocabularies or semantic resources/artefacts in general. MOD 2.0 is designed as a profile of DCAT 2. MOD | Metadata for Ontology Description and publication Ver 2.0 DDI-CDI is a model-driven, domain- and technology-neutral metadata specification designed to facilitate the combination of data from diverse sources and across disciplines. DDI CDI | Cross-Domain Integration Data Documentation Initiative (DDI) Lifecycle (DDI-Lifecycle, DDI-L) is designed to document and manage data across the entire life cycle, from conceptualization to data publication, analysis and beyond. The freely available international DDI standards describe data that result from observational methods in the social, behavioral, economic, and health sciences. DDI is used to document data in over 80 countries of the world. It encompasses all of the DDI-Codebook specification and extends it. Based on XML Schemas, DDI-Lifecycle is modular and extensible. DDI-Lifecycle 3.3 DDI-Codebook is a more light-weight version of the standard, intended primarily to document simple survey data. Originally DTD-based, DDI-C is also available as an XML Schema. DDI-Codebook|Data Documentation Initiative - Codebook Blockchain technology is a decentralized, distributed ledger that stores the record of ownership of digital assets Blockchain Metadata schema of ambient air used to collect data within Stockholm Convention Global Monitoring Plan Data Warehouse. SCGMPDWH_AIR | Stockholm Convention Global Monitoring Plan Data Warehouse Ambient Air The Minimum Information about a Digital Specimen (MIDS) guidelines, currently in development, specifies the information elements expected to be present when providing access to specimens within a digital framework They are intended to provide a harmonizing framework captured as a TDWG standard that can help clarify levels (depth) of digitization and the minimum information captured and published at each level. This would help to ensure that enough data are captured, curated and published against specific requirements so they are useful for the widest range of possible purposes; as well as making it easier to consistently measure the extent of digitization achieved over time and to set priorities for remaining work. Such a framework would also be beneficial for 'born digital' specimens where digital data is captured from the outset, beginning with the gathering event. MIDS | Minimum Information Standard for Digital Specimens The Dublin Metadata Element Set, which is often called Dublin Core (DC), is a standardized metadata scheme for description of any kind of resource such as documents in electronic and non-electronic form, digital materials (such as video, sound, images, etc) and composite media like web pages. Dublin Core Metadata may be used for multiple purposes, from simple resource description, to combining metadata vocabularies of different metadata standards, to providing interoperability for metadata vocabularies in the Linked Data cloud and Semantic Web implementations. Please note that this version of the specification for the Dublin Core Element Set 1.1 is somewhat out of date, although it is not officially deprecated. The DCMI Metadata Terms specification is linked to this record and is the current documentation that should be used for the Dublin Core Element Set 1.1. DC | Dublin Core This standard allows for a metadata representation of research entities, their activities and interconnections, along with their output. This is captured in highly flexible formal (semantic) relationships, and enables quality maintenance, archiving, access and interchange of research information, so supporting knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors and the general public. CERIF | Common European Research Information Format The WMO Core Profile of the ISO 19115: Geographic Information - Metadata standard is used by the WMO Information System (WIS) to create a catalogue of all information that is made available through the WIS. The current version, 1.3, was approved by Executive Council in May 2013. This profile provides a general definition for directory searches and exchange that should be applicable to a wide variety of WMO data sets. WMO Core Profile | World Meteorological Organization Core Metadata Profile The DataCite Metadata Schema is a list of core metadata properties chosen for accurate and consistent identification of a resource for citation and retrieval purposes, with recommended use instructions in the documentation. The resource that is being identified can be of any kind, but it is typically a dataset. We use the term ‘dataset’ in its broadest sense. We mean it to include not only numerical data, but any other research objects in keeping with DataCite’s mission (https://datacite.org/value.html). The metadata schema properties are presented and described in detail in the section DataCite Metadata Properties in this document. DataCite Metadata Scheme Directory Interchange Format (DIF) is a descriptive and standardized format for exchanging information about geospatial data sets. It includes elements focusing on instruments that capture data, temporal and spatial characteristics of the data, and projects with which the dataset is associated. From its inception in the late 1980's as a way to document and exchange information on scientific data to its implementation in NASA's Global Change Master Directory (GCMD), the DIF has evolved to serve the user community in the discovery, access and use of Earth science and related data. GCMD metadata records are now maintained in the Common Metadata Repository (CMR), and the DIF is one of several supported formats for submitting metadata to CMR. The DIF, the ECHO metadata model, and the ISO 19115 standards for science metadata formed the basis for NASA's Unified Metadata Model (UMM) used by CMR. DIF | Directory Interchange Format The Ecological Metadata Language (EML) metadata standard was originally developed for the earth, environmental and ecological sciences. It is based on prior work done by the Ecological Society of America and associated efforts. It has been developed to document any research data, and as such can be used outside of these original subject areas. EML is implemented as a series of XML document types that can by used in a modular and extensible manner to document ecological data. Each EML module is designed to describe one logical part of the total metadata that should be included with any ecological dataset. EML | Ecological Metadata Language OpenDS is a specification of Digital Specimen and other related object type definitions essential to mass digitization of natural science collections and their digital use in a new generation of infrastructure and applications. For the principal digital object types corresponding to major categories and specimen data. openDS defines the structure and content of each object type, and the operations that can act upon them. openDS | Open Digital Specimen The PROV Ontology (PROV-O) expresses the PROV Data Model using the OWL2 Web Ontology Language (OWL2). It is intended for the Linked Data and Semantic Web community. It provides a set of classes, properties, and restrictions that can be used to represent and interchange provenance information generated in different systems and under different contexts. It can also be specialized to create new classes and properties to model provenance information for different applications and domains. PROV-O is one serialization of PROV-DM, the other two being PROV-N and PROV-XML. PROV-DM and PROV-O define how to represent provenance on the World Wide Web, and as such additional documentation has been included in this record for PROV-AQ (Access and Query), a note which describes how standard web protocols may be used to locate, retrieve and query provenance records. PROV-DC provides a mapping from Dublin Core to PROV-O, and is listed in this record. For the purpose of this specification, provenance is defined as a record that describes the people, institutions, entities, and activities involved in producing, influencing, or delivering a piece of data or a thing. In particular, the provenance of information is crucial in deciding whether information is to be trusted, how it should be integrated with other diverse information sources, and how to give credit to its originators when reusing it. In an open and inclusive environment such as the Web, where users find information that is often contradictory or questionable, provenance can help those users to make trust judgements. PROV-O | W3C PROV Ontology The Sensor Model Language (SensorML) aims to provide a robust and semantically-tied means of defining processes and processing components associated with the measurement and post-measurement transformation of observations. This includes sensors and actuators as well as computational processes applied pre- and post- measurement. The main objective is to enable interoperability, first at the syntactic level and later at the semantic level (by using ontologies and semantic mediation), so that sensors and processes can be better understood by machines, utilized automatically in complex workflows, and easily shared between intelligent sensor web nodes. This standard is one of several implementation standards produced under OGC's Sensor Web Enablement (SWE) activity. OGC SensorML | Open Geospatial Consortium Sensor Model Language ISO 19139 defines XML based encoding rules for conceptual schemas specifying types that describe geographic resources. The encoding rules support the UML profile as used in the UML models commonly used in the standards developed by ISO/TC 211. The encoding rules use XML schema for the output data structure schema. The encoding rules described in this document are not applicable for encoding UML application schema for geographic features (see ISO 19136 for those rules). ISO 19139 | Geographic information - Metadata - XML schema implementation ISO 19115 defines the schema required for describing geographic information and services by means of metadata. It provides information about the identification, the extent, the quality, the spatial and temporal aspects, the content, the spatial reference, the portrayal, distribution, and other properties of digital geographic data and services. ISO 19115 | Geographic information - Metadata Crossref is a registration agency of the International DOI Foundation. Crossref provides a mechanism for identifying and describing research objects (books and chapters, components, conference proceedings, datasets, dissertations, grants, journals and articles, peer reviews, pending publications, posted content (includes preprints), reports and working papers, and standards). It follows the ISO/IEC 11179 Metadata Registry (MDR) standard, which specifies a schema for recording both the meaning and technical structure of the data for unambiguous usage by humans and computers. CrossRef uses a single deposit schema stored as XML, which supports a range of different content types and provides a structure and set of rules to keep everything consistent and interoperable. Crossref (DOI) DDI-Lifecycle is designed to document and manage data across the entire life cycle, from conceptualization to data publication, analysis and beyond. It encompasses all of the DDI-Codebook specification and extends it. Based on XML Schemas, DDI-Lifecycle is modular and extensible. This version also supports improvements in Classification management (based on GSIM / Neuchatel), non-survey data collection (Measurements), sampling, weighting, questionnaire Design and support for DDI as a Property Graph. DDI Lifecycle 3.2 Metadata Schema This is the strategy SALURBAL is developing to implement FAIR principles into the data portal SALURBAL FAIR strategy The American Standard Code for Information Interchange (ASCII) is a standard table of seven-bit designations for digital representation of uppercase and lowercase Roman letters, numbers and special control characters in teletype, computer and word processor systems. American Standard Code for Information Interchange Crystallographic Information File or Framework CIF Test Metadata schema contained in a catalog Schema.org Dataset The theme scope includes two main aspects; the first is the environmental monitoring facility as a spatial object, the second is the data obtained through observations and measurements taken at this facility, encoded using the ISO 19156 standard. INSPIRE EMF | Infrastructure for Spatial Information in the European Community Environmental Monitoring Facilities LifeWatch core ontology under development for LifeWatch ERIC LW_CO | LifeWatch core ontology The ontology that describes the ICOS metadata and data. ICOS Ontology | Integrated Carbon Observation System Ontology The GBIF Metadata Profile is primarily based on the Ecological Metadata Language (EML)6 . The GBIF profile utilises a subset of EML and extends it to include additional requirements that are not accommodated in the EML specification. EML GBIF Profile Metadata | Ecological Metadata Language Global Biodiversity Information Facility Profile Metadata DwC-A is a biodiversity informatics data standard that makes use of the Darwin Core terms to produce a single, self contained dataset for sharing species-level (taxonomic), species-occurrence data, and sampling-event data. An archive is a set of text files, in standard comma- or tab-delimited format, with a simple descriptor file (called meta.xml) to inform others how the files are organized. DwC-A | Darwin Core Archive DCAT is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. This document defines the schema and provides examples for its use. DCAT | Data Catalog Vocabulary Version 3 DCAT-AP is a specification based on the Data Catalogue Vocabulary (DCAT) developed by W3C and provides a common specification for describing public sector datasets in Europe to enable the exchange of descriptions of datasets among data portals. DCAT-AP | Data Catalog Vocabulary Application Profile for Data Portals in Europe CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers hundreds of data portals worldwide. CKAN | Comprehensive Knowledge Archive Network Darwin Core schema Darwin Core JSON schema for record metadata of the Materials Cloud Archive MC-1.0.0|Materials Cloud Archive Record Schema v1.0.0 Extensive provenance graph of computational workflows executed via AiiDA, including inputs, outputs and metadata of all calculations. AiiDA archive format|Automated Interactive Infrastructure and Database for Computational Science archive format Ontology of AiiDA provenance graphs AiiDA Ontology|Automated Interactive Infrastructure and Database for Computational Science Ontology Descriptive documentation of the content, meaning, provenance, and access for a single data set or series Data Documentation Initiative - Data Documentation 3.3 this is a 2nd publication that can be used for testing, created Jan 4 2022 test publication 2 Ecological Metadata Language EML GBIF Profile JSON schema for record metadata of the Materials Cloud Archive Materials Cloud Archive Record Schema v1.0.0 Extensive provenance graph of computational workflows executed via AiiDA, including inputs, outputs and metadata of all calculations. AiiDA archive format Specification defining the information elements expected to be present when publishing digitized information about specimens at various levels of digitization. Minimum Information Standard for Digital Specimens The PROV standard defines a data model, serializations, and definitions to support the interchange of provenance information on the Web PROV