RADx-DHT Metadata Mapping
This page explains how to take pre-existing RADx-DHT metadata and convert it into RADx Metadata.
id/doi
{
"id": "https://doi.org/10.57895/me7r-vp06"
}
The RADx-DHT id
/doi
field represents the same information formatted in different ways. We chose to map the value of the id
field to the RADx Metadata Specification Identifier field in the Data File Identity element.
RADx Metadata Field | Value |
---|---|
Identifier | Value of $.id prefixed with "https://doi.org/" .For example, "https://doi.org/10.57895/ME7R-VP06" . |
Identifier Type | "http://vocab.fairdatacollective.org/gdmt/DOI" |
url
{
"url": "https://rapids.ll.mit.edu/10.57895/me7r-vp06"
}
The value of the RADx-DHT url
field should be encoded in a Data File Related Resources element.
RADx Metadata Field | Value |
---|---|
Related Resource Identifier | Value of $.url . |
Related Resource Identifier Type | "http://vocab.fairdatacollective.org/gdmt/URL" |
types
{
"types": {
"ris": "DATA",
"bibtex": "misc",
"citeproc": "dataset",
"schemaOrg": "Dataset",
"resourceType": "Survey",
"resourceTypeGeneral": "Dataset"
}
}
There is no direct equivalent of this field. Types in the RADx-DHT may be recorded in the Auxiliary Metadata element.
creators
The value of the compound creators
field should be encoded in a Data File Creators element. The fields inside this element directly parallel the fields used in the RADx-DHT specification.
Multiple Data File Creators elements may be specified. Use one element per creator.
The JSON below shows a RADx-DHT example of a creators
JSON object.
{
"creators": [
{
"name": "Marcus, Gregory",
"nameType": "Personal",
"givenName": "Gregory",
"familyName": "Marcus",
"affiliation": [
{
"name": "University of California, San Francisco",
"schemeUri": "https://ror.org",
"affiliationIdentifier": "https://ror.org/043mz5j54",
"affiliationIdentifierScheme": "ROR"
}
],
"nameIdentifiers": [
{
"schemeUri": "https://orcid.org",
"nameIdentifier": "https://orcid.org/0000-0001-5197-7696",
"nameIdentifierScheme": "ORCID"
}
]
}
]
}
Fields in a RADx-DHT creators
object should be mapped as follows. Note that,
- the RADx metadata specification only supports one creator affiliation per Data File Creators element.
- the RADx metadata specification only supports one creator identifier per Data File Creators.
Set the value of the Creator Identifier Scheme field to one of the allowable options. If an ORCID has been used to identify the creator then https://orcid.org
as the value for the Creator Identifier Scheme field.
RADx Metadata Field | Value |
---|---|
Creator Name | creators[0].name |
Creator Type | Use http://vocab.fairdatacollective.org/gdmt/Person for Personal , Use http://vocab.fairdatacollective.org/gdmt/Organization for Organizational . |
Creator Given Name | creators[0].givenName |
Creator Family Name | creators[0].familyName |
Creator Affiliation | creators[0].affiliation[0].name |
Creator Affiliation Identifier | creators[0].affiliation[0].affiliationIdentifier |
Creator Affiliation Identifier Scheme | creators[0].affiliation[0].schemeUri |
Creator Identifier Scheme | creators[0].nameIdentifiers[0].schemeUri |
Creator Identifier | creators[0].nameIdentifiers[0].nameIdentifier |
titles
A RADx-DHT titles
JSON object should be encoded as a Data File Titles element. Multiple Data File Titles elements may be specified. Use one element per title.
The JSON below shows a RADx-DHT example of a titles
JSON object.
{
"titles" : [
{
"lang": "en",
"title": "Covidseeker and COVID-19 Citizen Science",
"titleType": null
}
]
}
RADx-DHT object fields should be mapped as follows:
RADx Metadata Field | Value |
---|---|
Title | titles.title |
Language | titles.lang |
The titles.titleType
is not supported in the RADx Metadata. If this needs to be recorded then the Auxiliary Metadata element should be used.
publisher
The RADx-DHT publisher
field should be encoded using fields from the Data File Distributions element.
{
"publisher": "RAPIDS – Rapid AI Platform for Innovating Data Science "
}
RADx Metadata Field | Value |
---|---|
Distribution Publisher | "MIT Lincoln Laboratory" |
Distribution Publisher Identifier | We suggest that this is set to "https://ror.org/022z6jk58" , which identifies MIT Lincoln Laboratory. |
Distribution Publisher Identifier Scheme | "https://ror.org" |
Distribution Identifier | The DOI for the data file |
Distribution Identifier Type | "http://vocab.fairdatacollective.org/gdmt/DOI" |
subjects
The RADx Metadata Specification uses the Medical Subject Headings Thesaurus (MeSH) for specifying the subjects for a data file. We possible subjects should be translated into MeSH terms. See the Data File Subjects element for documentation on encoding subjects.
{
"subjects": [
{
"subject": "FOS: Medical and health sciences",
"valueUri": "http://www.oecd.org/science/inno/38235147.pdf",
"schemeUri": "http://www.oecd.org/science/inno",
"subjectScheme": "Fields of Science and Technology (FOS)"
}
]
}
For subjects that cannot be mapped into MeSH, the keywords free text field should be used.
contributors
The value of the compound contributors
field should be encoded in a Data File Contributors element. The fields inside this element directly parallel the fields used in the RADx-DHT specification.
Multiple Data File Contributors elements may be specified. Use one element per contributor.
The JSON below shows a RADx-DHT example of a contributors
JSON object.
"contributors": [
{
"name": "MIT Lincoln Laboratory",
"nameType": "Organizational",
"givenName": null,
"familyName": null,
"affiliation": [],
"contributorType": "DataCurator",
"nameIdentifiers": [
{
"schemeUri": "https://ror.org",
"nameIdentifier": "https://ror.org/022z6jk58",
"nameIdentifierScheme": "ROR"
}
]
}
]
Fields in a RADx-DHT contributors
object should be mapped as follows. Note that,
- the RADx metadata specification only supports one contributor affiliation per Data File Contributor element.
- the RADx metadata specification only supports one contributor identifier per Data File Contributor element.
If the contributor is a person:
RADx Metadata Field | Value |
---|---|
Contributor Name | contributors[0].name |
Contributor Type | "http://vocab.fairdatacollective.org/gdmt/Person" |
Contributor Given Name | contributors[0].givenName |
Contributor Family Name | contributors[0].familyName |
Contributor Affiliation | contributors[0].affiliation[0].name |
Contributor Affiliation Identifier Scheme | contributors[0].affiliation[0].schemeUri |
Contributor Affiliation Identifier | contributors[0].affiliation[0].affiliationIdentifier |
Contributor Identifier | contributors[0].nameIdentifiers[0].nameIdentifier |
Contributor Identifier Scheme | contributors[0].nameIdentifiers[0].schemeUri If an ORCID has been used to identify the contributor then the value should be "https://orcid.org" . |
If the contributor is an organization:
RADx Metadata Field | Value |
---|---|
Contributor Name | contributors[0].name |
Contributor Type | "http://vocab.fairdatacollective.org/gdmt/Organization" |
Contributor Identifier | contributors[0].nameIdentifiers[0].nameIdentifier |
Contributor Identifier Scheme | contributors[0].nameIdentifiers[0].schemeUri If a Research Organization Registry (ROR) Id has been used to identify the contributing organization then the value should be "https://ror.org" |
dates
The RADx-DHT dates
field should be encoded in a Data File Dates element, except for publication date, which should be encoded in the Data File Publication Date element. There must be exactly one date per element. Dates represent events related to the data file itself. Dates of events related to the funded program should be encoded elsewhere.
publicationYear
The RADx-DHT publicationYear
field should be encoded in a Data File Publication Date field inside a Data File Publication Date. This field can accept precise publication dates, which are preferred over "publication year", however if publication year is the only available metadata then please enter the publication year in yyyy format.
language
The RADx-DHT language
field should be encoded in the Primary Language field contained in a Data File Language element. The encoding of languages uses standard language codes. These codes are directly mappable from the RADx-DHT specification.
identifiers
{
"identifiers": []
}
Please consult with the Stanford Team. These identifiers may be enocded as Auxiliary Metadata.
sizes
The RADx-DHT sizes
field should be encoded using fields from the Data File Distributions element. Values for this field should be merged with the values from the publisher field and formats field. Since RADx metadata pertains to a single file only the size related array element should be encoded.
{
"sizes": [
"6 MB",
"33 Files"
]
}
RADx Metadata Field | Value |
---|---|
Distribution Size | The distribution size in bytes. To convert Megabytes to bytes multiply by 1,048,576. For example, 6MB is 6291456 bytes. |
Since the RADx Metadata Specification applies to single files the "33 Files" value (or any other value) that indicates the number of files is not applicable here.
formats
The RADx-DHT formats
field should be encoded using a field from the Distribution Format element. Values for this field should be merged with the values from the publisher field and sizes field.
{
"formats": [
"csv",
"html",
"pdf",
"xlsx"
]
}
RADx Metadata Field | Value |
---|---|
Distribution Format | A value from the formats list, for example, "csv" . |
version
Assuming that the RADx-DHT version
field describes a version of the data file, it should be encoded in the Version field in the Data File Identity element. If the version
field describes the version of something other than the data file then it should be encoded using the Auxiliary Metadata element.
{
"version": "1.0"
}
rightsList
The Data File Rights element should be used to encode rights for the datafile. We recommend that an identifier for the license is chosen and entered into the License Name field.
If a standardized license is not used or if a license has been customized then the rights/license should be entered into the License Text field.
descriptions
The RADx-DHT descriptions
field should be encoded using a Data File Descriptions element. One element should encode exactly one description.
{
"descriptions": [
{
"lang": "en",
"description": "This study integrated Covidseeker, a tool for real-tine geospatial temporal mobile data for digital contact tracing and COVID-19 hotspotting, into a customized version of the COVID-19 Citizen Science Study. ",
"descriptionType": "Abstract"
}
]
}
RADx Metadata Field | Value |
---|---|
Description | Value of descriptions[0].description |
Description Language | Value of descriptions[0].lang |
geoLocations
The RADx-DHT geoLocations
field should be encoded using the Data File Spatial Coverage element. This element makes it possible to encode locations using bounded boxes/shapes that are described with lat/lon coordinates. Multiple locations can be specified.
If place names are used to specify locations then the Geopolitical Region field that is contained with in a Data File Geopolitical Coverage element should be used to provide an identifier for a place.
funding references
The RADx-DHT funding references
field should be encoded using a Data File Funding Sources element.
The JSON below shows a RADx-DHT example of a funding element.
{
"fundingReferences": [
{
"awardUri": "https://reporter.nih.gov/project-details/10274151",
"awardTitle": "DIGITAL HEALTH SOLUTIONS FOR COVID-19: COVIDSEEKER AND COVID-19 CITIZEN SCIENCE",
"funderName": "National Institutes of Health",
"awardNumber": "75N91020C00039",
"funderIdentifier": "https://doi.org/10.13039/100000002",
"funderIdentifierType": "Crossref Funder ID"
}
]
}
RADx Metadata Field | Value |
---|---|
Award Title | Value of fundingReferences[0].awardTitle |
Award Page URL | Value of fundingReferences[0].awardUri |
Funder Name | Value of fundingReferences[0].funderName |
Award Local Identifier | fundingReferences[0].awardNumber |
Funder Identifier | fundingReferences[0].funderIdentifier . We recommend that a Research Organization Registry identifier is used, if possible. |
Funder Identifier Scheme | Choose one of the available values |
relatedIdentifiers
{
"relatedIdentifiers": []
}
This field may be encoded using the Data File Related Resources element. One element per related identifier should be used. For a given identifier, the Related Resource Identifier field should be used to encode the identifier and the other fields should be used to describe the type of identifier.
schemaVersion
{
"schemaVersion": "http://datacite.org/schema/kernel-4"
}
This field is not applicable to the RADx Metadata Specification. You may however encode it using the Auxiliary Metadata element.
providerId
{
"providerId": "mit"
}
This field is not directly supported in the RADx Metadata Specfification. You may encode it using the Auxiliary Metadata element.
clientId
{
"clientId": "mit.rapids"
}
This field is not directly supported in the RADx Metadata Specfification. You may encode it using the Auxiliary Metadata element.
agency
{
"agency": "datacite"
}
This field is not directly supported in the RADx Metadata Specfification. You may encode it using the Auxiliary Metadata element.
state
{
"state": "draft"
}
This field is not directly supported in the RADx Metadata Specfification. You may encode it using the Auxiliary Metadata element.