Popis polí data – eng

METADATA – fields

Tag Field description Mandatory Repeatable field
200$a Title of dataset mandatory N
541$a Title in English optional N
200$e Alternative title optional Y
700, 701 Author mandatory Y
608$a Data file type mandatory Y
101$a Language mandatory Y
C15$a Dataset files description mandatory N
C15$b Dataset  files description in English optional N
327$a Technical informations optional N
856$u Link optional Y
C76$u Link to datasets stored in external repository conditionally mandatory Y
C76$z Name of the repository conditionally mandatory N
U88$3 Link to bibliographic records in the ASEP repository conditionally mandatory Y
U01 Link to publications stored in external repository optional Y
C12a Funding conditionally mandatory Y
C26x Research area – FORD category mandatory Y
C57$a Institutional funding conditionally mandatory Y
610$a Keywords mandatory Y
617$d Localization – city optional Y
617$a Localization – country optional Y
U22$a,b Time period optional N
C81$b License CC conditionally mandatory N
C81$c Other open license conditionally mandatory N
C81$e Custom license/file conditionally mandatory N
C81$f License note optional N
C81$a Access to the dataset mandatory N
C81$d Embargo end date conditionally mandatory N
C60$a Handle assigned by the system N
001 System number assigned by the system N
Date issued assigned by the system N
300$a Note optional N
C26$g Administrator note optional N
Agreement with publication in the ASEP mandatory N
U03c Software version conditionally mandatory N
U03d Dataset change description conditionally mandatory N

200$a Title of dataset

The field is mandatory.

The field contains the name of the data file that adequately describes the stored data.

Example 1:

Morphologically Constrained and Data Informed Cell Segmentation of Budding Yeast

Example 2:

Noisy reverberant speech database for training speech enhancement algorithms and TTS models

Example 3:

Fitr: A Toolbox for Computational Psychiatry Research


200 $e – Alternative title

The field is optional and repeatable.

The field’s content is an additional file name that will enhance the information about the files listed in the main title, e.g., the title in another language.

Example 1:

A software package for fitting computational models to behavioural data.

Example 2:

A cross-national macro analysis of the influence of home broadband access.


541a – Title in English

The field is conditionally mandatory.

The field contains the translation of the name of the data file into English. The field has to be filled in if the text is in a language other than English.


700,701 – Author

The field is mandatory and repeatable.

The field contains the surnames and first names of the authors. In the data record, the author who is not from the CAS will be listed with his/her full name and persistent identifiers ORCID and ROR. For the correct entry of the author, we use the author selection from the register (the country on the right next to the author’s surname), only if the author is not listed in the register, we fill in the surname, first name and workplace/country of the author. All authors of the dataset are listed.

Example:

Author from CAS: Lhoták, Martin:KNAV-K


608$a – Data file type

This field is mandatory and repeatable. The field contains the type of data files to be selected from the list. Select the generic „dataset“ type if it cannot be specified.


101 $a – Language

This field is mandatory and repeatable. The field contains the code of the language in which the dataset documentation is written. It can be selected from a drop-down list, multiple languages can be specified.

An overview of the language codes is in the table.


C15 $a – Dataset files description

This field is mandatory.

This field contains a summary, annotation, and abstract, which shows what the files are related to and what files the dataset contains. A clear description is used to help users orient and understand, or to cite the stored datasets.

A description of the particular dataset and the structure of the directory should be added. If these are described in a separate documentation file in the dataset (e.g., documentation.txt, readme.txt, ctimne.txt), please mention this file.

Example:

This archive contains drivers and configurations for the PALM model validation study done with the observation campaign done in Prague-Dejvice in 2018 and described in the GMD paper https://gmd.copernicus.org/preprints/gmd-2020-175/. The input drivers follow the PALM input data standard which is described on the model website (https://palm.muk.uni-hannover.de/trac/wiki/doc/app/iofiles/pids). The simulations are configured with two nested domains, the files for the parent domain are without a suffix, and the child domain-related files have a suffix „_N02“. The static input files (*static*) contain all static information, such as topography, geographical coordinates, and surface and vegetation information. The dynamic input files (*dynamic*) contain information on the initial state of the atmosphere and on time-dependent boundary conditions. The chemistry input files (*emission*) contain information on temporally and spatially dependent emission of chemical species. The *p3d* files contain the configuration of the model.


C15 $b Dataset file descriptions in English

This field is conditionally mandatory.

The field contains the translation of the file description in English if the description is in Czech or another language.


327 $a Technical informations

This field contains information about the formats of the files to be saved and the options for running them. It is described in which program the files can be run, or whether something else is needed for access.

Example:
The dataset contains 3D models in .phy format. The files can be launched with Bioedit, DNASP, or MEGAZ.


C26x Research area – FORD Category

Scientific subject area according to the OECD subject group codebook – Frascati Manual 2015.
The field is repeatable and mandatory for all types of publications.

The content of the field is the code from the discipline codebook valid for CEP and RIV published in the OECD discipline codebook on the ISVaVaI website according to the OECD – Frascati Manual 2015.

List of OECD categories

OECD / RIV Subject categories Converter (PDF)

OECD / RIV Subject Area Converter (XLS)


856$u Link

This field is optional and repeatable.

The field contains links (URI/URL/DOI/HANDLE) to files that are related to the data.


C76$u  Link to datasets stored in the external repository

The field is conditionally mandatory and repeatable.

The field contains links (URL/DOI/HANDLE) to data that will not be stored in the ASEP data repository but are stored in some other repositories, e.g. a subject repository. Only a metadata record describing these data with a link to the data repository, where the datasets are stored will be stored in ASEP.


C76$z Name of the repository

The field is conditionally mandatory.
The content of the field is the name of the repository, where the data is stored.
Example: Zenodo


U01$u  Link to publications stored in the external repository

The field is optional.

The field contains links to publications stored in other databases, which were created based on the data in the dataset.

Example:

https://zenodo.org/record/375521#.WOzpvp6kKUl

The field is repeatable.

The contents of the field are links (URI/URL/DOI/HANDLE) to files that are related to the data.


U88$3 Link to bibliographic records in the ASEP repository

The field is conditionally mandatory.

The content of the field is linked to publications stored in ASEP that were created based on the data in the data file.

Example: https://hdl.handle.net/11104/0354828


617 $d – Localization – city

This field is optional and repeatable.

The field contains the countries and places (cities) to which the results in the data files are related.

Example: Prague


617 $a –Localization – country

The field is optional and repeatable.

The field contains the countries to which the results in the datasets are related.

A summary of the country codes is provided in the table.


U22 $a – Time period

The field is optional.

The content of the field is the period to which the data is related. The date is selected from the calendar.


610 $a – Keywords

The field is mandatory and repeatable.

The content of the field is a keyword in English (or a multi-word keyphrase) describing as precisely as possible the factual content of the stored data.
It shall be entered as plain English text in lower case unless the spelling requires the use of capital letters (e.g. proper names).
Any number of words may be included.
Each keyword shall be entered in a separate field.

Example:

nanoribbons, graphene, ultrafast photoconductivity, plasmon, terahert

 


License

The content of the field is a license setup that provides users with information about how they can use the files.

C81$b Creative Commons License (CC)

CC licenses can be used for open access data. If data files are not in the public domain, they cannot be released under a Creative Commons license. Here is the CC License Chooser application which can advise you on choosing a CC license.

In ASEP it is possible to select from the following Creative Commons version 4.0 licenses in the drop-down box.

In comparison to the previous versions, Creative Commons 4.0 is no longer adapted to national legislation so there is only a single version of the licenses and the translation should be purely linguistic.

 

Attribution BY
Attribution-NoDerivatives BY-ND
Attribution-Sharealike BY-SA
Attribution-NonCommercial BY-NC
Attribution-NonCommercial-NoDerivatives BY-NC-ND
Attribution-NonCommercial-Sharealike BY-NC-SA

C81$c Other open license

If there is a proprietary license, the agreement must be accompanied by the text of such license, which sets the rules for the use of the file.

Examples (see SPDX, Open Data Commons list of licenses),


C81$e Custom license/file

The field is checked if the data is published under a license other than Creative Commons. The license text file must be saved in pdf format together with the data files. The file must be named licence.pdf.


C81$f License note

The contents of the field are additional information about the license. The field is optional.


C81$a Access to the dataset

The field is mandatory. The field contains the access to the dataset.

On request – the file is available on request. The file cannot be on request if it is published under a Creative Commons license.

Open acces for an institute – The file is available from the IP address of the institute.

Open access with embargo – during the embargo period the file is on demand, after the embargo expires it is immediately downloadable (Open Access).

Open acces means that the file is immediately available for download (Open Access).

 


C81d Embargo end date

The field is conditionally mandatory.
The content of the field is the date that is selected from the calendar. The field shall be filled in if the dataset will be publicly accessible to the user after a specified period. The field should be filled with the end date of the embargo, after which the dataset will be automatically available.


C$60a Handle

The field contains the handle persistent identifier. It is automatically assigned by the system. The link to the handle is functional the next day.

Example:

http://hdl.handle.net/11104/0324459


C12$a  Funding agency

The field contains the project from which the dataset was funded. The beneficiary or co-beneficiary of the project must be the author/institution of the CAS. For European projects, the information will be harvested into OpenAIRE.

CEP projects

Funding is provided by domestic agencies from the budget and recorded by the Research and Development Council. The project numbers and providers are listed in the CEP database: https://www.isvavai.cz/cep

EU projects
The dataset was created with the funding support of European Union projects (e.g. FP7, H2020)
The project numbers and the EU providers are listed in the CORDIS database – http://cordis.europa.eu/projects/home_en.html
$a Project number
$o EU provider
$e Country = XE


C57a Institutional funding

The field is conditionally mandatory and repeatable.
The field contains the institutional funding under which the dataset was created. The field is repeatable, in case of collaboration, it has to be selected for each institute separately. It is selected from a predefined list.


300$a – Note

The field is optional.
The field contains additional data to the record that cannot be put into predefined metadata fields.


C26g Administrator note

The field is optional.
The field contains information for the processor responsible for processing records in the institute from the person who created the record (e.g. a researcher).


Agreement with publication in the ASEP online catalogue

The field is mandatory.
By checking „agreement“ the author submits the data record(s) to the ASEP administrator for formal checking. Once saved, the record can no longer be accessed. If the data are correct, the processor will publish the record in the online catalogue.
Link to the text of the agreement.

U03c Dataset change description

This field is conditionally mandatory if a new version of the dataset is being created. The content of the field is a description of the changes in the stored files.

Example:

The dataset.csv file has been added

U03d Software version

The content of the field is the serial number of the dataset. The field contains the software version number.

Example:

Kramerius TEI converter 1.0