These files contain a snapshot of all
public data in the ORCID Registry associated with an ORCID record that
was created or claimed by an individual as of October 1st, 2020. ORCID
publishes this file once per year under a Creative Commons CC0 1.0
Universal public domain dedication. This means that, to the extent
possible under law, ORCID has waived all copyright and related or
neighbouring rights to the Public Data File. For more information on the
file, see https://orcid.org/content/orcid-public-data-file-use-policy

The
file contains the public information associated with each user's ORCID
record. The data is available in XML format and is further divided into
separate files for easier management. One file contains the full record
summary for each record. The rest of the data is divided into 11 files,
which contain the activities for each record, including full work data.
Below is a more complete description of how the data is structured.
Summaries file
Name: ORCID_2020_10_summaries.tar.gz
Description:
Contains all the existing summaries. When extracted, it generates
the following file structure: summaries/[3 digits checksum]/[iD].xml
Example:
If you are looking for the summary of iD '0000-0002-7869-831X',
decompress the file and you will find the summary under
'summaries/31X/0000-0002-7869-831X.xml'.
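The lookup in the example above can be sketched in a few lines of
Python. This is only an illustration: the function name summary_path is
hypothetical, and it assumes that the "[3 digits checksum]" directory is
always the last three characters of the iD, which is what the example
suggests but is not stated explicitly here.

```python
def summary_path(orcid_id: str) -> str:
    """Return the expected path of a summary in the extracted archive."""
    # Assumption: the directory name is the last three characters of the
    # iD, as in the example above (...-831X -> 31X).
    directory = orcid_id[-3:]
    return f"summaries/{directory}/{orcid_id}.xml"


print(summary_path("0000-0002-7869-831X"))
# prints: summaries/31X/0000-0002-7869-831X.xml
```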
Activities files
Named:
- ORCID_2020_10_activites_0.tar.gz
- ORCID_2020_10_activites_1.tar.gz
- ORCID_2020_10_activites_2.tar.gz
- ORCID_2020_10_activites_3.tar.gz
- ORCID_2020_10_activites_4.tar.gz
- ORCID_2020_10_activites_5.tar.gz
- ORCID_2020_10_activites_6.tar.gz
- ORCID_2020_10_activites_7.tar.gz
- ORCID_2020_10_activites_8.tar.gz
- ORCID_2020_10_activites_9.tar.gz
- ORCID_2020_10_activites_X.tar.gz
Description:
Consists of 11 .tar.gz files. Each file contains the public activities
that belong to the iDs with a given checksum. The file hierarchy
is as follows: