OpenCitations

Download

This page contains details of and links to all the data dumps of the OpenCitations Corpus (OCC), which are created regularly every month, and are made available online by means of the support of Figshare.

Each dump is composed by several zip archives, each containing either data or provenance information relating to a particular sub-dataset within the OCC.

After unzipping an archive, one needs to use Disk ARchive (DAR) - a multi-platform archive tool for managing huge amount of data - to recreate the whole OCC structure.

Most recent OCC data dump - May 2017 OCC Dump

May 2017 OCC Dump

Dump created on May 25, 2017. This dump includes information on:

TypeArchive
agent roles (ar)data, provenance
bibliographic entries (be)data, provenance
bibliographic resources (br)data, provenance
identifiers (id)data, provenance
responsible agents (ra)data, provenance
resource embodiment (re)data, provenance
corpustriplestore, provenance

April 2017 OCC Dump

Dump created on April 26, 2017. This dump includes information on:

TypeArchive
agent roles (ar)data, provenance
bibliographic entries (be)data, provenance
bibliographic resources (br)data, provenance
identifiers (id)data, provenance
responsible agents (ra)data, provenance
resource embodiment (re)data, provenance
corpustriplestore, provenance

March 2017 OCC Dump

Dump not submitted for technical reasons.

February 2017 OCC Dump

Dump not submitted for technical reasons.

January 2017 OCC Dump

Dump not submitted for technical reasons.

December 2016 OCC Dump

Dump created on December 24, 2016. This dump includes information on:

TypeArchive
agent roles (ar)data, provenance
bibliographic entries (be)data, provenance
bibliographic resources (br)data, provenance
identifiers (id)data, provenance
responsible agents (ra)data, provenance
resource embodiment (re)data, provenance
corpustriplestore, provenance

November 2016 OCC Dump

Dump not submitted for technical reasons.

October 2016 OCC Dump

Dump created on October 24, 2016. This dump includes information on:

TypeArchive
agent roles (ar)data, provenance
bibliographic entries (be)data, provenance
bibliographic resources (br)data, provenance
identifiers (id)data, provenance
responsible agents (ra)data, provenance
resource embodiment (re)data, provenance
corpustriplestore, provenance

September 2016 OCC Dump

Dump created on September 24, 2016. This dump includes information on:

TypeArchive
agent roles (ar)data, provenance
bibliographic entries (be)data, provenance
bibliographic resources (br)data, provenance
identifiers (id)data, provenance
responsible agents (ra)data, provenance
resource embodiment (re)data, provenance
corpustriplestore, provenance