OpenCitations

Download

This page lists all the dumps of the OpenCitations Corpus (OCC), which are created every month and made available online through Figshare. Each dump consists of several zip archives, each containing either the data or the provenance information of a particular sub-dataset of the OCC. After unzipping an archive, use Disk ARchive (DAR), a multi-platform archive tool for managing huge amounts of data, to recreate the whole structure.
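The two-step procedure described above (unzip, then restore with DAR) can be sketched in Python. This is a minimal sketch under stated assumptions, not an official OpenCitations script. It assumes that each zip archive contains DAR slices following DAR's usual `<basename>.<slice number>.dar` naming, and that the `dar` command is installed and on the PATH. The function names and any file names used with them are hypothetical. By default the sketch only builds the `dar -x <basename> -R <target>` commands and returns them, so they can be inspected before anything is restored.

```python
import re
import subprocess
import zipfile
from pathlib import Path

# DAR slices are conventionally named <basename>.<slice number>.dar
SLICE_RE = re.compile(r"^(?P<base>.+)\.\d+\.dar$")


def find_dar_basenames(directory):
    """Return the sorted DAR archive basenames (as paths) found under a directory."""
    basenames = set()
    for path in Path(directory).rglob("*.dar"):
        match = SLICE_RE.match(path.name)
        if match:
            # Keep the parent directory, drop the ".<n>.dar" suffix.
            basenames.add(path.with_name(match.group("base")))
    return sorted(basenames)


def build_dar_command(basename, target_dir):
    """Build the command that restores one DAR archive into target_dir."""
    # -x extracts the archive; -R sets the directory to restore into.
    return ["dar", "-x", str(basename), "-R", str(target_dir)]


def unpack_dump(zip_path, work_dir, target_dir, run=False):
    """Unzip one dump archive and prepare (optionally run) the DAR restore."""
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(work_dir)
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    commands = [
        build_dar_command(basename, target_dir)
        for basename in find_dar_basenames(work_dir)
    ]
    if run:
        # Assumption: the dar executable is available on this machine.
        for command in commands:
            subprocess.run(command, check=True)
    return commands
```

Calling `unpack_dump("some_archive.zip", "work", "restored")` with a hypothetical archive name returns the list of commands without executing them. Passing `run=True` executes them and raises an error if `dar` fails.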

Latest dump

Dump created on December 24, 2016. It includes:

| Type | Archive |
| --- | --- |
| agent roles (ar) | data, provenance |
| bibliographic entries (be) | data, provenance |
| bibliographic resources (br) | data, provenance |
| identifiers (id) | data, provenance |
| responsible agents (ra) | data, provenance |
| resource embodiment (re) | data, provenance |
| corpus | triplestore, provenance |

Dump: November 2016

The November 2016 dump was not submitted for technical reasons.

Dump: October 24, 2016

Dump created on October 24, 2016. It includes:

| Type | Archive |
| --- | --- |
| agent roles (ar) | data, provenance |
| bibliographic entries (be) | data, provenance |
| bibliographic resources (br) | data, provenance |
| identifiers (id) | data, provenance |
| responsible agents (ra) | data, provenance |
| resource embodiment (re) | data, provenance |
| corpus | triplestore, provenance |

Dump: September 24, 2016

Dump created on September 24, 2016. It includes:

| Type | Archive |
| --- | --- |
| agent roles (ar) | data, provenance |
| bibliographic entries (be) | data, provenance |
| bibliographic resources (br) | data, provenance |
| identifiers (id) | data, provenance |
| responsible agents (ra) | data, provenance |
| resource embodiment (re) | data, provenance |
| corpus | triplestore, provenance |