Download

This page contains details of and links to all the data dumps of the OpenCitations Meta and OpenCitations Index. They are made available online by means of the support of Figshare and of the Internet Archive.

OpenCitations Meta

The OpenCitations Meta database stores and delivers bibliographic metadata for all publications involved in the OpenCitations Index.

Most recent OpenCitations Meta data dump - February 2025 Dump

This dataset's dump, released on 2025-02-13, enhances its previous version by incorporating new data from the Crossref dump available at Crossref November 2024 Dump, as well as the November 2024 dump of JaLC (Japan Link Center). This dump includes information on:

Type and formatArchiveSize
Metadata (CSV)tar12G (48G zipped) on ext4
Metadata and provenance (RDF)tar.gz47G (145G compressed) on ext4

In addition:

Type and formatArchiveSize
A CSV dump storing a mapping between all OMIDs and their corresponding PID(s) (e.g., DOI, ORCID, PMID, etc)ZIP6.5 GB (1.5 GB zipped)
Previous dumps

OpenCitations Index

The OpenCitations Index stores OMID-to-OMID references representing all the references gathered from several sources.

Most recent OpenCitations Index data dump - March 2025 Dump

Dump created on 2025-03-24. Compared to the previous dump, this one adds the citation data contained in the Crossref dump dated November 2024. This dump includes information on:

Type and formatArchiveSize
Citation data (CSV)ZIP220 GB (34.4 GB zipped)
Citation data (N-Triple)ZIP1.9 TB (80.6 GB zipped)
Citation data (Scholix)ZIP1.9 TB (40 GB zipped)
Provenance data (CSV)ZIP410 GB (18 GB zipped)
Provenance data (N-Triple)ZIP3.1 TB (95 GB zipped)

In addition:

Type and formatArchiveSize
Citation data sources' info (N-Triple): information regarding the data source collection (e.g., COCI, DOCI, POCI, etc) of all the citation dataZIP388 GB (23.7 GB zipped)
Citation data sources' info (CSV): information regarding the data source collection (e.g., COCI, DOCI, POCI, etc) of all the citation dataZIP97 GB (21 GB zipped)
Citation count data (CSV): the number of incoming citations to each bibliographic entity (identified by an OMID) in OpenCitations IndexTBA
Previous dumps