About the data
OpenAlex is more than just a catalog of research publications. We do the work of disambiguating and connecting scholarly works, authors, institutions, sources, and other entities. We then offer the data, and the analytics built on top of it, through three channels, depending on your needs:
- Our friendly web user interface
- A fast, modern REST API to get the data programmatically
- A periodic snapshot of the data, available to download in its entirety, for free
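As a rough illustration of the API channel, here is a minimal sketch of building and sending a request. It assumes the public base URL `https://api.openalex.org` and the `search` and `per-page` query parameters; `build_url` and `fetch` are hypothetical helpers written for this example, not part of any official client, so check the API documentation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed public base URL of the OpenAlex REST API.
BASE_URL = "https://api.openalex.org"


def build_url(entity: str, **params: str) -> str:
    """Build a request URL for an entity list endpoint such as "works".

    Hypothetical helper for illustration only.
    """
    # Python keyword names cannot contain "-", so per_page becomes per-page.
    query = urlencode({key.replace("_", "-"): value for key, value in params.items()})
    return f"{BASE_URL}/{entity}" + (f"?{query}" if query else "")


def fetch(url: str) -> dict:
    """Fetch a URL and decode the JSON response (requires network access)."""
    with urlopen(url, timeout=30) as response:
        return json.load(response)
```

For example, `build_url("works", search="open science", per_page="5")` produces a URL that can be passed to `fetch`. Building the URL separately from sending it keeps the request easy to inspect and to test offline.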
At the heart of OpenAlex is our dataset: a catalog of works. A work is any sort of scholarly output. A research article is one kind of work, but there are others, such as datasets, books, and dissertations. We keep track of these works: their titles (and abstracts and full text in many cases), when they were created, and so on. But that's not all we do. We also keep track of the connections between these works, finding associations through things like journals, authors, institutional affiliations, citations, concepts, and funders. There are hundreds of millions of works out there, and tens of thousands more being created every day, so it's important that we have these relationships to help us make sense of research at a large scale.
OpenAlex aggregates and standardizes data from a whole bunch of other great projects, like a river fed by many tributaries. Our two most important data sources are MAG and Crossref. Other key sources include:
Web crawls
Subject-area and institutional repositories from arXiv to Zenodo and many in between
Works: Scholarly documents like journal articles, books, datasets, and theses
Authors: People who create works
Sources: Where works are hosted (such as journals, conferences, and repositories)
Institutions: Universities and other organizations to which authors claim affiliations
Topics: Topics assigned to works
Publishers: Companies and organizations that distribute works
Funders: Organizations that fund research
Geo: Where things are in the world
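To make the idea of connected entities concrete, the following is a toy sketch of works linked to authors, a source, and each other through citations. The `Work` class, its field names, and the identifiers (`W1`, `A1`, `S1`) are all invented for illustration and are not OpenAlex's actual schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Work:
    """A toy stand-in for a scholarly work; not OpenAlex's real schema."""
    id: str
    title: str
    author_ids: List[str] = field(default_factory=list)
    source_id: Optional[str] = None
    cites: List[str] = field(default_factory=list)  # ids of works this one cites


def cited_by(works: List[Work]) -> Dict[str, List[str]]:
    """Invert the `cites` links: map each work id to the ids of works citing it."""
    incoming: Dict[str, List[str]] = {work.id: [] for work in works}
    for work in works:
        for target in work.cites:
            incoming.setdefault(target, []).append(work.id)
    return incoming


# Made-up example records.
works = [
    Work("W1", "A dataset", author_ids=["A1"]),
    Work("W2", "An article", author_ids=["A1", "A2"], source_id="S1", cites=["W1"]),
    Work("W3", "A dissertation", author_ids=["A2"], cites=["W1", "W2"]),
]
print(cited_by(works))  # {'W1': ['W2', 'W3'], 'W2': ['W3'], 'W3': []}
```

Storing only outgoing citations and deriving incoming ones on demand is one simple design choice; a real catalog at this scale would likely precompute both directions.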