Follow The Grant

Data and investigations about potential conflicts of interest within academic research


For our investigations and data analysis we maintain a comprehensive data catalog containing several datasets related to potential conflicts of interest in scientific research.

These datasets fall into two groups: publicly accessible dumps of scientific articles, from which we extract the conflict of interest statements for each author, and control datasets from sources where authors may have disclosed potential conflicts of interest themselves or via a data aggregator.
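As a rough illustration of extracting such statements, the sketch below pulls conflict of interest footnotes out of a JATS-style article XML. It assumes that statements are tagged as fn elements whose fn-type attribute is "COI-statement" or "conflict"; real articles vary, and this is not necessarily how our pipeline works. The function name and the sample document are hypothetical, and the sketch does not attribute statements to individual authors.

```python
import xml.etree.ElementTree as ET

# Assumption: footnote types that mark a conflict of interest statement.
COI_FN_TYPES = {"coi-statement", "conflict"}


def extract_coi_statements(xml_text):
    """Return the text of every footnote whose fn-type looks like a COI statement."""
    root = ET.fromstring(xml_text)
    statements = []
    for fn in root.iter("fn"):
        fn_type = (fn.get("fn-type") or "").lower()
        if fn_type in COI_FN_TYPES:
            # itertext() flattens nested markup such as <p> or <bold>;
            # split/join collapses runs of whitespace.
            text = " ".join("".join(fn.itertext()).split())
            if text:
                statements.append(text)
    return statements


# Hypothetical sample document, for illustration only.
SAMPLE = """<article><back><fn-group>
<fn fn-type="COI-statement"><p>A.B. received speaker fees from <bold>Example Pharma</bold>.</p></fn>
<fn fn-type="other"><p>Unrelated note.</p></fn>
</fn-group></back></article>"""

statements = extract_coi_statements(SAMPLE)
print(statements)
```

Articles that place the statement elsewhere (for example in a dedicated section or in the author notes) would need additional rules.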

We also include public datasets containing metadata about organizations and companies, which we use to identify such entities when they are mentioned in our collection of conflict of interest statements.
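One simple way to use organization metadata for this purpose is dictionary matching: compile the known names into a single pattern and scan each statement for them. The sketch below assumes a plain list of names; the names, the helper functions, and the sample statement are hypothetical, and a production matcher would also need to handle aliases, abbreviations, and misspellings.

```python
import re


def build_matcher(org_names):
    """Compile one case-insensitive pattern that matches any known name."""
    # Longest names first, so "Example Pharma Inc" wins over "Example Pharma".
    ordered = sorted(set(org_names), key=len, reverse=True)
    alternatives = "|".join(re.escape(name) for name in ordered)
    # The lookarounds prevent matches inside longer words.
    return re.compile(r"(?<!\w)(?:" + alternatives + r")(?!\w)", re.IGNORECASE)


def find_organizations(statement, matcher, canonical):
    """Return canonical names of known organizations, in order of first mention."""
    found = []
    for match in matcher.finditer(statement):
        name = canonical[match.group(0).lower()]
        if name not in found:
            found.append(name)
    return found


# Hypothetical reference data, standing in for an organization registry.
ORG_NAMES = ["Example Pharma", "Example Pharma Inc", "Sample University"]
CANONICAL = {name.lower(): name for name in ORG_NAMES}

matcher = build_matcher(ORG_NAMES)
mentions = find_organizations(
    "A.B. received grants from example pharma inc and is employed by Sample University.",
    matcher,
    CANONICAL,
)
print(mentions)
```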

To allow cross-referencing between all of these datasets, we convert the sources into a common data model called Follow The Money, which was developed by the Organized Crime and Corruption Reporting Project (OCCRP).
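To give a feel for what such a conversion involves, here is a minimal sketch that maps one payment record onto entities shaped like Follow The Money's JSON form (an id, a schema name, and multi-valued properties). It uses plain dictionaries rather than the followthemoney library. The input columns and the helpers make_id, make_entity, and payment_row_to_entities are hypothetical, and the schema and property names (Organization, Person, Payment, payer, beneficiary) should be checked against the official schema before use.

```python
import hashlib


def make_id(*parts):
    """Derive a deterministic ID from identifying parts (hypothetical helper)."""
    joined = "|".join(str(part).strip().lower() for part in parts)
    return hashlib.sha1(joined.encode("utf-8")).hexdigest()


def make_entity(schema, entity_id, **properties):
    """Build an entity dict; every property holds a list of string values."""
    return {
        "id": entity_id,
        "schema": schema,
        "properties": {key: [str(value)] for key, value in properties.items()},
    }


def payment_row_to_entities(row):
    """Turn one payment record into payer, recipient, and payment entities."""
    payer = make_entity(
        "Organization", make_id("org", row["company"]), name=row["company"]
    )
    recipient = make_entity(
        "Person", make_id("person", row["recipient"]), name=row["recipient"]
    )
    payment = make_entity(
        "Payment",
        make_id("payment", row["company"], row["recipient"], row["amount"], row["date"]),
        payer=payer["id"],
        beneficiary=recipient["id"],
        amount=row["amount"],
        currency=row["currency"],
        date=row["date"],
    )
    return [payer, recipient, payment]


# Hypothetical input record, for illustration only.
row = {
    "company": "Example Pharma",
    "recipient": "Dr. Jane Doe",
    "amount": "1500.00",
    "currency": "GBP",
    "date": "2022-03-01",
}
entities = payment_row_to_entities(row)
for entity in entities:
    print(entity["schema"], entity["id"][:8], entity["properties"])
```

Because the IDs are derived from normalized names, the same company appearing in two different sources ends up with the same ID, which is what makes cross-referencing possible in this sketch. Matching on names alone is fragile, so real pipelines usually add stronger identifiers where available.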

Please contact us if you want to collaborate on an investigation.


We have used information from the following data sources:

Last updated: 2023-05-06T11:23:17

PubMed Central Open Access Subset

The PMC Open Access Subset includes millions of journal articles and preprints that are made available under license terms that allow reuse. Not all articles in PMC are available for text mining or other reuse; many are under copyright. Articles in the PMC Open Access Subset are made available under Creative Commons or similar licenses that allow more liberal redistribution and reuse than a traditionally copyrighted work. The PMC Open Access Subset is one part of the PMC Article Datasets.


The National Center for Biotechnology Information

Last updated: 2023-05-06T11:44:29

Europe PMC Open Access Subset

Europe PMC provides comprehensive access to life sciences literature from trusted sources. It's available to anyone, anywhere for free. With Europe PMC you can search and read 41.9 million publications, preprints and other documents enriched with links to supporting data, reviews, protocols, and other relevant resources.

Last updated: 2023-05-06T11:44:36

Europe PMC Preprints Open Access Subset

Preprints are author manuscripts which have not yet been peer-reviewed. The Europe PMC preprints subset contains metadata of 591,553 preprint abstracts and full-text XML of 46,347 COVID-19 preprints which are open access, originally uploaded by the authors to preprint servers.

Last updated: 2023-05-08T00:51:16

COVID-19 Research Project Tracker by UKCDR & GloPID-R

This is a live database of funded research projects across the world related to the current COVID-19 pandemic, as part of the COVID CIRCLE initiative. By providing an overview of research projects mapped against the priorities identified in the WHO Coordinated Global Research Roadmap: 2019 Novel Coronavirus, we support funders and researchers to deliver a more effective and coherent global research response. To our knowledge it is one of the most comprehensive databases, covering a wide breadth of research disciplines.

Last updated: 2023-02-24T19:33:13

CMS OpenPayments

Open Payments is a national transparency program that collects and publishes information about financial relationships between drug and medical device companies (referred to as "reporting entities") and certain health care providers (referred to as "covered recipients"). These relationships may involve payments to providers for things including but not limited to research, meals, travel, gifts or speaking fees.

Last updated: 2023-05-06T14:27:27

Disclosure UK

Disclosure UK is an industry-led initiative to deliver a searchable database that shows payments and benefits in kind made by the pharmaceutical industry to doctors, nurses and other health professionals and organisations in the UK. Disclosure UK is part of a Europe-wide initiative to increase transparency between pharmaceutical companies and the doctors, nurses, pharmacists and other health professionals and organisations they work with.

Last updated: 2023-01-23T15:58:36


We compiled transparency sources across European countries: public registries in countries with registration, and voluntary disclosure documents elsewhere. We extracted the data from these documents, unified the format, and stored everything in a single database.



Last updated: 2023-05-11T15:24:45

Legal Entity Identifier (LEI) Reference Data

A concatenated data file of all entities which have been issued Legal Entity Identifier (LEI) codes.

Last updated: 2023-05-01T00:36:29

Research Organization Registry (ROR)

The Research Organization Registry (ROR) includes IDs and metadata for more than 102,000 organizations and counting. Registry data is CC0 and openly available via a search interface, REST API, and data dump. Registry updates are curated through a community process and released on a rolling basis.

Last updated: 2023-05-10T14:15:54

Global Database of Humanitarian Organisations

GDHO is a global compendium of organisations that provide aid in humanitarian crises. The database includes basic organisational and operational information on these humanitarian providers, which include international non-governmental organisations (grouped by federation), national NGOs that deliver aid within their own borders, UN humanitarian agencies, and the International Red Cross and Red Crescent Movement.