About DataSeer

Uncover. Advise. Verify.

DataSeer fills the urgent need for a low-cost, scalable solution to show researchers how to comply with stakeholder data sharing policies.

DataSeer was conceived by Tim Vines while he was manually checking compliance with the journal Molecular Ecology‘s data sharing policy –instead of going through an article line-by-line for 30 minutes, why not have Artificial Intelligence do the same job in 2 seconds?

DataSeer fills the urgent need for a low-cost, scalable solution to:

a) show researchers what they need to do to comply with data sharing policies, and

b) allow stakeholders to precisely monitor compliance with their data policy.

We’re currently funded by the Sloan foundation

We’ve so far collected training data from over 3000 articles

We’re open source (web app is here, machine learning code is here)

Our Advisory Board

DataSeer draws on a wealth of Open Data expertise


Theo Bloom

Executive Editor, British Medical Journal

Theo has a PhD in developmental cell biology from the University of Cambridge, and moved into publishing as an editor on the biology team at Nature. After a number of years helping to develop Current Biology for Current Science Group and then for Elsevier, Theo was instrumental in the birth of the commercial open access publisher BioMed Central. She joined the non-profit open access publisher Public Library of Science (PLOS) in 2008, initially as chief editor of PLOS Biology. She has been a leader on issues around data access and availability for many years.

Phil Bourne

Founding Dean, School of Data Science, University of Virginia

From 2014-2017, Phil was the Associate Director for Data Science at the National Institutes of Health. In this role he led the Big Data to Knowledge Program, coordinating access to and analyzing biomedical research from across the globe and making it available to scientists and researchers. He has done exceptional work to make biomedical research accessible, as well as to advance the field of data science. Prior to his time at the NIH, Phil spent 20 years on the faculty at the University of California-San Diego, eventually becoming Associate Vice Chancellor of Innovation and Industrial Alliances.

Mercè Crosas

Research Data Management Officer, Harvard University

Mercè is a data technologist and researcher, currently holding two roles at Harvard University, as the University Research Data Management Officer, with Harvard University Information Technology (HUIT), and the Chief Data Science and Technology Officer at Harvard's Institute for Quantitative Social Science. Her career journey has included research in astrophysics, design and implementation of software for astronomical observations, development of learning and data management systems for education and biotechnologies, and now leading software platforms and tools for research data sharing and analysis, applied to all research fields.


Martin Fenner

Technical Director, DataCite

Martin envisions, develops, implements and manages a robust technical architecture for DataCite. Before 2015 he was technical lead for the PLOS Article-Level Metrics project. He co-chairs the Research Data Alliance/FORCE11 Working Group on Source Software Code Identification. Martin has a medical degree from the Free University of Berlin and is a Board-certified medical oncologist.

Iain Hrynaszkiewicz

Publisher, Open Research at Public Library of Science

Iain leads the conceptualisation and development of new products and services that add value to the PLOS portfolio by supporting and enabling open science. Iain was previously Head of Data Publishing at Springer Nature where he developed and implemented research data policies and services, and was publisher of Nature Research Group’s Scientific Data journal. He has also been Outreach Director at Faculty of 1000 (F1000), and spent seven years at the first commercial open access publisher BioMed Central (BMC) in a variety of editorial, publishing and product/policy development roles. Iain is part of several research/publishing community projects related to data sharing and reproducible research.

Daniella Lowenberg

Data Publishing & Data Metrics Product Manager/Dryad Product Manager

Daniella is the Product Manager for Dryad, a global open data publishing platform. She also directs the Make Data Count initiative focused on building the infrastructure for research data metrics. She also educates researchers/research stakeholders within the University of California system and globally on open research data publishing, open data metrics, research data ethics, and sharing of protected human data.


Kristen Ratan

Founder, Strategies for Open Science

Kristen Ratan founded Stratos to synthesize open science efforts into a cohesive movement and to offer pathways to success for open science projects and initiatives. She is a seasoned executive and open science advocate with 20+ years leading transformation in scholarly research and research communication. Kristen has a successful track record creating and driving vision, strategy and technology innovations in research, knowledge production, discovery and access. She Co-Founded the Collaborative Knowledge Foundation (Coko) and was the Publisher at PLOS prior to that.
Jason Roberts

Jason Roberts

Senior Partner, Origin Editorial

After earning a doctorate in Geography from Loughborough University, Jason worked at Blackwell Science in Oxford, UK. He switched to the editorial team and eventually rose to be Senior Editor of US-based medical journals. In 2010 he left Blackwell to found Origin Editorial, offering his journal management expertise to a much wider range of journals. Jason was the founding president of the International Society of Managing and Technical Editors. He works closely with the EQUATOR Network to encourage improved reporting standards among journals, editors, and publishers.