About DataSeer

Uncover. Advise. Verify.

DataSeer fills the urgent need for a low-cost, scalable solution to show researchers how to comply with stakeholder data sharing policies.

DataSeer was conceived by Tim Vines while he was manually checking compliance with the journal Molecular Ecology‘s data sharing policy –instead of going through an article line-by-line for 30 minutes, why not have Artificial Intelligence do the same job in 2 seconds?

DataSeer fills the urgent need for a low-cost, scalable solution to:

a) show researchers what they need to do to comply with data sharing policies, and

b) allow stakeholders to precisely monitor compliance with their data policy.

DataSeer was started with funding from the Sloan foundation

We’ve so far collected training data from over 3000 articles

We’re open source (web app is here, machine learning code is here)

Our PeoplE

DataSeer personnel

Tim Vines, PhD

Founder and Director

Tim Vines is the Founder and Project Lead on DataSeer. Prior to that he founded Axios Review, an independent peer review company that helped authors find journals that wanted their paper. He was the Managing Editor for the journal Molecular Ecology for eight years, where he led their adoption of data sharing and numerous other initiatives. He writes for the industry-leading Scholarly Kitchen blog, and has published research papers on peer review, data sharing, and reproducibility (including one that was covered by Vanity Fair). He has a PhD in evolutionary ecology from the University of Edinburgh and now lives in Vancouver, Canada.

Kristen Ratan

Strategy Consultant

Kristen Ratan is a seasoned executive and open science advocate with 20+ years leading transformation in scholarly research and research communication. Kristen has a successful track record creating and driving vision, strategy and technology innovations in research, knowledge production, discovery and access. She co-founded the Collaborative Knowledge Foundation (Coko) and was Publisher at PLOS prior to that.

Souad McIntosh

Operations Manager

Souad is the Operations Manager and a Data Curator at DataSeer. She  is our main point of client contact and oversees the various applications of our AI. Souad is passionate about efficiency, auditing, and especially Open Data. Previously, Souad was employed in the restaurant and customer service industries. She holds a bachelor’s degree in biology from Simon Fraser University and lives in Vancouver, Canada.

DataSeer is an open-source project developed with the support of the Sloan Foundation.

Our Advisory Board

DataSeer draws on a wealth of Open Data expertise

Nokome Bentley

Founder, Stencila

Nokome was originally a marine scientist with over twenty years experience in research for sustainable management of marine resources. He is the Founder of Stencila, which produces living documents that link tables and plots to the underlying datasets. It serves a platform for reproducible research that bridges the gap in collaboration between coders and non-coders. Nokome is based in Aotearoa New Zealand.

Theo Bloom

Executive Editor, BMJ

Theo has a PhD in developmental cell biology from the University of Cambridge, and moved into publishing as an editor on the biology team at Nature. After a number of years helping to develop Current Biology for Current Science Group and then for Elsevier, Theo was instrumental in the birth of the commercial open access publisher BioMed Central. She joined the non-profit open access publisher Public Library of Science (PLOS) in 2008, initially as chief editor of PLOS Biology. She has been a leader on issues around data access and availability for many years.

Phil Bourne

Founding Dean, School of Data Science, University of Virginia

From 2014-2017, Phil was the Associate Director for Data Science at the National Institutes of Health. In this role he led the Big Data to Knowledge Program, coordinating access to and analyzing biomedical research from across the globe and making it available to scientists and researchers. He has done exceptional work to make biomedical research accessible, as well as to advance the field of data science. Prior to his time at the NIH, Phil spent 20 years on the faculty at the University of California-San Diego, eventually becoming Associate Vice Chancellor of Innovation and Industrial Alliances.

Erin Clary

Curation Coordinator, Portage Network

Erin is the Curation Coordinator for the Portage Network, an initiative of the Canadian Association of Research Libraries, now supported by Canada’s New Digital Research Infrastructure Organization (NDRIO). She reviews new dataset deposits for the Federated Research Data Repository (FRDR), and as a member of Portage’s Curation Expert Group, she is engaged in developing resources to support a national curation community of practice.

Mercè Crosas

Research Data Management Officer, Harvard University

Mercè is a data technologist and researcher, currently holding two roles at Harvard University, as the University Research Data Management Officer, with Harvard University Information Technology (HUIT), and the Chief Data Science and Technology Officer at Harvard’s Institute for Quantitative Social Science. Her career journey has included research in astrophysics, design and implementation of software for astronomical observations, development of learning and data management systems for education and biotechnologies, and now leading software platforms and tools for research data sharing and analysis, applied to all research fields.

Martin Fenner

Technical Director, DataCite

Martin envisions, develops, implements and manages a robust technical architecture for DataCite. Before 2015 he was technical lead for the PLOS Article-Level Metrics project. He co-chairs the Research Data Alliance/FORCE11 Working Group on Source Software Code Identification. Martin has a medical degree from the Free University of Berlin and is a Board-certified medical oncologist.

Iain Hrynaszkiewicz

Publisher, Open Research at Public Library of Science

Iain leads the conceptualisation and development of new products and services that add value to the PLOS portfolio by supporting and enabling open science. Iain was previously Head of Data Publishing at Springer Nature where he developed and implemented research data policies and services, and was publisher of Nature Research Group’s Scientific Data journal. He has also been Outreach Director at Faculty of 1000 (F1000), and spent seven years at the first commercial open access publisher BioMed Central (BMC) in a variety of editorial, publishing and product/policy development roles. Iain is part of several research/publishing community projects related to data sharing and reproducible research.

Patrice Lopez

Founder, Science Miner

Patrice is a global expert on the application of Machine Learning to research articles. His previous projects include the pdf to TEI converter Grobid and the Deep Learning Framework For Text (DeLFT). He developed DataSeer’s core Machine Learning algorithms and remains actively engaged with the project. 

Daniella Lowenberg

Data Publishing & Data Metrics Product Manager/Dryad Product Manager

Daniella is the Product Manager for Dryad, a global open data publishing platform. She also directs the Make Data Count initiative focused on building the infrastructure for research data metrics. She also educates researchers/research stakeholders within the University of California system and globally on open research data publishing, open data metrics, research data ethics, and sharing of protected human data.

Jason Roberts

Jason Roberts

Senior Partner, Origin Editorial

After earning a doctorate in Geography from Loughborough University, Jason worked at Blackwell Science in Oxford, UK. He switched to the editorial team and eventually rose to be Senior Editor of US-based medical journals. In 2010 he left Blackwell to found Origin Editorial, offering his journal management expertise to a much wider range of journals. Jason was the founding president of the International Society of Managing and Technical Editors. He works closely with the EQUATOR Network to encourage improved reporting standards among journals, editors, and publishers.

Heather Staines

Director of Community Engagement and Senior Consultant at Delta Think

Before joining DeltaThink, Heather was Head of Partnerships for the Knowledge Futures Group, building open source infrastructure for publishers and libraries. Her previous roles include positions at Hypothesis, Proquest, SIPX (formerly the Stanford Intellectual Property Exchange), Springer SBM, and Greenwood Publishing Group/Praeger Publishers. She is a frequent speaker and participant at industry events including the COUNTER Board of Directors, the STM Futurelab, Society for Scholarly Publishing, the NISO Transfer Standing Committee, the NASIG Digital Preservation Task Force. She has a Ph.D. in Military and Diplomatic History from Yale University.