Skip to main content

ukbREST: efficient and streamlined data access for reproducible research in large biobanks.

Citation
Pividori, M., and H. K. Im. “Ukbrest: Efficient And Streamlined Data Access For Reproducible Research In Large Biobanks.”. Bioinformatics (Oxford, England), pp. 1971-1973.
Center University of Chicago
Author Milton Pividori, Hae Kyung Im
Abstract

SUMMARY: Large biobanks, such as UK Biobank with half a million participants, are changing the scale and availability of genotypic and phenotypic data for researchers to ask fundamental questions about the biology of health and disease. The breadth of the UK Biobank data is enabling discoveries at an unprecedented pace. However, this size and complexity pose new challenges to investigators who need to keep the accruing data up to date, comply with potential consent changes, and efficiently and reproducibly extract subsets of the data to answer specific scientific questions. Here we propose a tool called ukbREST designed for the UK Biobank study (easily extensible to other biobanks), which allows authorized users to efficiently retrieve phenotypic and genetic data. It exposes a REST API that makes data highly accessible inside a private and secure network, allowing the data specification in a human readable text format easily shareable with other researchers. These characteristics make ukbREST an important tool to make biobank's valuable data more readily accessible to the research community and facilitate reproducibility of the analysis, a key aspect of science.

AVAILABILITY AND IMPLEMENTATION: It is implemented in Python using the Flask-RESTful framework for the API, and it is under the MIT license. It works with PostgreSQL and a Docker image is available for easy deployment. The source code and documentation is available in Github: https://github.com/hakyimlab/ukbrest.

Year of Publication
2019
Journal
Bioinformatics (Oxford, England)
Volume
35
Issue
11
Number of Pages
1971-1973
Date Published
12/2019
ISSN Number
1367-4811
DOI
10.1093/bioinformatics/bty925
Alternate Journal
Bioinformatics
PMID
30395166
PMCID
PMC6546122
Download citation