The Dementias Platform UK (DPUK) Data Portal
Bauermeister S., Orton C., Thompson S., Barker RA., Bauermeister JR., Ben-Shlomo Y., Brayne C., Burn D., Campbell A., Calvin C., Chandran S., Chaturvedi N., Chene G., Chessell IP., Corbett A., Davis DHJ., Denis M., Dufouil C., Elliott P., Fox N., Hill D., Hofer SM., Hu MT., Jindra C., Kee F., Kim CH., Kim C., Kivimaki M., Koychev I., Lawson RA., Linden GJ., Lyons RA., Mackay C., Matthews PM., McGuiness B., Middleton L., Moody C., Moore K., Na DL., O’Brien JT., Ourselin S., Paranjothy S., Park KS., Porteous DJ., Richards M., Ritchie CW., Rohrer JD., Rossor MN., Rowe JB., Scahill R., Schnier C., Schott JM., Seo SW., South M., Steptoe A., Tabrizi SJ., Tales A., Tillin T., Timpson NJ., Toga AW., Visser PJ., Wade-Martins R., Wilkinson T., Williams J., Wong A., Gallacher JE.
<jats:title>Abstract</jats:title><jats:p>The Dementias Platform UK (DPUK) Data Portal is a data repository facilitating access to data for 3 370 929 individuals in 42 cohorts. The Data Portal is an end-to-end data management solution providing a secure, fully auditable, remote access environment for the analysis of cohort data. All projects utilising the data are by default collaborations with the cohort research teams generating the data.</jats:p><jats:p>The Data Portal uses UK Secure eResearch Platform (UKSeRP) infrastructure to provide three core utilities: data discovery, access, and analysis. These are delivered using a 7 layered architecture comprising: data ingestion, data curation, platform interoperability, data discovery, access brokerage, data analysis and knowledge preservation. Automated, streamlined, and standardised procedures reduce the administrative burden for all stakeholders, particularly for requests involving multiple independent datasets, where a single request may be forwarded to multiple data controllers. Researchers are provided with their own secure ‘lab’ using VMware which is accessed using two factor authentication.</jats:p><jats:p>Over the last 2 years, 160 project proposals involving 579 individual cohort data access requests were received. These were received from 268 applicants spanning 72 institutions (56 academic, 13 commercial, 3 government) in 16 countries with 84 requests involving multiple cohorts. Project are varied including multi-modal, machine learning, and Mendelian randomisation analyses. Data access is usually free at point of use although a small number of cohorts require a data access fee.</jats:p>