undefined

Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences

Year of publication

2022

Authors

Lähteenmäki, Jaakko; Vuorinen, Anna Leena; Pajula, Juha; Harno, Kari; Lehto, Mika; Niemi, Mikko; Van Gils, Mark

Abstract

Aim: This case study aimed to investigate the process of integrating resources of multiple biobanks and health-care registers, especially addressing data permit application, time schedules, co-operation of stakeholders, data exchange and data quality. Methods: We investigated the process in the context of a retrospective study: Pharmacogenomics of antithrombotic drugs (PreMed study). The study involved linking the genotype data of three Finnish biobanks (Auria Biobank, Helsinki Biobank and THL Biobank) with register data on medicine dispensations, health-care encounters and laboratory results. Results: We managed to collect a cohort of 7005 genotyped individuals, thereby achieving the statistical power requirements of the study. The data collection process took 16 months, exceeding our original estimate by seven months. The main delays were caused by the congested data permit approval service to access national register data on health-care encounters. Comparison of hospital data lakes and national registers revealed differences, especially concerning medication data. Genetic variant frequencies were in line with earlier data reported for the European population. The yearly number of international normalised ratio (INR) tests showed stable behaviour over time. Conclusions: A large cohort, consisting of versatile individual-level phenotype and genotype data, can be constructed by integrating data from several biobanks and health data registers in Finland. Co-operation with biobanks is straightforward. However, long time periods need to be reserved when biobank resources are linked with national register data. There is a need for efforts to define general, harmonised co-operation practices and data exchange methods for enabling efficient collection of data from multiple sources.
Show more

Organizations and authors

Tampere University

Van Gils Mark Orcid -palvelun logo

University of Helsinki

Vuorinen Anna-Leena

Lehto Mika

Niemi Mikko

Helsinki University Hospital

Vuorinen Anna-Leena

Lehto Mika

Niemi Mikko

VTT Technical Research Centre of Finland Ltd

Vuorinen Anna-Leena Orcid -palvelun logo

Lähteenmäki Jaakko

Pajula Juha Orcid -palvelun logo

van Gils Mark

Publication type

Publication format

Article

Parent publication type

Journal

Article type

Original article

Audience

Scientific

Peer-reviewed

Peer-Reviewed

MINEDU's publication type classification code

A1 Journal article (refereed), original research

Publication channel information

Volume

50

Issue

4

Pages

482-489

​Publication forum

66804

​Publication forum level

1

Open access

Open access in the publisher’s service

Yes

Open access of publication channel

Partially open publication channel

License of the publisher’s version

Other license

Self-archived

Yes

Article processing fee (EUR)

2570

Other information

Fields of science

Computer and information sciences; Electronic, automation and communications engineering, electronics; Biomedicine; Health care science; Public health care science, environmental and occupational health

Keywords

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Internationality of the publisher

International

Language

English

International co-publication

No

Co-publication with a company

No

DOI

10.1177/14034948211004421

The publication is included in the Ministry of Education and Culture’s Publication data collection

Yes