Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression

Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is a growing interest to investigate associations between genomics data and multivariate phenotypes, for example, in gene expression...

Full description

Bibliographic Details
Published in:Bioinformatics
Main Authors: Marttinen, Pekka, Pirinen, Matti, Sarin, Antti-Pekka, Gillberg, Jussi, Kettunen, Johannes, Surakka, Ida, Kangas, Antti J., Soininen, Pasi, O’Reilly, Paul, Kaakinen, Marika, Kähönen, Mika, Lehtimäki, Terho, Ala-Korpela, Mika, Raitakari, Olli T., Salomaa, Veikko, Järvelin, Marjo-Riitta, Ripatti, Samuli, Kaski, Samuel
Format: Text
Language:English
Published: Oxford University Press 2014
Subjects:
Online Access:http://bioinformatics.oxfordjournals.org/cgi/content/short/30/14/2026
https://doi.org/10.1093/bioinformatics/btu140
_version_ 1821661390076641280
author Marttinen, Pekka
Pirinen, Matti
Sarin, Antti-Pekka
Gillberg, Jussi
Kettunen, Johannes
Surakka, Ida
Kangas, Antti J.
Soininen, Pasi
O’Reilly, Paul
Kaakinen, Marika
Kähönen, Mika
Lehtimäki, Terho
Ala-Korpela, Mika
Raitakari, Olli T.
Salomaa, Veikko
Järvelin, Marjo-Riitta
Ripatti, Samuli
Kaski, Samuel
author_facet Marttinen, Pekka
Pirinen, Matti
Sarin, Antti-Pekka
Gillberg, Jussi
Kettunen, Johannes
Surakka, Ida
Kangas, Antti J.
Soininen, Pasi
O’Reilly, Paul
Kaakinen, Marika
Kähönen, Mika
Lehtimäki, Terho
Ala-Korpela, Mika
Raitakari, Olli T.
Salomaa, Veikko
Järvelin, Marjo-Riitta
Ripatti, Samuli
Kaski, Samuel
author_sort Marttinen, Pekka
collection HighWire Press (Stanford University)
container_issue 14
container_start_page 2026
container_title Bioinformatics
container_volume 30
description Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is a growing interest to investigate associations between genomics data and multivariate phenotypes, for example, in gene expression or metabolomics studies. A common approach is to perform a univariate test between each genotype–phenotype pair, and then to apply a stringent significance cutoff to account for the large number of tests performed. However, this approach has limited ability to uncover dependencies involving multiple variables. Another trend in the current genetics is the investigation of the impact of rare variants on the phenotype, where the standard methods often fail owing to lack of power when the minor allele is present in only a limited number of individuals. Results: We propose a new statistical approach based on Bayesian reduced rank regression to assess the impact of multiple SNPs on a high-dimensional phenotype. Because of the method’s ability to combine information over multiple SNPs and phenotypes, it is particularly suitable for detecting associations involving rare variants. We demonstrate the potential of our method and compare it with alternatives using the Northern Finland Birth Cohort with 4702 individuals, for whom genome-wide SNP data along with lipoprotein profiles comprising 74 traits are available. We discovered two genes ( XRCC4 and MTHFD2L ) without previously reported associations, which replicated in a combined analysis of two additional cohorts: 2390 individuals from the Cardiovascular Risk in Young Finns study and 3659 individuals from the FINRISK study. Availability and implementation: R-code freely available for download at http://users.ics.aalto.fi/pemartti/gene_metabolome/ . Contact: samuli.ripatti@helsinki.fi samuel.kaski@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
genre Northern Finland
genre_facet Northern Finland
id fthighwire:oai:open-archive.highwire.org:bioinfo:30/14/2026
institution Open Polar
language English
op_collection_id fthighwire
op_container_end_page 2034
op_doi https://doi.org/10.1093/bioinformatics/btu140
op_relation http://bioinformatics.oxfordjournals.org/cgi/content/short/30/14/2026
http://dx.doi.org/10.1093/bioinformatics/btu140
op_rights Copyright (C) 2014, Oxford University Press
publishDate 2014
publisher Oxford University Press
record_format openpolar
spelling fthighwire:oai:open-archive.highwire.org:bioinfo:30/14/2026 2025-01-16T23:52:45+00:00 Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression Marttinen, Pekka Pirinen, Matti Sarin, Antti-Pekka Gillberg, Jussi Kettunen, Johannes Surakka, Ida Kangas, Antti J. Soininen, Pasi O’Reilly, Paul Kaakinen, Marika Kähönen, Mika Lehtimäki, Terho Ala-Korpela, Mika Raitakari, Olli T. Salomaa, Veikko Järvelin, Marjo-Riitta Ripatti, Samuli Kaski, Samuel 2014-07-15 00:00:00.0 text/html http://bioinformatics.oxfordjournals.org/cgi/content/short/30/14/2026 https://doi.org/10.1093/bioinformatics/btu140 en eng Oxford University Press http://bioinformatics.oxfordjournals.org/cgi/content/short/30/14/2026 http://dx.doi.org/10.1093/bioinformatics/btu140 Copyright (C) 2014, Oxford University Press GENETICS AND POPULATION ANALYSIS TEXT 2014 fthighwire https://doi.org/10.1093/bioinformatics/btu140 2015-02-28T19:13:00Z Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is a growing interest to investigate associations between genomics data and multivariate phenotypes, for example, in gene expression or metabolomics studies. A common approach is to perform a univariate test between each genotype–phenotype pair, and then to apply a stringent significance cutoff to account for the large number of tests performed. However, this approach has limited ability to uncover dependencies involving multiple variables. Another trend in the current genetics is the investigation of the impact of rare variants on the phenotype, where the standard methods often fail owing to lack of power when the minor allele is present in only a limited number of individuals. Results: We propose a new statistical approach based on Bayesian reduced rank regression to assess the impact of multiple SNPs on a high-dimensional phenotype. Because of the method’s ability to combine information over multiple SNPs and phenotypes, it is particularly suitable for detecting associations involving rare variants. We demonstrate the potential of our method and compare it with alternatives using the Northern Finland Birth Cohort with 4702 individuals, for whom genome-wide SNP data along with lipoprotein profiles comprising 74 traits are available. We discovered two genes ( XRCC4 and MTHFD2L ) without previously reported associations, which replicated in a combined analysis of two additional cohorts: 2390 individuals from the Cardiovascular Risk in Young Finns study and 3659 individuals from the FINRISK study. Availability and implementation: R-code freely available for download at http://users.ics.aalto.fi/pemartti/gene_metabolome/ . Contact: samuli.ripatti@helsinki.fi samuel.kaski@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online. Text Northern Finland HighWire Press (Stanford University) Bioinformatics 30 14 2026 2034
spellingShingle GENETICS AND POPULATION ANALYSIS
Marttinen, Pekka
Pirinen, Matti
Sarin, Antti-Pekka
Gillberg, Jussi
Kettunen, Johannes
Surakka, Ida
Kangas, Antti J.
Soininen, Pasi
O’Reilly, Paul
Kaakinen, Marika
Kähönen, Mika
Lehtimäki, Terho
Ala-Korpela, Mika
Raitakari, Olli T.
Salomaa, Veikko
Järvelin, Marjo-Riitta
Ripatti, Samuli
Kaski, Samuel
Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title_full Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title_fullStr Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title_full_unstemmed Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title_short Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
title_sort assessing multivariate gene-metabolome associations with rare variants using bayesian reduced rank regression
topic GENETICS AND POPULATION ANALYSIS
topic_facet GENETICS AND POPULATION ANALYSIS
url http://bioinformatics.oxfordjournals.org/cgi/content/short/30/14/2026
https://doi.org/10.1093/bioinformatics/btu140