Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
Abstract Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is growing interest in investigating associations between genomics data and multivariate phenotypes, for example, in gene e...
Published in: | Bioinformatics |
---|---|
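Main Authors: | Marttinen, Pekka; Pirinen, Matti; Sarin, Antti-Pekka; Gillberg, Jussi; Kettunen, Johannes; Surakka, Ida; Kangas, Antti J.; Soininen, Pasi; O’Reilly, Paul; Kaakinen, Marika; Kähönen, Mika; Lehtimäki, Terho; Ala-Korpela, Mika; Raitakari, Olli T.; Salomaa, Veikko; Järvelin, Marjo-Riitta; Ripatti, Samuli; Kaski, Samuel |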
Format: | Article in Journal/Newspaper |
Language: | English |
Published: | Oxford University Press (OUP) 2014 |
Online Access: | http://dx.doi.org/10.1093/bioinformatics/btu140 https://academic.oup.com/bioinformatics/article-pdf/30/14/2026/48925112/bioinformatics_30_14_2026.pdf |
_version_ | 1821661401564839936 |
---|---|
author | Marttinen, Pekka Pirinen, Matti Sarin, Antti-Pekka Gillberg, Jussi Kettunen, Johannes Surakka, Ida Kangas, Antti J. Soininen, Pasi O’Reilly, Paul Kaakinen, Marika Kähönen, Mika Lehtimäki, Terho Ala-Korpela, Mika Raitakari, Olli T. Salomaa, Veikko Järvelin, Marjo-Riitta Ripatti, Samuli Kaski, Samuel |
author_sort | Marttinen, Pekka |
collection | Oxford University Press |
container_issue | 14 |
container_start_page | 2026 |
container_title | Bioinformatics |
container_volume | 30 |
description | Abstract Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is growing interest in investigating associations between genomics data and multivariate phenotypes, for example, in gene expression or metabolomics studies. A common approach is to perform a univariate test between each genotype–phenotype pair, and then to apply a stringent significance cutoff to account for the large number of tests performed. However, this approach has limited ability to uncover dependencies involving multiple variables. Another trend in current genetics is the investigation of the impact of rare variants on the phenotype, where the standard methods often fail owing to lack of power when the minor allele is present in only a limited number of individuals. Results: We propose a new statistical approach based on Bayesian reduced rank regression to assess the impact of multiple SNPs on a high-dimensional phenotype. Because of the method’s ability to combine information over multiple SNPs and phenotypes, it is particularly suitable for detecting associations involving rare variants. We demonstrate the potential of our method and compare it with alternatives using the Northern Finland Birth Cohort with 4702 individuals, for whom genome-wide SNP data along with lipoprotein profiles comprising 74 traits are available. We discovered two genes (XRCC4 and MTHFD2L) without previously reported associations, which replicated in a combined analysis of two additional cohorts: 2390 individuals from the Cardiovascular Risk in Young Finns study and 3659 individuals from the FINRISK study. Availability and implementation: R-code freely available for download at http://users.ics.aalto.fi/pemartti/gene_metabolome/ . Contact: samuli.ripatti@helsinki.fi, samuel.kaski@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Article in Journal/Newspaper |
genre | Northern Finland |
id | croxfordunivpr:10.1093/bioinformatics/btu140 |
institution | Open Polar |
language | English |
op_collection_id | croxfordunivpr |
op_container_end_page | 2034 |
op_doi | https://doi.org/10.1093/bioinformatics/btu140 |
op_rights | http://creativecommons.org/licenses/by-nc/3.0/ |
op_source | Bioinformatics volume 30, issue 14, page 2026-2034 ISSN 1367-4811 1367-4803 |
publishDate | 2014 |
publisher | Oxford University Press (OUP) |
record_format | openpolar |
title | Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression |