Order selection and sparsity in latent variable models via the ordered factor LASSO
Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, t...
Published in: | Biometrics |
---|---|
Main Authors: | , , |
Format: | Article in Journal/Newspaper |
Language: | unknown |
Subjects: | |
Online Access: | https://doi.org/10.1111/biom.12888 |
id |
ftrepec:oai:RePEc:bla:biomet:v:74:y:2018:i:4:p:1311-1319 |
---|---|
record_format |
openpolar |
spelling |
ftrepec:oai:RePEc:bla:biomet:v:74:y:2018:i:4:p:1311-1319 2024-04-14T08:19:59+00:00 Order selection and sparsity in latent variable models via the ordered factor LASSO Francis K. C. Hui Emi Tanaka David I. Warton https://doi.org/10.1111/biom.12888 unknown https://doi.org/10.1111/biom.12888 article ftrepec https://doi.org/10.1111/biom.12888 2024-03-19T10:31:00Z Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non‐zero loadings are concentrated on the lower‐order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates. Article in Journal/Newspaper Southern Ocean RePEc (Research Papers in Economics) Southern Ocean Biometrics 74 4 1311 1319 |
institution |
Open Polar |
collection |
RePEc (Research Papers in Economics) |
op_collection_id |
ftrepec |
language |
unknown |
description |
Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non‐zero loadings are concentrated on the lower‐order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates. |
format |
Article in Journal/Newspaper |
author |
Francis K. C. Hui Emi Tanaka David I. Warton |
spellingShingle |
Francis K. C. Hui Emi Tanaka David I. Warton Order selection and sparsity in latent variable models via the ordered factor LASSO |
author_facet |
Francis K. C. Hui Emi Tanaka David I. Warton |
author_sort |
Francis K. C. Hui |
title |
Order selection and sparsity in latent variable models via the ordered factor LASSO |
title_short |
Order selection and sparsity in latent variable models via the ordered factor LASSO |
title_full |
Order selection and sparsity in latent variable models via the ordered factor LASSO |
title_fullStr |
Order selection and sparsity in latent variable models via the ordered factor LASSO |
title_full_unstemmed |
Order selection and sparsity in latent variable models via the ordered factor LASSO |
title_sort |
order selection and sparsity in latent variable models via the ordered factor lasso |
url |
https://doi.org/10.1111/biom.12888 |
geographic |
Southern Ocean |
geographic_facet |
Southern Ocean |
genre |
Southern Ocean |
genre_facet |
Southern Ocean |
op_relation |
https://doi.org/10.1111/biom.12888 |
op_doi |
https://doi.org/10.1111/biom.12888 |
container_title |
Biometrics |
container_volume |
74 |
container_issue |
4 |
container_start_page |
1311 |
op_container_end_page |
1319 |
_version_ |
1796298140961210368 |