Order selection and sparsity in latent variable models via the ordered factor LASSO

Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, t...

Full description

Bibliographic Details
Published in:Biometrics
Main Authors: Francis K. C. Hui, Emi Tanaka, David I. Warton
Format: Article in Journal/Newspaper
Language:unknown
Subjects:
Online Access:https://doi.org/10.1111/biom.12888
id ftrepec:oai:RePEc:bla:biomet:v:74:y:2018:i:4:p:1311-1319
record_format openpolar
spelling ftrepec:oai:RePEc:bla:biomet:v:74:y:2018:i:4:p:1311-1319 2024-04-14T08:19:59+00:00 Order selection and sparsity in latent variable models via the ordered factor LASSO Francis K. C. Hui Emi Tanaka David I. Warton https://doi.org/10.1111/biom.12888 unknown https://doi.org/10.1111/biom.12888 article ftrepec https://doi.org/10.1111/biom.12888 2024-03-19T10:31:00Z Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non‐zero loadings are concentrated on the lower‐order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates. Article in Journal/Newspaper Southern Ocean RePEc (Research Papers in Economics) Southern Ocean Biometrics 74 4 1311 1319
institution Open Polar
collection RePEc (Research Papers in Economics)
op_collection_id ftrepec
language unknown
description Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non‐zero loadings are concentrated on the lower‐order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates.
format Article in Journal/Newspaper
author Francis K. C. Hui
Emi Tanaka
David I. Warton
spellingShingle Francis K. C. Hui
Emi Tanaka
David I. Warton
Order selection and sparsity in latent variable models via the ordered factor LASSO
author_facet Francis K. C. Hui
Emi Tanaka
David I. Warton
author_sort Francis K. C. Hui
title Order selection and sparsity in latent variable models via the ordered factor LASSO
title_short Order selection and sparsity in latent variable models via the ordered factor LASSO
title_full Order selection and sparsity in latent variable models via the ordered factor LASSO
title_fullStr Order selection and sparsity in latent variable models via the ordered factor LASSO
title_full_unstemmed Order selection and sparsity in latent variable models via the ordered factor LASSO
title_sort order selection and sparsity in latent variable models via the ordered factor lasso
url https://doi.org/10.1111/biom.12888
geographic Southern Ocean
geographic_facet Southern Ocean
genre Southern Ocean
genre_facet Southern Ocean
op_relation https://doi.org/10.1111/biom.12888
op_doi https://doi.org/10.1111/biom.12888
container_title Biometrics
container_volume 74
container_issue 4
container_start_page 1311
op_container_end_page 1319
_version_ 1796298140961210368