High-Dimensional Adaptive Basis Density Estimation

In the realm of high-dimensional statistics, regression and classification have received much attention, while density estimation has lagged behind. Yet there are compelling scientific questions which can only be addressed via density estimation using high-dimensional data, such as the paths of Nort...

Full description

Bibliographic Details
Main Author: Susan Buchman
Format: Thesis
Language:unknown
Published: 2018
Subjects:
Online Access:https://doi.org/10.1184/r1/6719834.v1
https://figshare.com/articles/High-Dimensional_Adaptive_Basis_Density_Estimation/6719834
id ftcarnmellonufig:oai:figshare.com:article/6719834
record_format openpolar
spelling ftcarnmellonufig:oai:figshare.com:article/6719834 2023-05-15T17:34:39+02:00 High-Dimensional Adaptive Basis Density Estimation Susan Buchman 2018-07-01T00:12:33Z https://doi.org/10.1184/r1/6719834.v1 https://figshare.com/articles/High-Dimensional_Adaptive_Basis_Density_Estimation/6719834 unknown doi:10.1184/r1/6719834.v1 https://figshare.com/articles/High-Dimensional_Adaptive_Basis_Density_Estimation/6719834 In Copyright Probability Statistics Text Thesis 2018 ftcarnmellonufig https://doi.org/10.1184/r1/6719834.v1 2019-11-18T10:43:48Z In the realm of high-dimensional statistics, regression and classification have received much attention, while density estimation has lagged behind. Yet there are compelling scientific questions which can only be addressed via density estimation using high-dimensional data, such as the paths of North Atlantic tropical cyclones. If we cast each track as a single high-dimensional data point, density estimation allows us to answer such questions via integration or Monte Carlo methods. In this dissertation, I present three new methods for estimating densities and intensities for high-dimensional data, all of which rely on a technique called diffusion maps. This technique constructs a mapping for high-dimensional, complex data into a low-dimensional space, providing a new basis that can be used in conjunction with traditional density estimation methods. Furthermore, I propose a reordering of importance sampling in the high-dimensional setting. Traditional importance sampling estimates high-dimensional integrals with the aid of an instrumental distribution chosen specifically to minimize the variance of the estimator. In many applications, the integral of interest is with respect to an estimated density. I argue that in the high-dimensional realm, performance can be improved by reversing the procedure: instead of estimating a density and then selecting an appropriate instrumental distribution, begin with the instrumental distribution and estimate the density with respect to it directly. The variance reduction follows from the improved density estimate. Lastly, I present some initial results in using climatic predictors such as sea surface temperature as spatial covariates in point process estimation. Thesis North Atlantic KiltHub Research from Carnegie Mellon University
institution Open Polar
collection KiltHub Research from Carnegie Mellon University
op_collection_id ftcarnmellonufig
language unknown
topic Probability
Statistics
spellingShingle Probability
Statistics
Susan Buchman
High-Dimensional Adaptive Basis Density Estimation
topic_facet Probability
Statistics
description In the realm of high-dimensional statistics, regression and classification have received much attention, while density estimation has lagged behind. Yet there are compelling scientific questions which can only be addressed via density estimation using high-dimensional data, such as the paths of North Atlantic tropical cyclones. If we cast each track as a single high-dimensional data point, density estimation allows us to answer such questions via integration or Monte Carlo methods. In this dissertation, I present three new methods for estimating densities and intensities for high-dimensional data, all of which rely on a technique called diffusion maps. This technique constructs a mapping for high-dimensional, complex data into a low-dimensional space, providing a new basis that can be used in conjunction with traditional density estimation methods. Furthermore, I propose a reordering of importance sampling in the high-dimensional setting. Traditional importance sampling estimates high-dimensional integrals with the aid of an instrumental distribution chosen specifically to minimize the variance of the estimator. In many applications, the integral of interest is with respect to an estimated density. I argue that in the high-dimensional realm, performance can be improved by reversing the procedure: instead of estimating a density and then selecting an appropriate instrumental distribution, begin with the instrumental distribution and estimate the density with respect to it directly. The variance reduction follows from the improved density estimate. Lastly, I present some initial results in using climatic predictors such as sea surface temperature as spatial covariates in point process estimation.
format Thesis
author Susan Buchman
author_facet Susan Buchman
author_sort Susan Buchman
title High-Dimensional Adaptive Basis Density Estimation
title_short High-Dimensional Adaptive Basis Density Estimation
title_full High-Dimensional Adaptive Basis Density Estimation
title_fullStr High-Dimensional Adaptive Basis Density Estimation
title_full_unstemmed High-Dimensional Adaptive Basis Density Estimation
title_sort high-dimensional adaptive basis density estimation
publishDate 2018
url https://doi.org/10.1184/r1/6719834.v1
https://figshare.com/articles/High-Dimensional_Adaptive_Basis_Density_Estimation/6719834
genre North Atlantic
genre_facet North Atlantic
op_relation doi:10.1184/r1/6719834.v1
https://figshare.com/articles/High-Dimensional_Adaptive_Basis_Density_Estimation/6719834
op_rights In Copyright
op_doi https://doi.org/10.1184/r1/6719834.v1
_version_ 1766133544061501440