Multiple Imputation for Multivariate Data with Missing and Below‐Threshold Measurements: Time‐Series Concentrations of Pollutants in the Arctic

Bibliographic Details
Published in: Biometrics
Main Authors: Philip K. Hopke, Chuanhai Liu, Donald B. Rubin
Format: Article in Journal/Newspaper
Language: unknown
Online Access: https://doi.org/10.1111/j.0006-341X.2001.00022.x
Description
Summary: Many chemical and environmental data sets are complicated by the existence of fully missing values or censored values known to lie below detection thresholds. For example, week‐long samples of airborne particulate matter were obtained at Alert, NWT, Canada, between 1980 and 1991, where some of the concentrations of 24 particulate constituents were coarsened in the sense of being either fully missing or below detection limits. To facilitate scientific analysis, it is appealing to create complete data by filling in missing values so that standard complete‐data methods can be applied. We briefly review commonly used strategies for handling missing values and focus on the multiple‐imputation approach, which generally leads to valid inferences when faced with missing data. Three statistical models are developed for multiply imputing the missing values of airborne particulate matter. We expect that these models are useful for creating multiple imputations in a variety of incomplete multivariate time series data sets.