Estimation of wood fibre length distributions from censored mixture data

The motivating forestry background for this thesis is the need for fast, non-destructive, and cost-efficient methods to estimate fibre length distributions in standing trees in order to evaluate the effect of silvicultural methods and breeding programs on fibre length. The usage of increment cores i...

Full description

Bibliographic Details
Main Author: Svensson, Ingrid
Format: Doctoral or Postdoctoral Thesis
Language:English
Published: Umeå universitet, Institutionen för matematik och matematisk statistik 2007
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-1094
id ftumeauniv:oai:DiVA.org:umu-1094
record_format openpolar
spelling ftumeauniv:oai:DiVA.org:umu-1094 2023-05-15T17:45:15+02:00 Estimation of wood fibre length distributions from censored mixture data Svensson, Ingrid 2007 application/pdf http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-1094 eng eng Umeå universitet, Institutionen för matematik och matematisk statistik Umeå : Matematik och matematisk statistik http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-1094 urn:isbn:978-91-7264-300-0 info:eu-repo/semantics/openAccess censoring fibre length distribution identifiability increment core length bias mixture stochastic EM algorithm Probability Theory and Statistics Sannolikhetsteori och statistik Doctoral thesis, comprehensive summary info:eu-repo/semantics/doctoralThesis text 2007 ftumeauniv 2022-05-01T08:20:56Z The motivating forestry background for this thesis is the need for fast, non-destructive, and cost-efficient methods to estimate fibre length distributions in standing trees in order to evaluate the effect of silvicultural methods and breeding programs on fibre length. The usage of increment cores is a commonly used non-destructive sampling method in forestry. An increment core is a cylindrical wood sample taken with a special borer, and the methods proposed in this thesis are especially developed for data from increment cores. Nevertheless the methods can be used for data from other sampling frames as well, for example for sticks with the shape of an elongated rectangular box. This thesis proposes methods to estimate fibre length distributions based on censored mixture data from wood samples. Due to sampling procedures, wood samples contain cut (censored) and uncut observations. Moreover the samples consist not only of the fibres of interest but of other cells (fines) as well. When the cell lengths are determined by an automatic optical fibre-analyser, there is no practical possibility to distinguish between cut and uncut cells or between fines and fibres. Thus the resulting data come from a censored version of a mixture of the fine and fibre length distributions in the tree. The methods proposed in this thesis can handle this lack of information. Two parametric methods are proposed to estimate the fine and fibre length distributions in a tree. The first method is based on grouped data. The probabilities that the length of a cell from the sample falls into different length classes are derived, the censoring caused by the sampling frame taken into account. These probabilities are functions of the unknown parameters, and ML estimates are found from the corresponding multinomial model. The second method is a stochastic version of the EM algorithm based on the individual length measurements. The method is developed for the case where the distributions of the true lengths of the cells at least partially appearing in the sample belong to exponential families. The cell length distribution in the sample and the conditional distribution of the true length of a cell at least partially appearing in the sample given the length in the sample are derived. Both these distributions are necessary in order to use the stochastic EM algorithm. Consistency and asymptotic normality of the stochastic EM estimates is proved. The methods are applied to real data from increment cores taken from Scots pine trees (Pinus sylvestris L.) in Northern Sweden and further evaluated through simulation studies. Both methods work well for sample sizes commonly obtained in practice. Doctoral or Postdoctoral Thesis Northern Sweden Umeå University: Publications (DiVA)
institution Open Polar
collection Umeå University: Publications (DiVA)
op_collection_id ftumeauniv
language English
topic censoring
fibre length distribution
identifiability
increment core
length bias
mixture
stochastic EM algorithm
Probability Theory and Statistics
Sannolikhetsteori och statistik
spellingShingle censoring
fibre length distribution
identifiability
increment core
length bias
mixture
stochastic EM algorithm
Probability Theory and Statistics
Sannolikhetsteori och statistik
Svensson, Ingrid
Estimation of wood fibre length distributions from censored mixture data
topic_facet censoring
fibre length distribution
identifiability
increment core
length bias
mixture
stochastic EM algorithm
Probability Theory and Statistics
Sannolikhetsteori och statistik
description The motivating forestry background for this thesis is the need for fast, non-destructive, and cost-efficient methods to estimate fibre length distributions in standing trees in order to evaluate the effect of silvicultural methods and breeding programs on fibre length. The usage of increment cores is a commonly used non-destructive sampling method in forestry. An increment core is a cylindrical wood sample taken with a special borer, and the methods proposed in this thesis are especially developed for data from increment cores. Nevertheless the methods can be used for data from other sampling frames as well, for example for sticks with the shape of an elongated rectangular box. This thesis proposes methods to estimate fibre length distributions based on censored mixture data from wood samples. Due to sampling procedures, wood samples contain cut (censored) and uncut observations. Moreover the samples consist not only of the fibres of interest but of other cells (fines) as well. When the cell lengths are determined by an automatic optical fibre-analyser, there is no practical possibility to distinguish between cut and uncut cells or between fines and fibres. Thus the resulting data come from a censored version of a mixture of the fine and fibre length distributions in the tree. The methods proposed in this thesis can handle this lack of information. Two parametric methods are proposed to estimate the fine and fibre length distributions in a tree. The first method is based on grouped data. The probabilities that the length of a cell from the sample falls into different length classes are derived, the censoring caused by the sampling frame taken into account. These probabilities are functions of the unknown parameters, and ML estimates are found from the corresponding multinomial model. The second method is a stochastic version of the EM algorithm based on the individual length measurements. The method is developed for the case where the distributions of the true lengths of the cells at least partially appearing in the sample belong to exponential families. The cell length distribution in the sample and the conditional distribution of the true length of a cell at least partially appearing in the sample given the length in the sample are derived. Both these distributions are necessary in order to use the stochastic EM algorithm. Consistency and asymptotic normality of the stochastic EM estimates is proved. The methods are applied to real data from increment cores taken from Scots pine trees (Pinus sylvestris L.) in Northern Sweden and further evaluated through simulation studies. Both methods work well for sample sizes commonly obtained in practice.
format Doctoral or Postdoctoral Thesis
author Svensson, Ingrid
author_facet Svensson, Ingrid
author_sort Svensson, Ingrid
title Estimation of wood fibre length distributions from censored mixture data
title_short Estimation of wood fibre length distributions from censored mixture data
title_full Estimation of wood fibre length distributions from censored mixture data
title_fullStr Estimation of wood fibre length distributions from censored mixture data
title_full_unstemmed Estimation of wood fibre length distributions from censored mixture data
title_sort estimation of wood fibre length distributions from censored mixture data
publisher Umeå universitet, Institutionen för matematik och matematisk statistik
publishDate 2007
url http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-1094
genre Northern Sweden
genre_facet Northern Sweden
op_relation http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-1094
urn:isbn:978-91-7264-300-0
op_rights info:eu-repo/semantics/openAccess
_version_ 1766148113066622976