Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm
This study addresses the question of what diatom taxa to include in a modem calibration set based on their relative contribution in a palaeolinmological calibration model. Using a pruning algorithm for Artificial Neural Networks (ANNs) which determines the functionality of individual taxa in terms o...
Main Authors: | , , , |
---|---|
Format: | Article in Journal/Newspaper |
Language: | unknown |
Published: |
KLUWER ACADEMIC PUBL
2003
|
Subjects: | |
Online Access: | http://discovery.ucl.ac.uk/155448/ |
_version_ | 1821663703585521664 |
---|---|
author | Racca, JMJ Wild, M Birks, HJB Prairie, YT |
author_facet | Racca, JMJ Wild, M Birks, HJB Prairie, YT |
author_sort | Racca, JMJ |
collection | University College London: UCL Discovery |
description | This study addresses the question of what diatom taxa to include in a modem calibration set based on their relative contribution in a palaeolinmological calibration model. Using a pruning algorithm for Artificial Neural Networks (ANNs) which determines the functionality of individual taxa in terms of model performance, we pruned the Surface Water Acidification Project (SWAP) pH-diatom data-set until the predictive performance of the pruned set (as assessed by a jackknifing procedure) was statistically different from the initial full-set. Our results, based on the validation at each 5% data-set reduction, show that (i) 85% of the taxa can be removed without any effect on the pH model calibration performance, and (ii) that the complexity and the dimensionality reduction of the model by the removal of these non-essential or redundant taxa greatly improve the robustness of the calibration. A comparison between the commonly used "marginal" criteria for inclusion (species tolerance and Hill's N2) and our functionality criterion shows that the importance of each taxon in an ANN palaeolimnological model calibration does not appear to depend on these marginal characteristics. |
format | Article in Journal/Newspaper |
genre | Northern Sweden |
genre_facet | Northern Sweden |
geographic | Island Lakes |
geographic_facet | Island Lakes |
id | ftucl:oai:eprints.ucl.ac.uk.OAI2:155448 |
institution | Open Polar |
language | unknown |
long_lat | ENVELOPE(-128.226,-128.226,62.344,62.344) |
op_collection_id | ftucl |
op_source | J PALEOLIMNOL , 29 (1) pp. 123-133. (2003) |
publishDate | 2003 |
publisher | KLUWER ACADEMIC PUBL |
record_format | openpolar |
spelling | ftucl:oai:eprints.ucl.ac.uk.OAI2:155448 2025-01-16T23:55:30+00:00 Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm Racca, JMJ Wild, M Birks, HJB Prairie, YT 2003-01 http://discovery.ucl.ac.uk/155448/ unknown KLUWER ACADEMIC PUBL J PALEOLIMNOL , 29 (1) pp. 123-133. (2003) Artificial Neural Networks diatom model robustness pruning taxa contribution PARTIAL LEAST-SQUARES PAST ENVIRONMENTAL-CONDITIONS CHRYSOPHYTE CYST ASSEMBLAGES QUANTITATIVE INDICATORS INFERENCE MODELS PALEOENVIRONMENTAL RECONSTRUCTIONS NORTHERN SWEDEN ISLAND LAKES WA-PLS CALIBRATION Article 2003 ftucl 2016-01-21T23:11:55Z This study addresses the question of what diatom taxa to include in a modem calibration set based on their relative contribution in a palaeolinmological calibration model. Using a pruning algorithm for Artificial Neural Networks (ANNs) which determines the functionality of individual taxa in terms of model performance, we pruned the Surface Water Acidification Project (SWAP) pH-diatom data-set until the predictive performance of the pruned set (as assessed by a jackknifing procedure) was statistically different from the initial full-set. Our results, based on the validation at each 5% data-set reduction, show that (i) 85% of the taxa can be removed without any effect on the pH model calibration performance, and (ii) that the complexity and the dimensionality reduction of the model by the removal of these non-essential or redundant taxa greatly improve the robustness of the calibration. A comparison between the commonly used "marginal" criteria for inclusion (species tolerance and Hill's N2) and our functionality criterion shows that the importance of each taxon in an ANN palaeolimnological model calibration does not appear to depend on these marginal characteristics. Article in Journal/Newspaper Northern Sweden University College London: UCL Discovery Island Lakes ENVELOPE(-128.226,-128.226,62.344,62.344) |
spellingShingle | Artificial Neural Networks diatom model robustness pruning taxa contribution PARTIAL LEAST-SQUARES PAST ENVIRONMENTAL-CONDITIONS CHRYSOPHYTE CYST ASSEMBLAGES QUANTITATIVE INDICATORS INFERENCE MODELS PALEOENVIRONMENTAL RECONSTRUCTIONS NORTHERN SWEDEN ISLAND LAKES WA-PLS CALIBRATION Racca, JMJ Wild, M Birks, HJB Prairie, YT Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title | Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title_full | Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title_fullStr | Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title_full_unstemmed | Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title_short | Separating wheat from chaff: Diatom taxon selection using an artificial neural network pruning algorithm |
title_sort | separating wheat from chaff: diatom taxon selection using an artificial neural network pruning algorithm |
topic | Artificial Neural Networks diatom model robustness pruning taxa contribution PARTIAL LEAST-SQUARES PAST ENVIRONMENTAL-CONDITIONS CHRYSOPHYTE CYST ASSEMBLAGES QUANTITATIVE INDICATORS INFERENCE MODELS PALEOENVIRONMENTAL RECONSTRUCTIONS NORTHERN SWEDEN ISLAND LAKES WA-PLS CALIBRATION |
topic_facet | Artificial Neural Networks diatom model robustness pruning taxa contribution PARTIAL LEAST-SQUARES PAST ENVIRONMENTAL-CONDITIONS CHRYSOPHYTE CYST ASSEMBLAGES QUANTITATIVE INDICATORS INFERENCE MODELS PALEOENVIRONMENTAL RECONSTRUCTIONS NORTHERN SWEDEN ISLAND LAKES WA-PLS CALIBRATION |
url | http://discovery.ucl.ac.uk/155448/ |