A Clustering Optimization Strategy for Molecular Taxonomy Applied to Planktonic Foraminifera SSU rDNA

Markus Göker1, Guido W. Grimm2, Alexander F. Auch3, Ralf Aurahs4 and Michal KuÄera41German Collection of Microorganisms and Cell Cultures (DSMZ), Inhoffenstraße 7B, 38124 Braunschweig, Germany. 2Swedish Museum of Natural History, Box 50007, Stockholm, Sweden. 3Center for Bioinformatics Tübingen, Ebe...

Full description

Bibliographic Details
Main Author: Göker
Other Authors: Markus Göker, Guido W. Grimm, Alexander F. Auch, Ralf Aurahs and Michal KuÄera
Language:English
Published: Libertas Academica 2010
Subjects:
Online Access:http://www.la-press.com/redirect_file.php?fileId=3079&filename=2255-EBO-A-Clustering-Optimization-Strategy-for-Molecular-Taxonomy-Applied-to-P.pdf&fileType=zip
Description
Summary:Markus Göker1, Guido W. Grimm2, Alexander F. Auch3, Ralf Aurahs4 and Michal KuÄera41German Collection of Microorganisms and Cell Cultures (DSMZ), Inhoffenstraße 7B, 38124 Braunschweig, Germany. 2Swedish Museum of Natural History, Box 50007, Stockholm, Sweden. 3Center for Bioinformatics Tübingen, Eberhard Karls University of Tübingen, Sand 14, 72076 Tübingen, Germany. 4Institute of Geosciences, Eberhard Karls University of Tübingen, Sigwartstraße 10, 72076 Tübingen, Germany. AbstractIdentifying species is challenging in the case of organisms for which primarily molecular data are available. Even if morphological features are available, molecular taxonomy is often necessary to revise taxonomic concepts and to analyze environmental DNA sequences. However, clustering approaches to delineate molecular operational taxonomic units often rely on arbitrary parameter choices. Also, distance calculation is difficult for highly alignment-ambiguous sequences. Here, we applied a recently described clustering optimization method to highly divergent planktonic foraminifera SSU rDNA sequences. We determined the distance function and the clustering setting that result in the highest agreement with morphological reference data. Alignment-free distance calculation, when adapted to the use with partly non-homologous sequences caused by distinct primer pairs, outperformed multiple sequence alignment. Clustering optimization offers new perspectives for the barcoding of species diversity and for environmental sequencing. It bridges the gap between traditional and modern taxonomic disciplines by specifically addressing the issue of how to optimally account for both genetic divergence and given species concepts.