Additional file 2 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages

Additional file 2: Supplementary Table 1. Results of the CBS detection procedure and the validation using Mash Screen. Each row reports: CBS ID (i.e. the CBS MAG representative), metagenomic sample, estimated depth of coverage (mean, standard deviation, first quartile, median third quartile), number...

Full description

Bibliographic Details
Main Authors: Albanese, Davide, Coleine, Claudia, Rota-Stabelli, Omar, Onofri, Silvano, Tringe, Susannah G., Stajich, Jason E., Selbmann, Laura, Donati, Claudio
Format: Dataset
Language:unknown
Published: figshare 2021
Subjects:
Online Access:https://dx.doi.org/10.6084/m9.figshare.14251830
https://springernature.figshare.com/articles/dataset/Additional_file_2_of_Pre-Cambrian_roots_of_novel_Antarctic_cryptoendolithic_bacterial_lineages/14251830
Description
Summary:Additional file 2: Supplementary Table 1. Results of the CBS detection procedure and the validation using Mash Screen. Each row reports: CBS ID (i.e. the CBS MAG representative), metagenomic sample, estimated depth of coverage (mean, standard deviation, first quartile, median third quartile), number of mapped reads, ANI between the consensus sequences and the CBS representative, coverage breadths at depths from 1 to 5, Mash Screen containment score, number of shared hashes, median multiplicity and containment score p-value. Supplementary Table 2. Assembly statistics and taxonomic classification of the MAGs. Supplementary Table 3. Abundance of CBS at phylum level, expressed as percentage of reads that could be mapped to the representative CBS. Median: median; Q1 and Q3: first and third quartile; IQR: interquartile range; Mean: mean; SD: standard deviation; #CBS: number of candidate bacterial species belonging to the phylum. Supplementary Table 4. Increase in the number of bacterial species for each taxonomic Order provided by the data in the present study, compared to the data available in the GTDB database. Supplementary Table 5. Sample metadata. Geographic coordinates of the sampling sites, accession numbers of the raw sequences, accession numbers and N50 of the assembled metagenomes on the JGI IMG/M portal. Supplementary Table 6. Prevalence and taxonomic classification for each CBS representative. Supplementary Table 7. Summary of Bayesian divergence estimates. For each order we report the mean age of its origin (OO: the split of the order from the closest order) and the 95% CI (OO max and OO min), the origin of the oldest uniquely Antarctic clade (AOO1, the split of the Antarctic clade from a non-Antarctic lineage of the same order), and, where present, the origin of the second oldest antarctic clade (AOO2). See Supplementary Data 1. Supplementary Table 8. Number of predicted proteins (NProts) and of proteins that had a match in the EggNOG database (NHitsOG) and that could be associated to a term in the Gene Ontology (NHitsGO) or had a match in the KEGG and COG databases (NHitsKEGG and NHitsCOG, respectively). Supplementary Table 9. Number of KEGG orthologs characteristic of the Antarctic or reference Jiangellales genomes. The Fisher’s exact test (uncorrected p<0.05) was performed to identify unevenly distributed orthologs between the two groups. Supplementary Table 10. Number of KEGG orthologs characteristic of the Antarctic or reference Thermomicrobiales genomes. The Fisher’s exact test (uncorrected p<0.05) was performed to identify unevenly distributed orthologs between the two groups.