Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages
Additional file 1: Figure S1. Antarctic CBS form two distinct clades in the order Frankiales, with characteristic metabolic potential. a) Maximum-likelihood phylogenetic tree based on GTDB 120 core genes including representative genomes (violet) and Antarctic CBS (green). b) Principal Coordinate Ana...
Main Authors: | , , , , , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
figshare
2021
|
Subjects: | |
Online Access: | https://dx.doi.org/10.6084/m9.figshare.14251827.v1 https://springernature.figshare.com/articles/journal_contribution/Additional_file_1_of_Pre-Cambrian_roots_of_novel_Antarctic_cryptoendolithic_bacterial_lineages/14251827/1 |
id |
ftdatacite:10.6084/m9.figshare.14251827.v1 |
---|---|
record_format |
openpolar |
institution |
Open Polar |
collection |
DataCite Metadata Store (German National Library of Science and Technology) |
op_collection_id |
ftdatacite |
language |
unknown |
topic |
Genetics FOS Biological sciences Evolutionary Biology |
spellingShingle |
Genetics FOS Biological sciences Evolutionary Biology Albanese, Davide Coleine, Claudia Rota-Stabelli, Omar Onofri, Silvano Tringe, Susannah G. Stajich, Jason E. Selbmann, Laura Donati, Claudio Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
topic_facet |
Genetics FOS Biological sciences Evolutionary Biology |
description |
Additional file 1: Figure S1. Antarctic CBS form two distinct clades in the order Frankiales, with characteristic metabolic potential. a) Maximum-likelihood phylogenetic tree based on GTDB 120 core genes including representative genomes (violet) and Antarctic CBS (green). b) Principal Coordinate Analysis (Jaccard distance) of the protein cluster profiles (60% identity) b) and of the metabolic potential. d) The Fisher’s exact test (Bonferroni corrected p<0.01) highlights enriched functional categories. Genomes and KEGG orthologs are clustered according to the Hamming distance between the profiles. The top four KEGG categories significantly more present in the Antarctic CBS are highlighted in the upper bars. Figure S2. Antarctic Jiangellales CBS reveal a substantial genome reduction compared to known species, with characteristic differences in metabolic potential. a) Maximum-likelihood phylogenetic tree based on GTDB genes, including representative genomes from the GTDB database (violet) and Antarctic CBS (green). b) KEGG orthologs that are significantly less frequent in Antarctic Jiagellales compared to reference (uncorrected p<0.05, Fisher’s exact test). Only the first 25 pathways (ranked by the total number of significant orthologs) are shown. c) Number of predicted protein coding sequences in Antarctic (green) and reference (violet) Jiangellales d) The heatmap shows the presence (dark green) of KEGG orthologs belonging to the carotenoid biosynthesis pathway. The only gene involved in carotenoid biosynthesis detected in both CBS and GTDB reference genomes is the crtD. e) The phylogenetic tree inferred on the crtD gene highlights a segregation of Antarctic Jiangellales. Figure S3. Antarctic Thermomicrobiales ( class Chloroflexia ) CBS reveal characteristic metabolic potential. a) The Fisher’s exact test (uncorrected p<0.05) highlights a significant presence, in Antarctic genomes, of orthologs involved in transport, compared to the reference Thermomicrobiales genomes. Only the first 30 pathways (ranked by the total number of orthologs called significant) are shown. b) The prediction of protein coding sequences shows an increment of the number of genes in Antarctic Thermomicrobiales compared to reference genomes. Figure S4. Distribution of the number of CBS that are specific to a given number of samples, taxonomically classified at the Class level. We identified a set of 10 CBS (belonging to the classes Actinobacteria and Alphaproteobacteria) that are present in at least 75% (14/18) of the samples. Figure S5. Mash Screen was used to validate the presence of CBS in the Antarctic samples. a) Distribution of the number of CBS marked as present by the containment score estimated by Mash screen. 1009 out of 1094 (92.2%) CBS have been confirmed by Mash (containment score >0.95, green dashed vertical line). b) Distribution of the number of CBS marked as present by the estimated multiplicity. Figure S6. Scatter plot of the ANI estimated by mapping versus the containment scores estimated by Mash screen for each sample. Horizontal and vertical dashed lines represent the ideal species-level threshold of 0.95 for the containment score and the estimated ANI, respectively. Figure S7. a) Percentage of reads that could be mapped to the CBS representatives, grouped by Class. b) Per sample percentage of the reads that could be mapped to the CBS representatives, grouped by Class. Figure S8. The “ Candidatus Jiangella antarctica ” was found in each sample. a) Scatter plot of the ANI estimated by mapping versus the containment scores estimated by Mash screen (p < 1.47x10 -21 ). b) Scatter plot of the median depth of coverage estimated by mapping versus the median multeplicity estimated by Mash. The line of equality is represented in black. Supplementary Figure S9. Jaccard distance between the KEGG functional profiles for each Order. |
format |
Text |
author |
Albanese, Davide Coleine, Claudia Rota-Stabelli, Omar Onofri, Silvano Tringe, Susannah G. Stajich, Jason E. Selbmann, Laura Donati, Claudio |
author_facet |
Albanese, Davide Coleine, Claudia Rota-Stabelli, Omar Onofri, Silvano Tringe, Susannah G. Stajich, Jason E. Selbmann, Laura Donati, Claudio |
author_sort |
Albanese, Davide |
title |
Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
title_short |
Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
title_full |
Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
title_fullStr |
Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
title_full_unstemmed |
Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages |
title_sort |
additional file 1 of pre-cambrian roots of novel antarctic cryptoendolithic bacterial lineages |
publisher |
figshare |
publishDate |
2021 |
url |
https://dx.doi.org/10.6084/m9.figshare.14251827.v1 https://springernature.figshare.com/articles/journal_contribution/Additional_file_1_of_Pre-Cambrian_roots_of_novel_Antarctic_cryptoendolithic_bacterial_lineages/14251827/1 |
geographic |
Antarctic The Antarctic |
geographic_facet |
Antarctic The Antarctic |
genre |
Antarc* Antarctic Antarctica |
genre_facet |
Antarc* Antarctic Antarctica |
op_relation |
https://dx.doi.org/10.1186/s40168-021-01021-0 https://dx.doi.org/10.6084/m9.figshare.14251827 |
op_rights |
Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 |
op_rightsnorm |
CC-BY |
op_doi |
https://doi.org/10.6084/m9.figshare.14251827.v1 https://doi.org/10.1186/s40168-021-01021-0 https://doi.org/10.6084/m9.figshare.14251827 |
_version_ |
1766271603766722560 |
spelling |
ftdatacite:10.6084/m9.figshare.14251827.v1 2023-05-15T14:01:38+02:00 Additional file 1 of Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages Albanese, Davide Coleine, Claudia Rota-Stabelli, Omar Onofri, Silvano Tringe, Susannah G. Stajich, Jason E. Selbmann, Laura Donati, Claudio 2021 https://dx.doi.org/10.6084/m9.figshare.14251827.v1 https://springernature.figshare.com/articles/journal_contribution/Additional_file_1_of_Pre-Cambrian_roots_of_novel_Antarctic_cryptoendolithic_bacterial_lineages/14251827/1 unknown figshare https://dx.doi.org/10.1186/s40168-021-01021-0 https://dx.doi.org/10.6084/m9.figshare.14251827 Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 CC-BY Genetics FOS Biological sciences Evolutionary Biology Text article-journal Journal contribution ScholarlyArticle 2021 ftdatacite https://doi.org/10.6084/m9.figshare.14251827.v1 https://doi.org/10.1186/s40168-021-01021-0 https://doi.org/10.6084/m9.figshare.14251827 2021-11-05T12:55:41Z Additional file 1: Figure S1. Antarctic CBS form two distinct clades in the order Frankiales, with characteristic metabolic potential. a) Maximum-likelihood phylogenetic tree based on GTDB 120 core genes including representative genomes (violet) and Antarctic CBS (green). b) Principal Coordinate Analysis (Jaccard distance) of the protein cluster profiles (60% identity) b) and of the metabolic potential. d) The Fisher’s exact test (Bonferroni corrected p<0.01) highlights enriched functional categories. Genomes and KEGG orthologs are clustered according to the Hamming distance between the profiles. The top four KEGG categories significantly more present in the Antarctic CBS are highlighted in the upper bars. Figure S2. Antarctic Jiangellales CBS reveal a substantial genome reduction compared to known species, with characteristic differences in metabolic potential. a) Maximum-likelihood phylogenetic tree based on GTDB genes, including representative genomes from the GTDB database (violet) and Antarctic CBS (green). b) KEGG orthologs that are significantly less frequent in Antarctic Jiagellales compared to reference (uncorrected p<0.05, Fisher’s exact test). Only the first 25 pathways (ranked by the total number of significant orthologs) are shown. c) Number of predicted protein coding sequences in Antarctic (green) and reference (violet) Jiangellales d) The heatmap shows the presence (dark green) of KEGG orthologs belonging to the carotenoid biosynthesis pathway. The only gene involved in carotenoid biosynthesis detected in both CBS and GTDB reference genomes is the crtD. e) The phylogenetic tree inferred on the crtD gene highlights a segregation of Antarctic Jiangellales. Figure S3. Antarctic Thermomicrobiales ( class Chloroflexia ) CBS reveal characteristic metabolic potential. a) The Fisher’s exact test (uncorrected p<0.05) highlights a significant presence, in Antarctic genomes, of orthologs involved in transport, compared to the reference Thermomicrobiales genomes. Only the first 30 pathways (ranked by the total number of orthologs called significant) are shown. b) The prediction of protein coding sequences shows an increment of the number of genes in Antarctic Thermomicrobiales compared to reference genomes. Figure S4. Distribution of the number of CBS that are specific to a given number of samples, taxonomically classified at the Class level. We identified a set of 10 CBS (belonging to the classes Actinobacteria and Alphaproteobacteria) that are present in at least 75% (14/18) of the samples. Figure S5. Mash Screen was used to validate the presence of CBS in the Antarctic samples. a) Distribution of the number of CBS marked as present by the containment score estimated by Mash screen. 1009 out of 1094 (92.2%) CBS have been confirmed by Mash (containment score >0.95, green dashed vertical line). b) Distribution of the number of CBS marked as present by the estimated multiplicity. Figure S6. Scatter plot of the ANI estimated by mapping versus the containment scores estimated by Mash screen for each sample. Horizontal and vertical dashed lines represent the ideal species-level threshold of 0.95 for the containment score and the estimated ANI, respectively. Figure S7. a) Percentage of reads that could be mapped to the CBS representatives, grouped by Class. b) Per sample percentage of the reads that could be mapped to the CBS representatives, grouped by Class. Figure S8. The “ Candidatus Jiangella antarctica ” was found in each sample. a) Scatter plot of the ANI estimated by mapping versus the containment scores estimated by Mash screen (p < 1.47x10 -21 ). b) Scatter plot of the median depth of coverage estimated by mapping versus the median multeplicity estimated by Mash. The line of equality is represented in black. Supplementary Figure S9. Jaccard distance between the KEGG functional profiles for each Order. Text Antarc* Antarctic Antarctica DataCite Metadata Store (German National Library of Science and Technology) Antarctic The Antarctic |