The Ocean Gene Atlas v2.0: online exploration of the biogeography and phylogeny of plankton genes

Abstract Testing hypothesis about the biogeography of genes using large data resources such as Tara Oceans marine metagenomes and metatranscriptomes requires significant hardware resources and programming skills. The new release of the ‘Ocean Gene Atlas’ (OGA2) is a freely available intuitive online...

Full description

Bibliographic Details
Published in:Nucleic Acids Research
Main Authors: Vernette, Caroline, Lecubin, Julien, Sánchez, Pablo, Acinas, Silvia, Babin, Marcel, Bork, Peer, Boss, Emmanuel, Bowler, Chris, Cochrane, Guy, de Vargas, Colomban, Gorsky, Gabriel, Guidi, Lionel, Grimsley, Nigel, Iudicone, Daniele, Jaillon, Olivier, Kandels-Lewis, Stefanie, Karp-Boss, Lee, Karsenti, Eric, Not, Fabrice, Ogata, Hiroyuki, Poulton, Nicole, Pesant, Stéphane, Sardet, Christian, Speich, Sabrina, Stemmann, Lars, Sullivan, Matthew, Sunagawa, Shinichi, Wincker, Patrick, Delmont, Tom, Pelletier, Eric, Hingamp, Pascal, Lescot, Magali
Other Authors: Génomique métabolique (UMR 8030), Genoscope - Centre national de séquençage Evry (GENOSCOPE), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université d'Évry-Val-d'Essonne (UEVE)-Centre National de la Recherche Scientifique (CNRS), Global Oceans Systems Ecology & Evolution - Tara Oceans (GOSEE), Université de Perpignan Via Domitia (UPVD)-École Pratique des Hautes Études (EPHE), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Aix Marseille Université (AMU)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université d'Évry-Val-d'Essonne (UEVE)-Université de Toulon (UTLN)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes 2016-2019 (UGA 2016-2019 )-Institut de Recherche pour le Développement (IRD France-Nord )-Ecole Normale Supérieure Paris-Saclay (ENS Paris Saclay)-European Molecular Biology Laboratory (EMBL)-École Centrale de Nantes (Nantes Univ - ECN), Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Université australe du Chili, Institut méditerranéen d'océanologie (MIO), Institut de Recherche pour le Développement (IRD)-Aix Marseille Université (AMU)-Institut national des sciences de l'Univers (INSU - CNRS)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS), Centre National de la Recherche Scientifique (CNRS)
Format: Article in Journal/Newspaper
Language:English
Published: HAL CCSD 2022
Subjects:
Online Access:https://cea.hal.science/cea-04321536
https://doi.org/10.1093/nar/gkac420
Description
Summary:Abstract Testing hypothesis about the biogeography of genes using large data resources such as Tara Oceans marine metagenomes and metatranscriptomes requires significant hardware resources and programming skills. The new release of the ‘Ocean Gene Atlas’ (OGA2) is a freely available intuitive online service to mine large and complex marine environmental genomic databases. OGA2 datasets available have been extended and now include, from the Tara Oceans portfolio: (i) eukaryotic Metagenome-Assembled-Genomes (MAGs) and Single-cell Assembled Genomes (SAGs) (10.2E+6 coding genes), (ii) version 2 of Ocean Microbial Reference Gene Catalogue (46.8E+6 non-redundant genes), (iii) 924 MetaGenomic Transcriptomes (7E+6 unigenes), (iv) 530 MAGs from an Arctic MAG catalogue (1E+6 genes) and (v) 1888 Bacterial and Archaeal Genomes (4.5E+6 genes), and an additional dataset from the Malaspina 2010 global circumnavigation: (vi) 317 Malaspina Deep Metagenome Assembled Genomes (0.9E+6 genes). Novel analyses enabled by OGA2 include phylogenetic tree inference to visualize user queries within their context of sequence homologues from both the marine environmental dataset and the RefSeq database. An Application Programming Interface (API) now allows users to query OGA2 using command-line tools, hence providing local workflow integration. Finally, gene abundance can be interactively filtered directly on map displays using any of the available environmental variables. Ocean Gene Atlas v2.0 is freely-available at: https://tara-oceans.mio.osupytheas.fr/ocean-gene-atlas/.