MalaMix dataset: contextual and metabarcoding data

1. INTRODUCTION MalaMix is a compiled metabarcoding dataset composed of 451 marine samples collected from a range of depths - from the surface (3m) to deep waters (as far down as 4800m). This dataset covers three ocean layers: the epi- (0-200m – including DCM), meso- (200-1000m) and bathypelagic (10...

Full description

Bibliographic Details
Main Authors: Junger, Pedro C., Sarmento, Hugo, Giner, Caterina R., Mestre, Mireia, Sebastián, Marta, Moran, Xosé A. G., Arístegui, Javier, Agustí, Susana, Duarte, Carlos M., Acinas, Silvia G., Massana, Ramon, Gasol, Josep M., Logares, Ramiro
Format: Dataset
Language:English
Published: 2023
Subjects:
Online Access:https://zenodo.org/record/8363877
https://doi.org/10.5281/zenodo.8363877
Description
Summary:1. INTRODUCTION MalaMix is a compiled metabarcoding dataset composed of 451 marine samples collected from a range of depths - from the surface (3m) to deep waters (as far down as 4800m). This dataset covers three ocean layers: the epi- (0-200m – including DCM), meso- (200-1000m) and bathypelagic (1000-4000m). MalaMix combines samples obtained during two oceanographic expeditions with similar sampling strategies: i) the Malaspina-2010 global expedition that produced 263 samples collected between December 2010 and July 2011 from 120 stations distributed along the tropical and subtropical portions (latitudes between 35° N and 40° S) of the Pacific, Atlantic and Indian oceans; and ii) the HotMix trans-Mediterranean cruise that produced 188 samples collected between April and May 2014 in 29 stations distributed along the whole Mediterranean Sea (from -5° W to 33° E) and the adjacent Northeast Atlantic Ocean. MalaMix comprises: a 16S-V4V5 rRNA gene ASV table (MalaMix_16S.csv); an 18S-V4 rRNA gene ASV table (MalaMix_18S.csv); two tables of contextual metadata (MalaMix_EnvData_16S and MalaMix_EnvData_18S) including 6 standardized environmental parameters (temperature [°C], salinity, fluorescence, PO43− [µmol L-1], NO3− [µmol L-1], and SiO2 [µmol L-1]) as well as species taxonomic and phylogenetic diversity metrics a table (MalaMix_FCdata.csv) with flow cytometry microbial counts [cell mL-1] and bacterial activity measurements [pmol Leu L-1 h-1]; a README file (README_Metadata.csv) describing the meaning and units of each variable column in the metadata tables. The raw DNA sequences are publicly available at the European Nucleotide Archive (https://www.ebi.ac.uk/ena) under accession numbers PRJEB23913 [18S rRNA genes] & PRJEB25224 [16S rRNA genes] for the Malaspina surface dataset; PRJEB23771 [18S rRNA genes] & PRJEB45015 [16S rRNA genes] for the Malaspina vertical profiles; PRJEB45011 [16S rRNA genes] & PRJEB45014 [18S rRNA genes] for the Malaspina deep sea dataset; and PRJEB44683 [18S rRNA genes] & ...