Perl script for the extraction of the most abundant sequences for each OTU and each sample

This ZIP file contains a script called GOMS: Get OTU Main Sequence. GOMS is a Perl program based on mothur files. It thus requiers several files from mothur (http://www.mothur.org/wiki/454_SOP or http://www.mothur.org/wiki/MiSeq_SOP): .list file, .names file, .groups file and .fasta file. The user m...

Full description

Bibliographic Details
Main Authors: Galan, Maxime, Razzauti, Maria, Bard, Emilie, Bernard, Maria, Brouat, Carine, Charbonnel, Nathalie, Dehne-Garcia, Alexandre, Loiseau, Anne, Tatard, Caroline, Tamisier, Lucie, Vayssier-Taussat, Muriel, Vignes, Hélène, Cosson, Jean-François
Format: Dataset
Language:unknown
Published: Dryad Digital Repository 2016
Subjects:
Online Access:https://dx.doi.org/10.5061/dryad.m3p7d/13
http://datadryad.org/resource/doi:10.5061/dryad.m3p7d/13
_version_ 1821692167348813824
author Galan, Maxime
Razzauti, Maria
Bard, Emilie
Bernard, Maria
Brouat, Carine
Charbonnel, Nathalie
Dehne-Garcia, Alexandre
Loiseau, Anne
Tatard, Caroline
Tamisier, Lucie
Vayssier-Taussat, Muriel
Vignes, Hélène
Cosson, Jean-François
author_facet Galan, Maxime
Razzauti, Maria
Bard, Emilie
Bernard, Maria
Brouat, Carine
Charbonnel, Nathalie
Dehne-Garcia, Alexandre
Loiseau, Anne
Tatard, Caroline
Tamisier, Lucie
Vayssier-Taussat, Muriel
Vignes, Hélène
Cosson, Jean-François
author_sort Galan, Maxime
collection DataCite
description This ZIP file contains a script called GOMS: Get OTU Main Sequence. GOMS is a Perl program based on mothur files. It thus requiers several files from mothur (http://www.mothur.org/wiki/454_SOP or http://www.mothur.org/wiki/MiSeq_SOP): .list file, .names file, .groups file and .fasta file. The user must also provide the number of the OTU and the treshold used to form the OTUs. GOMS generates a fasta file with, for each sample, the highly representative sequence of a given OTU, meaning the unique sequence which has the most important number of copies. Some information are also provided within the fasta file like the number of sequences and unique sequences assigned to the OTU and the number of copies of the main unique sequence.
format Dataset
genre Rattus rattus
genre_facet Rattus rattus
id ftdatacite:10.5061/dryad.m3p7d/13
institution Open Polar
language unknown
op_collection_id ftdatacite
op_doi https://doi.org/10.5061/dryad.m3p7d/13
https://doi.org/10.5061/dryad.m3p7d
op_relation https://dx.doi.org/10.5061/dryad.m3p7d
op_rights http://creativecommons.org/publicdomain/zero/1.0
op_rightsnorm CC0
publishDate 2016
publisher Dryad Digital Repository
record_format openpolar
spelling ftdatacite:10.5061/dryad.m3p7d/13 2025-01-17T00:27:21+00:00 Perl script for the extraction of the most abundant sequences for each OTU and each sample Galan, Maxime Razzauti, Maria Bard, Emilie Bernard, Maria Brouat, Carine Charbonnel, Nathalie Dehne-Garcia, Alexandre Loiseau, Anne Tatard, Caroline Tamisier, Lucie Vayssier-Taussat, Muriel Vignes, Hélène Cosson, Jean-François 2016 https://dx.doi.org/10.5061/dryad.m3p7d/13 http://datadryad.org/resource/doi:10.5061/dryad.m3p7d/13 unknown Dryad Digital Repository https://dx.doi.org/10.5061/dryad.m3p7d http://creativecommons.org/publicdomain/zero/1.0 CC0 Zoonoses Rodents West Africa metagenomics 16S rRNA amplicon sequencing MiSeq Next-generation sequencing NGS High-throughput sequencing HTS Metabarcoding Epidemiology Disease monitoring Senegal Mus musculus domesticus Rattus rattus Mastomys natalensis Mastomys erythroleucus Borrelia Bartonella Mycoplasma Ehrlichia Rickettsia Streptobacillus Orientia dataset Dataset DataFile 2016 ftdatacite https://doi.org/10.5061/dryad.m3p7d/13 https://doi.org/10.5061/dryad.m3p7d 2021-11-05T12:55:41Z This ZIP file contains a script called GOMS: Get OTU Main Sequence. GOMS is a Perl program based on mothur files. It thus requiers several files from mothur (http://www.mothur.org/wiki/454_SOP or http://www.mothur.org/wiki/MiSeq_SOP): .list file, .names file, .groups file and .fasta file. The user must also provide the number of the OTU and the treshold used to form the OTUs. GOMS generates a fasta file with, for each sample, the highly representative sequence of a given OTU, meaning the unique sequence which has the most important number of copies. Some information are also provided within the fasta file like the number of sequences and unique sequences assigned to the OTU and the number of copies of the main unique sequence. Dataset Rattus rattus DataCite
spellingShingle Zoonoses
Rodents
West Africa
metagenomics
16S rRNA amplicon sequencing
MiSeq
Next-generation sequencing
NGS
High-throughput sequencing
HTS
Metabarcoding
Epidemiology
Disease monitoring
Senegal
Mus musculus domesticus
Rattus rattus
Mastomys natalensis
Mastomys erythroleucus
Borrelia
Bartonella
Mycoplasma
Ehrlichia
Rickettsia
Streptobacillus
Orientia
Galan, Maxime
Razzauti, Maria
Bard, Emilie
Bernard, Maria
Brouat, Carine
Charbonnel, Nathalie
Dehne-Garcia, Alexandre
Loiseau, Anne
Tatard, Caroline
Tamisier, Lucie
Vayssier-Taussat, Muriel
Vignes, Hélène
Cosson, Jean-François
Perl script for the extraction of the most abundant sequences for each OTU and each sample
title Perl script for the extraction of the most abundant sequences for each OTU and each sample
title_full Perl script for the extraction of the most abundant sequences for each OTU and each sample
title_fullStr Perl script for the extraction of the most abundant sequences for each OTU and each sample
title_full_unstemmed Perl script for the extraction of the most abundant sequences for each OTU and each sample
title_short Perl script for the extraction of the most abundant sequences for each OTU and each sample
title_sort perl script for the extraction of the most abundant sequences for each otu and each sample
topic Zoonoses
Rodents
West Africa
metagenomics
16S rRNA amplicon sequencing
MiSeq
Next-generation sequencing
NGS
High-throughput sequencing
HTS
Metabarcoding
Epidemiology
Disease monitoring
Senegal
Mus musculus domesticus
Rattus rattus
Mastomys natalensis
Mastomys erythroleucus
Borrelia
Bartonella
Mycoplasma
Ehrlichia
Rickettsia
Streptobacillus
Orientia
topic_facet Zoonoses
Rodents
West Africa
metagenomics
16S rRNA amplicon sequencing
MiSeq
Next-generation sequencing
NGS
High-throughput sequencing
HTS
Metabarcoding
Epidemiology
Disease monitoring
Senegal
Mus musculus domesticus
Rattus rattus
Mastomys natalensis
Mastomys erythroleucus
Borrelia
Bartonella
Mycoplasma
Ehrlichia
Rickettsia
Streptobacillus
Orientia
url https://dx.doi.org/10.5061/dryad.m3p7d/13
http://datadryad.org/resource/doi:10.5061/dryad.m3p7d/13