DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf

Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is...

Full description

Bibliographic Details
Main Authors: Ashley Byrne, Megan A. Supple, Roger Volden, Kristin L. Laidre, Beth Shapiro, Christopher Vollmers
Format: Dataset
Language:unknown
Published: 2019
Subjects:
ONT
Online Access:https://doi.org/10.3389/fgene.2019.00643.s001
https://figshare.com/articles/DataSheet_1_Depletion_of_Hemoglobin_Transcripts_and_Long-Read_Sequencing_Improves_the_Transcriptome_Annotation_of_the_Polar_Bear_Ursus_maritimus_pdf/8957348
id ftfrontimediafig:oai:figshare.com:article/8957348
record_format openpolar
spelling ftfrontimediafig:oai:figshare.com:article/8957348 2023-05-15T18:01:46+02:00 DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf Ashley Byrne Megan A. Supple Roger Volden Kristin L. Laidre Beth Shapiro Christopher Vollmers 2019-07-19T04:04:35Z https://doi.org/10.3389/fgene.2019.00643.s001 https://figshare.com/articles/DataSheet_1_Depletion_of_Hemoglobin_Transcripts_and_Long-Read_Sequencing_Improves_the_Transcriptome_Annotation_of_the_Polar_Bear_Ursus_maritimus_pdf/8957348 unknown doi:10.3389/fgene.2019.00643.s001 https://figshare.com/articles/DataSheet_1_Depletion_of_Hemoglobin_Transcripts_and_Long-Read_Sequencing_Improves_the_Transcriptome_Annotation_of_the_Polar_Bear_Ursus_maritimus_pdf/8957348 CC BY 4.0 CC-BY Genetics Genetic Engineering Biomarkers Developmental Genetics (incl. Sex Determination) Epigenetics (incl. Genome Methylation and Epigenomics) Gene Expression (incl. Microarray and other genome-wide approaches) Genome Structure and Regulation Genomics Genetically Modified Animals Livestock Cloning Gene and Molecular Therapy polar bear (Ursus maritimus) R2C2 long-read high throughput sequencing ONT Oxford Nanopore Technologies transcriptome annotation Dataset 2019 ftfrontimediafig https://doi.org/10.3389/fgene.2019.00643.s001 2019-08-07T23:03:26Z Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not ... Dataset polar bear Ursus maritimus Frontiers: Figshare
institution Open Polar
collection Frontiers: Figshare
op_collection_id ftfrontimediafig
language unknown
topic Genetics
Genetic Engineering
Biomarkers
Developmental Genetics (incl. Sex Determination)
Epigenetics (incl. Genome Methylation and Epigenomics)
Gene Expression (incl. Microarray and other genome-wide approaches)
Genome Structure and Regulation
Genomics
Genetically Modified Animals
Livestock Cloning
Gene and Molecular Therapy
polar bear (Ursus maritimus)
R2C2
long-read high throughput sequencing
ONT
Oxford Nanopore Technologies
transcriptome annotation
spellingShingle Genetics
Genetic Engineering
Biomarkers
Developmental Genetics (incl. Sex Determination)
Epigenetics (incl. Genome Methylation and Epigenomics)
Gene Expression (incl. Microarray and other genome-wide approaches)
Genome Structure and Regulation
Genomics
Genetically Modified Animals
Livestock Cloning
Gene and Molecular Therapy
polar bear (Ursus maritimus)
R2C2
long-read high throughput sequencing
ONT
Oxford Nanopore Technologies
transcriptome annotation
Ashley Byrne
Megan A. Supple
Roger Volden
Kristin L. Laidre
Beth Shapiro
Christopher Vollmers
DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
topic_facet Genetics
Genetic Engineering
Biomarkers
Developmental Genetics (incl. Sex Determination)
Epigenetics (incl. Genome Methylation and Epigenomics)
Gene Expression (incl. Microarray and other genome-wide approaches)
Genome Structure and Regulation
Genomics
Genetically Modified Animals
Livestock Cloning
Gene and Molecular Therapy
polar bear (Ursus maritimus)
R2C2
long-read high throughput sequencing
ONT
Oxford Nanopore Technologies
transcriptome annotation
description Transcriptome studies evaluating whole blood and tissues are often confounded by overrepresentation of highly abundant transcripts. These abundant transcripts are problematic, as they compete with and prevent the detection of rare RNA transcripts, obscuring their biological importance. This issue is more pronounced when using long-read sequencing technologies for isoform-level transcriptome analysis, as they have relatively lower throughput compared to short-read sequencers. As a result, long-read based transcriptome analysis is prohibitively expensive for non-model organisms. While there are off-the-shelf kits available for select model organisms capable of depleting highly abundant transcripts for alpha (HBA) and beta (HBB) hemoglobin, they are unsuitable for non-model organisms. To address this, we have adapted the recent CRISPR/Cas9-based depletion method (depletion of abundant sequences by hybridization) for long-read full-length cDNA sequencing approaches that we call Long-DASH. Using a recombinant Cas9 protein with appropriate guide RNAs, full-length hemoglobin transcripts can be depleted in vitro prior to performing any short- and long-read sequencing library preparations. Using this method, we sequenced depleted full-length cDNA in parallel using both our Oxford Nanopore Technology (ONT) based R2C2 long-read approach, as well as the Illumina short-read based Smart-seq2 approach. To showcase this, we have applied our methods to create an isoform-level transcriptome from whole blood samples derived from three polar bears (Ursus maritimus). Using Long-DASH, we succeeded in depleting hemoglobin transcripts and generated deep Smart-seq2 Illumina datasets and 3.8 million R2C2 full-length cDNA consensus reads. Applying Long-DASH with our isoform identification pipeline, Mandalorion, we discovered ∼6,000 high-confidence isoforms and a number of novel genes. This indicates that there is a high diversity of gene isoforms within U. maritimus not yet reported. This reproducible and straightforward approach has not ...
format Dataset
author Ashley Byrne
Megan A. Supple
Roger Volden
Kristin L. Laidre
Beth Shapiro
Christopher Vollmers
author_facet Ashley Byrne
Megan A. Supple
Roger Volden
Kristin L. Laidre
Beth Shapiro
Christopher Vollmers
author_sort Ashley Byrne
title DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
title_short DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
title_full DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
title_fullStr DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
title_full_unstemmed DataSheet_1_Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus).pdf
title_sort datasheet_1_depletion of hemoglobin transcripts and long-read sequencing improves the transcriptome annotation of the polar bear (ursus maritimus).pdf
publishDate 2019
url https://doi.org/10.3389/fgene.2019.00643.s001
https://figshare.com/articles/DataSheet_1_Depletion_of_Hemoglobin_Transcripts_and_Long-Read_Sequencing_Improves_the_Transcriptome_Annotation_of_the_Polar_Bear_Ursus_maritimus_pdf/8957348
genre polar bear
Ursus maritimus
genre_facet polar bear
Ursus maritimus
op_relation doi:10.3389/fgene.2019.00643.s001
https://figshare.com/articles/DataSheet_1_Depletion_of_Hemoglobin_Transcripts_and_Long-Read_Sequencing_Improves_the_Transcriptome_Annotation_of_the_Polar_Bear_Ursus_maritimus_pdf/8957348
op_rights CC BY 4.0
op_rightsnorm CC-BY
op_doi https://doi.org/10.3389/fgene.2019.00643.s001
_version_ 1766171282918866944