bianchini88/monitoring_sra-samples_EBIsearch: Monitoring the submission to ENA/SRA of sample data from Norwegian institutions

The purpose of this code is to monitor the submissions of Norwegian sequencing data to domain repositories. This is achieved by querying the "sra-samples" endpoint of EBI search , containing metadata of the samples deposited in the Sequence Read Archive (SRA) which is also synchronised wit...

Full description

Bibliographic Details
Main Author: Federico Bianchini
Format: Other/Unknown Material
Language:unknown
Published: Zenodo 2024
Subjects:
Online Access:https://doi.org/10.5281/zenodo.10948339
Description
Summary:The purpose of this code is to monitor the submissions of Norwegian sequencing data to domain repositories. This is achieved by querying the "sra-samples" endpoint of EBI search , containing metadata of the samples deposited in the Sequence Read Archive (SRA) which is also synchronised with the European Nucleotide Archive (ENA) as part of the International Nucleotide Sequence Data Collaboration (INSDC) . The code performs a query based on the country name Norway. Note that this identifies all the samples collected in Norway, not necessarily by Norwegian institutions or organisations. Extensive filtering is used to isolate the relevant data due to the lack of standardisation across the centres' names. The results are then plotted in two graphs, one for the BOTT (Bergen, Oslo, Trondheim, and Tromsø) universities and one for the Norwegian Institute of Public Health (NIPH) (Norwegian: Folkehelseinstituttet; FHI). A non-updated reference for these plots is available in the /plots4reference folder. Powered by EBI Search.