An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data

Abstract Background Within Plasmodium falciparum merozoite surface protein 1 (MSP1), the N-terminal block 2 region is a highly polymorphic target of naturally acquired antibody responses. The antigenic diversity is determined by complex repeat sequences as well as non-repeat sequences, grouping into...

Full description

Bibliographic Details
Published in:Malaria Journal
Main Authors: Harvey Aspeling-Jones, David J. Conway
Format: Article in Journal/Newspaper
Language:English
Published: BMC 2018
Subjects:
Online Access:https://doi.org/10.1186/s12936-018-2475-2
https://doaj.org/article/8d0ddaec0ad043a5a23421b4045ba3d9
id ftdoajarticles:oai:doaj.org/article:8d0ddaec0ad043a5a23421b4045ba3d9
record_format openpolar
spelling ftdoajarticles:oai:doaj.org/article:8d0ddaec0ad043a5a23421b4045ba3d9 2023-05-15T15:18:01+02:00 An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data Harvey Aspeling-Jones David J. Conway 2018-10-01T00:00:00Z https://doi.org/10.1186/s12936-018-2475-2 https://doaj.org/article/8d0ddaec0ad043a5a23421b4045ba3d9 EN eng BMC http://link.springer.com/article/10.1186/s12936-018-2475-2 https://doaj.org/toc/1475-2875 doi:10.1186/s12936-018-2475-2 1475-2875 https://doaj.org/article/8d0ddaec0ad043a5a23421b4045ba3d9 Malaria Journal, Vol 17, Iss 1, Pp 1-13 (2018) Polymorphism msp1 Repeat sequences Sequence mapping De novo assembly Molecular epidemiology Arctic medicine. Tropical medicine RC955-962 Infectious and parasitic diseases RC109-216 article 2018 ftdoajarticles https://doi.org/10.1186/s12936-018-2475-2 2022-12-31T04:01:51Z Abstract Background Within Plasmodium falciparum merozoite surface protein 1 (MSP1), the N-terminal block 2 region is a highly polymorphic target of naturally acquired antibody responses. The antigenic diversity is determined by complex repeat sequences as well as non-repeat sequences, grouping into three major allelic types that appear to be maintained within populations by natural selection. Within these major types, many distinct allelic sequences have been described in different studies, but the extent and significance of the diversity remains unresolved. Methods To survey the diversity more extensively, block 2 allelic sequences in the msp1 gene were characterized in 2400 P. falciparum infection isolates with whole genome short read sequence data available from the Pf3K project, and compared with the data from previous studies. Results Mapping the short read sequence data in the 2400 isolates to a reference library of msp1 block 2 allelic sequences yielded 3815 allele scores at the level of major allelic family types, with 46% of isolates containing two or more of these major types. Overall frequencies were similar to those previously reported in other samples with different methods, the K1-like allelic type being most common in Africa, MAD20-like most common in Southeast Asia, and RO33-like being the third most abundant type in each continent. The rare MR type, formed by recombination between MAD20-like and RO33-like alleles, was only seen in Africa and very rarely in the Indian subcontinent but not in Southeast Asia. A combination of mapped short read assembly approaches enabled 1522 complete msp1 block 2 sequences to be determined, among which there were 363 different allele sequences, of which 246 have not been described previously. In these data, the K1-like msp1 block 2 alleles are most diverse and encode 225 distinct amino acid sequences, compared with 123 different MAD20-like, 9 RO33-like and 6 MR type sequences. Within each of the major types, the different allelic sequences show highly skewed ... Article in Journal/Newspaper Arctic Directory of Open Access Journals: DOAJ Articles Arctic Indian Malaria Journal 17 1
institution Open Polar
collection Directory of Open Access Journals: DOAJ Articles
op_collection_id ftdoajarticles
language English
topic Polymorphism
msp1
Repeat sequences
Sequence mapping
De novo assembly
Molecular epidemiology
Arctic medicine. Tropical medicine
RC955-962
Infectious and parasitic diseases
RC109-216
spellingShingle Polymorphism
msp1
Repeat sequences
Sequence mapping
De novo assembly
Molecular epidemiology
Arctic medicine. Tropical medicine
RC955-962
Infectious and parasitic diseases
RC109-216
Harvey Aspeling-Jones
David J. Conway
An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
topic_facet Polymorphism
msp1
Repeat sequences
Sequence mapping
De novo assembly
Molecular epidemiology
Arctic medicine. Tropical medicine
RC955-962
Infectious and parasitic diseases
RC109-216
description Abstract Background Within Plasmodium falciparum merozoite surface protein 1 (MSP1), the N-terminal block 2 region is a highly polymorphic target of naturally acquired antibody responses. The antigenic diversity is determined by complex repeat sequences as well as non-repeat sequences, grouping into three major allelic types that appear to be maintained within populations by natural selection. Within these major types, many distinct allelic sequences have been described in different studies, but the extent and significance of the diversity remains unresolved. Methods To survey the diversity more extensively, block 2 allelic sequences in the msp1 gene were characterized in 2400 P. falciparum infection isolates with whole genome short read sequence data available from the Pf3K project, and compared with the data from previous studies. Results Mapping the short read sequence data in the 2400 isolates to a reference library of msp1 block 2 allelic sequences yielded 3815 allele scores at the level of major allelic family types, with 46% of isolates containing two or more of these major types. Overall frequencies were similar to those previously reported in other samples with different methods, the K1-like allelic type being most common in Africa, MAD20-like most common in Southeast Asia, and RO33-like being the third most abundant type in each continent. The rare MR type, formed by recombination between MAD20-like and RO33-like alleles, was only seen in Africa and very rarely in the Indian subcontinent but not in Southeast Asia. A combination of mapped short read assembly approaches enabled 1522 complete msp1 block 2 sequences to be determined, among which there were 363 different allele sequences, of which 246 have not been described previously. In these data, the K1-like msp1 block 2 alleles are most diverse and encode 225 distinct amino acid sequences, compared with 123 different MAD20-like, 9 RO33-like and 6 MR type sequences. Within each of the major types, the different allelic sequences show highly skewed ...
format Article in Journal/Newspaper
author Harvey Aspeling-Jones
David J. Conway
author_facet Harvey Aspeling-Jones
David J. Conway
author_sort Harvey Aspeling-Jones
title An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
title_short An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
title_full An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
title_fullStr An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
title_full_unstemmed An expanded global inventory of allelic variation in the most extremely polymorphic region of Plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
title_sort expanded global inventory of allelic variation in the most extremely polymorphic region of plasmodium falciparum merozoite surface protein 1 provided by short read sequence data
publisher BMC
publishDate 2018
url https://doi.org/10.1186/s12936-018-2475-2
https://doaj.org/article/8d0ddaec0ad043a5a23421b4045ba3d9
geographic Arctic
Indian
geographic_facet Arctic
Indian
genre Arctic
genre_facet Arctic
op_source Malaria Journal, Vol 17, Iss 1, Pp 1-13 (2018)
op_relation http://link.springer.com/article/10.1186/s12936-018-2475-2
https://doaj.org/toc/1475-2875
doi:10.1186/s12936-018-2475-2
1475-2875
https://doaj.org/article/8d0ddaec0ad043a5a23421b4045ba3d9
op_doi https://doi.org/10.1186/s12936-018-2475-2
container_title Malaria Journal
container_volume 17
container_issue 1
_version_ 1766348257456291840