Matrices of the frequency and similarity of Arabic letters and allographs

Published online: 19 February 2020 Indicators of letter frequency and similarity have long been available for Indo-European languages. They have not only been pivotal in controlling the design of experimental psycholinguistic studies seeking to determine the factors that underlie reading ability and...

Full description

Bibliographic Details
Published in:Behavior Research Methods
Main Authors: Boudelaa, Sami, Perea, Manuel, Carreiras, Manuel
Format: Article in Journal/Newspaper
Language:English
Published: Behavior Research Methods 2020
Subjects:
Online Access:http://hdl.handle.net/10810/48762
https://doi.org/10.3758/s13428-020-01353-z
Description
Summary:Published online: 19 February 2020 Indicators of letter frequency and similarity have long been available for Indo-European languages. They have not only been pivotal in controlling the design of experimental psycholinguistic studies seeking to determine the factors that underlie reading ability and literacy acquisition, but have also been useful for studies examining the more general aspects of human cognition. Despite their importance, however, such indicators are still not available for Modern Standard Arabic (MSA), a language that, by virtue of its orthographic system, presents an invaluable environment for the experimental investigation of visual word processing. This paper presents for the first time the frequencies of Arabic letters and their allographs based on a 40-million-word corpus, along with their similarity/confusability indicators in three domains: (1) the visual domain, based on human ratings; (2) the auditory domain, based on an analysis of the phonetic features of letter sounds; and (3) the motoric domain, based on an analysis of the stroke features used to write letters and their allographs. Taken together, the frequency and similarity of Arabic letters and their allographs in the visual and motoric domains, as well as the similarities among the letter sounds, will be useful for researchers interested in the processes underpinning orthographic processing, visual word recognition, reading, and literacy acquisition. This research was funded by two United Arab Emirates University College of Humanities and Social Sciences grants to Sami Boudelaa (G00002367 and G00003158).