Improved versions of the Levenshtein distance method for comparing sequence information in animals' vocalisations: tests using humpback whale song

Bibliographic Details
Published in: Behaviour
Main Authors: Garland, Ellen C., Lilley, Matthew S., Goldizen, Anne W., Rekdahl, Melinda L., Garrigue, Claire, Noad, Michael J.
Format: Article in Journal/Newspaper
Language: English
Published: 2012
Online Access: https://research-portal.st-andrews.ac.uk/en/researchoutput/improved-versions-of-the-levenshtein-distance-method-for-comparing-sequence-information-in-animals-vocalisations(d3248c86-248a-4ab6-9f7f-9e778c946659).html
https://doi.org/10.1163/1568539X-00003032
Description
Summary: Animals can communicate using visual and acoustic displays to convey information to conspecifics. In some cases, such displays are produced in highly stereotyped and repetitive sequences. Here we use a quantitative analysis technique, the Levenshtein distance, to assess similarity in sequences of displays at both the population and individual levels. We review two existing variations of the method and present two new variations that complement and extend these existing techniques. Three of the methods include the use of a median string sequence and three use a normalisation of the original equation. Humpback whale song theme sequences from multiple populations, years and song types (different variations of the display) are used as examples to illustrate the application and success of each variation. A novel outcome of this technique is that it can produce a threshold measure of similarity to assess when behavioural sequences are so dissimilar that they must be considered different, with a measure of the probability of such clusters being distinct. The Levenshtein distance is applicable to all behavioural data produced in sequences and its use should not be limited to acoustical studies.
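
Illustration: the summary names three ingredients (the Levenshtein distance, a normalisation, and a median string) without giving formulas. The Python sketch below shows one common way each could be computed on sequences of theme labels. It is not taken from the article. The function names, the theme labels "T1" to "T5", the normalisation by the length of the longer sequence, and the use of a set median (the member of the set with the smallest summed distance to the others) in place of a true median string are all assumptions made for illustration.

```python
from typing import Hashable, List, Sequence


def levenshtein(a: Sequence[Hashable], b: Sequence[Hashable]) -> int:
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    # prev[j] holds the distance between the first i-1 items of a and the first j items of b.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from the first i items of a to an empty sequence
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute x with y (or match)
        prev = curr
    return prev[-1]


def normalised_similarity(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Assumed normalisation: 1 - distance / length of the longer sequence."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty sequences are treated as identical
    return 1.0 - levenshtein(a, b) / longest


def set_median(seqs: List[Sequence[Hashable]]) -> Sequence[Hashable]:
    """Assumed stand-in for a median string: the member closest to all others."""
    return min(seqs, key=lambda s: sum(levenshtein(s, t) for t in seqs))


# Hypothetical theme sequences, not data from the article.
song_a = ["T1", "T2", "T3", "T4"]
song_b = ["T1", "T3", "T4", "T5"]
song_c = ["T1", "T2", "T3", "T4", "T5"]

print(levenshtein(song_a, song_b))            # 2 (delete T2, insert T5)
print(normalised_similarity(song_a, song_b))  # 0.5
print(set_median([song_a, song_b, song_c]))   # song_c
```

Because the functions accept any sequence of hashable items, the same sketch applies to strings of characters or to lists of coded behaviours, in line with the summary's point that the distance is not limited to acoustic data.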