Same data, different results? : machine learning approaches in bioacoustics
We investigated the impact of using different feature extraction (spectro-temporal measurements, linear and Mel-frequency cepstral coefficients (MFCC), as well as highly comparative time-series analysis) and classification methods (discriminant function analysis, neural networks, random forests (RF), and support vector machines) on the consistency of caller identity classification accuracy across 16 mammalian datasets. We found that MFCCs and RFs yield consistently reliable results across datasets, facilitating a standardised approach across species that generates directly comparable data. These findings remained consistent across vocalisation sample sizes and number of individuals considered. We offer guidelines for processing and analysing mammalian vocalisations, fostering greater comparability and advancing our understanding of the evolutionary significance of acoustic communication in diverse mammalian species.
Preview
Cite
Access Statistic


