This paper proposes a hierarchical model for the analysis of spectrograms of animal calls. The motivation stems from analysing recordings of the so-called grunt calls emitted by various lemur species. Our goal is to identify a latent spectral shape that characterizes each species and facilitates measuring dissimilarities between them. The model addresses the synchronization of animal vocalizations, due to varying time-lengths and speeds, with nonstationary temporal patterns and accounts for periodic sampling artifacts produced by the time discretization of analogue signals. The former is achieved through a synchronization function, and the latter is modelled using a circular representation of time. To overcome the curse of dimensionality inherent in the model’s implementation, we employ the Nearest Neighbour Gaussian Process, and posterior samples are obtained using the Markov chain Monte Carlo method. We apply the model to a real dataset comprising sounds of eight different species. We define a representative sound for each species and compare them using a distance measure. Cross-validation is used to evaluate the predictive capability of our proposal and explore special cases. Additionally, a simulation study is used to demonstrate how effectively the Markov chain Monte Carlo algorithm can identify the parameters used to generate the data.
Bayesian inference for latent spectral shapes / Valente, Daria; Yip, Hiu Ching; Mastrantonio, Gianluca; Bibbona, Enrico; Friard, Olivier; Gamba, Marco. - In: JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS. - ISSN 0035-9254. - ELETTRONICO. - (2025), pp. 1-19. [10.1093/jrsssc/qlaf057]
Bayesian inference for latent spectral shapes
Hiu Ching Yip;Gianluca Mastrantonio;Enrico Bibbona;
2025
Abstract
This paper proposes a hierarchical model for the analysis of spectrograms of animal calls. The motivation stems from analysing recordings of the so-called grunt calls emitted by various lemur species. Our goal is to identify a latent spectral shape that characterizes each species and facilitates measuring dissimilarities between them. The model addresses the synchronization of animal vocalizations, due to varying time-lengths and speeds, with nonstationary temporal patterns and accounts for periodic sampling artifacts produced by the time discretization of analogue signals. The former is achieved through a synchronization function, and the latter is modelled using a circular representation of time. To overcome the curse of dimensionality inherent in the model’s implementation, we employ the Nearest Neighbour Gaussian Process, and posterior samples are obtained using the Markov chain Monte Carlo method. We apply the model to a real dataset comprising sounds of eight different species. We define a representative sound for each species and compare them using a distance measure. Cross-validation is used to evaluate the predictive capability of our proposal and explore special cases. Additionally, a simulation study is used to demonstrate how effectively the Markov chain Monte Carlo algorithm can identify the parameters used to generate the data.| File | Dimensione | Formato | |
|---|---|---|---|
|
pubblicato_qlaf057.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
1.69 MB
Formato
Adobe PDF
|
1.69 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3004924
