Dirichlet process mixture model based on topologically augmented signal representation for clustering infant vocalizations

Guillem Bonafos, Clara Bourot, Pierre Pudlo, Jean-Marc Freyermuth, Laurence Reboul, Samuel Tronçon, Arnaud Rey

Jul-8-2024, arXiv.org Machine Learning

We propose a new method for clustering infant vocalizations taken from audio recordings made once a month during the first 12 months of a child's life. We use a topologically augmented representation of the vocalizations, employing two persistence diagrams for each vocalization: one computed on the surface of its spectrogram and one on the Takens' embedding of the vocalization. A synthetic persistent variable is derived from each diagram and added to the MFCCs (Mel-frequency cepstral coefficients). Using this representation, we fit a non-parametric Bayesian mixture model with a Dirichlet process prior to model the number of components. This procedure leads to a novel data-driven categorization of vocal productions. We find 8 clusters of vocalizations and compare their temporal distributions and acoustic profiles over the first 12 months of life.
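
To make the Takens' embedding step concrete, here is a minimal sketch of a delay-coordinate embedding of a one-dimensional signal, the point cloud on which a persistence diagram could then be computed. It is an illustration under stated assumptions, not the authors' implementation: the function name `takens_embedding` and the `dimension` and `delay` values are hypothetical, and the abstract does not say which values the paper uses.

```python
from typing import List, Sequence, Tuple


def takens_embedding(
    signal: Sequence[float], dimension: int, delay: int
) -> List[Tuple[float, ...]]:
    """Return the delay-coordinate vectors of a 1-D signal.

    Point i is (x[i], x[i + delay], ..., x[i + (dimension - 1) * delay]).
    `dimension` and `delay` are illustrative parameters (assumptions),
    not values taken from the paper.
    """
    if dimension < 1 or delay < 1:
        raise ValueError("dimension and delay must be positive integers")
    # Number of complete delay vectors that fit inside the signal.
    n_points = len(signal) - (dimension - 1) * delay
    if n_points <= 0:
        return []  # signal too short for this dimension/delay pair
    return [
        tuple(signal[i + k * delay] for k in range(dimension))
        for i in range(n_points)
    ]


# Toy signal standing in for audio samples (hypothetical data).
cloud = takens_embedding([0.0, 1.0, 2.0, 3.0, 4.0, 5.0], dimension=3, delay=2)
print(cloud)
```

Each tuple is one point in a `dimension`-dimensional space. A persistent homology library would take such a point cloud as input to produce the persistence diagram mentioned above.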

Country:
- North America > United States
  - New York (0.04)
- Europe
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.05)
- Asia > Singapore
  - Central Region > Singapore (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Health & Medicine (0.68)

Technology:
- Information Technology
  - Data Science (0.94)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (0.88)
