Topological data analysis of human vowels: Persistent homologies across representation spaces

Topological data analysis is a mathematical framework that characterizes the global shape of complex data by identifying structural features—such as clusters, loops, or holes—that remain stable despite noise or small variations. We studied how vocal signals, example shown in (a), should be represented so that topology-based algorithms can correctly classify it.
We compared the topological features extracted from several signal representations, including spectrograms (b), spectrogram zeros (c), and Takens’ embeddings (d). Using a publicly available dataset of 11,200 recorded vowel utterances, we conducted an empirical analysis demonstrating that these topological features provide additional discriminative information for both speaker and vowel classification. Moreover, features derived from different signal representations appear to be complementary. Our results suggest that low-persistence topological features, often dismissed as “topological noise”, encode important information about speech.
Guillem Bonafos, Pierre Pudlo, Jean-Marc Freyermuth, Samuel Tronçon, and Arnaud Rey.
2026.
Speech Communication 178 (March): 103363 — @HAL
On the Acquisition of Typing Skills without Formal Training by School-Aged Children
Svetlana Pinet, Christelle Zielinski, F.-Xavier Alario, and Marieke Longcamp.
2025. Reading and Writing, November 25. — @HAL
Event‐Related Brain Potentials and Frequency‐Following Response to Syllables in Newborns and Adults
G. Danielou, E. Hervé, A. S. Dubarry, B. Desnous, and C. François.
2026. European Journal of Neuroscience 63 (3): e70418 — @HAL
Bridging the Communication Gap: Pragmatics and Interactional Dynamics in Deaf and Hard of Hearing Children
Chiara Mazzocconi, Céline Hidalgo, Charlie Hallart, Stéphane Roman, Roxane Bertrand, Leonardo Lancia, Daniele Schön
2026. Journal of Speech, Language, and Hearing Research, February 20, 1–38 — @HAL
Beyond Phonemic Awareness: The Alphabetic Principle Predicts Reading Acquisition in a Nationwide Longitudinal Study
Paul Gioia, Johannes C. Ziegler, and Jerome Deauvieau.
2026. Cognition 271 (June): 106457 — @HAL
Neural Correlates of Conversational Feedback
Philippe Blache, Deirdre Bolger, Mireille Besson, Auriane Boudin, and Roxane Bertrand.
2026. Language, Cognition and Neuroscience, January 27, 1–26 — @HAL
Opportunistic Observation of Long-Finned Pilot Whales Interacting with a Solitary Humpback Whale in the Gascogne Gulf (Northwest Atlantic)
Paul Best, Lise Habib-Dassetto, Jules Cauzinille, Thierry Legou, Fabienne Delfour, and Marie Montant.
2025. Aquatic Mammals 51 (6): 515–20. — @HAL
Traitement de Séquences dans des modalités visuo-phonologique et visuo-motrice
Simon Thibault, LPL & CRPN
Forum des Sciences Cognitives de Marseille
Association Les Neuronautes
Combining Spatial Wavelets and Sparse Bayesian Learning for Extended Brain Sources Reconstruction

In the context of M/EEG source reconstruction, most distributed source models tend to strongly overestimate the spatial extent of brain activity and underestimate its depth and amplitude (Black contour line: boundary of the “aud-rh” region given by MNE-Python).
The top row shows the results of the MNE and eLORETA algorithms (implemented in MNE-Python with the default parameters), which suffer significantly from these limitations (although eLORETA manages to estimate depth). Sparse Bayesian Learning (SBL, bottom left) underestimates the spatial extent while accurately locating the activity. Combining SBL with spectral graph wavelets, as shown in the bottom right panel, correctly locates the activity (red and green lines are level curves corresponding respectively to 1% and 10% source amplitude levels), estimates its spatial extent and depth, and yields a quantitatively relevant amplitude estimate.
Combining Spatial Wavelets and Sparse Bayesian Learning for Extended Brain Sources Reconstruction
2025. IEEE Transactions on Biomedical Engineering, 1–12. — @HAL