Whetten, Ryan Parcollet, Titouan Moumen, Adel Dinarelli, Marco Estève, Yannick
Self-Supervised Learning (SSL) has proven to be effective in various domains, including speech processing. However, SSL is computationally and memory expensive. This is in part due the quadratic complexity of multi-head self-attention (MHSA). Alternatives for MHSA have been proposed and used in the speech domain, but have yet to be investigated pro...
Hayler, Raymond Charters, Emma Coulson, Susan Hubert Low, Tsu-Hui
Published in
International journal of speech-language pathology
Facial nerve palsy (FNP) affects physical and social function, including speech. There exists discrepancy between professional and patient perception of appearance following FNP; however, speech differences remain unknown. We aimed to compare ratings of speech intelligibility by different listeners. Patients were identified through the Sydney Facia...
Vainio, Lari Kilpeläinen, Markku Wikström, Alexandra Vainio, Martti
Published in
Language and speech
Previous investigations have shown various interactions between spatial concepts and speech sounds. For instance, the front-high vowel [i] is associated with the concept of forward, and the back-high vowel [o] is associated with the concept of backward. Three experiments investigated whether the concepts of forward/front and backward/back are assoc...
Picanço Marchand, Daniel Lucas Rodrigues Carvalho, Lucas Sávio de Souza Leal, Diego Gonçalves Câmara, Sheila Cassol, Mauriceia
Published in
Logopedics, phoniatrics, vocology
Presentations to audiences are often seen as challenging by university students, causing physiological reactivity on cortisol levels and heart rate, for example. Many students perceive that they have difficulties expressing themselves or do not consider themselves to be good communicators. With the thought that efficient communication is able to br...
illner, v. novotný, m. kouba, t. tykalová, t. šimek, m. sovka, p. švihlík, j. růžička, e. šonka, k. dušek, p.
...
Background: Speech dysfunction represents one of the initial motor manifestations to develop in Parkinson's disease (PD) and is measurable through smartphone. Objective: The aim was to develop a fully automated and noise-resistant smartphone-based system that can unobtrusively screen for prodromal parkinsonian speech disorder in subjects with isola...
Kleiman, Michael J Galvin, James E
Published in
Journal of Alzheimer's disease : JAD
Alzheimer's disease (AD) is characterized by progressive cognitive decline, including impairments in speech production and fluency. Mild cognitive impairment (MCI), a prodrome of AD, has also been linked with changes in speech behavior but to a more subtle degree. This study aimed to investigate whether speech behavior immediately following both fi...
Perry, Jamie L Gilbert, Imani R Xing, Fangxu Jin, Riwei Kuehn, David P Shosted, Ryan K Woo, Jonghye Liang, Zhi-Pei Sutton, Bradley P
Published in
The Cleft palate-craniofacial journal : official publication of the American Cleft Palate-Craniofacial Association
To introduce a highly innovative imaging method to study the complex velopharyngeal (VP) system and introduce the potential future clinical applications of a VP atlas in cleft care. Four healthy adults participated in a 20-min dynamic magnetic resonance imaging scan that included a high-resolution T2-weighted turbo-spin-echo 3D structural scan and ...
núñez-vidal, esther fernández-ruiz, raúl álvarez-marquina, agustín hidalgo-delaguía, irene garayzábal-heinze, elena hristov-kalamov, nikola domínguez-mateos, francisco conde, cristina martínez-olalla, rafael
Smith–Magenis syndrome (SMS) is a rare, underdiagnosed condition due to limited public awareness of genetic testing and a lengthy diagnostic process. Voice analysis can be a noninvasive tool for monitoring and detecting SMS. In this paper, the cepstral peak prominence and mel-frequency cepstral coefficients are used as disease monitoring and detect...
yousufi, musyyab damaševičius, robertas maskeliūnas, rytis
Background/Objectives: This study investigates the classification of Major Depressive Disorder (MDD) using electroencephalography (EEG) Short-Time Fourier-Transform (STFT) spectrograms and audio Mel-spectrogram data of 52 subjects. The objective is to develop a multimodal classification model that integrates audio and EEG data to accurately identif...
carreiro-martins, pedro paixão, paulo caires, iolanda matias, pedro gamboa, hugo soares, filipe gomez, pedro sousa, joana neuparth, nuno
Background/Objectives: The interest in processing human speech and other human-generated audio signals as a diagnostic tool has increased due to the COVID-19 pandemic. The project OSCAR (vOice Screening of CoronA viRus) aimed to develop an algorithm to screen for COVID-19 using a dataset of Portuguese participants with voice recordings and clinical...