People with articulation and phonological disorders need training to plan and to execute sounds of speech. Compared to other children, children with Down Syndrome have significantly delayed speech development because they present developmental disabilities, mainly apraxia of speech. In practice, speech therapists plan and perform trainings of articulatory and non-articulatory sounds such as blow production and popping lips in order to assist speech production. Mobile applications can be integrated into the clinical treatment to transcend the boundaries of clinics and schedules and therefore reach more people at any time. The use of artificial intelligence and machine learning techniques can improve this kind of application. The aim of this pilot study is to assess speech recognition methods prioritizing the training of sounds for speech production, particularly the non-articulatory sounds. These methods apply Mel-Frequency Cepstrum Coefficients and Laplace transform to extract features, as well as traditional statistical tests and Support Vector Machine (SVM) to recognize sounds. This study also reports experimental results regarding the effectiveness of the methods on a set of 197 sounds. Overall, SVM provides higher accuracy.
CITATION STYLE
Souza, F. C. M., Souza, A. C. C., Nakamura, G. M., Soares, M. D., Mandrá, P. P., & Macedo, A. A. (2018). Investigating the Recognition of Non-articulatory Sounds by Using Statistical Tests and Support Vector Machine. In Advances in Intelligent Systems and Computing (Vol. 738, pp. 639–649). Springer Verlag. https://doi.org/10.1007/978-3-319-77028-4_82
Mendeley helps you to discover research relevant for your work.