Perception of time-compressed speech: an experimental study

Authors

DOI:

https://doi.org/10.5007/1984-8420.2023.e92124

Keywords:

Speech rate, Acceptability, Intelligibility

Abstract

This paper aims to identify which speech rate values are considered acceptable and from which point the speech rate makes the utterance content unintelligible in Brazilian Portuguese (BP). Therefore, it reports an experimental study on the impact of time-compressed speech on the acceptability and intelligibility of utterances in BP. For the experiments, short audio sentences containing warning messages were used as stimuli. These sentences were recorded in a natural speech rate and then digitally manipulated to faster rates in a scalar fashion (from 9 to 19 syllables per second). Intelligibility and acceptability tests were then conducted with blind and sighted subjects. The results indicate that time-compressed speech has a significant impact on both acceptability and intelligibility of utterances for both groups of participants and that while blind subjects tended to give slightly higher acceptability rates across all speech rate conditions, sighted subjects performed better in the intelligibility experiment, what contradicts a trend that is often reported in the literature.

Author Biographies

René Alain Santana de Almeida, Universidade Federal do Recôncavo da Bahia

Doutor em Letras e Linguística pela Universidade Federal de Alagoas e docente do Curso de Licenciatura em Letras da Universidade Federal do Recôncavo da Bahia. Atua principalmente nas seguintes áreas de pesquisa: língua portuguesa, prosódia, fonética experimental e psicolinguística.

Miguel Oliveira Jr, Universidade Federal de Alagoas

Doutor em Linguística pela Simon Fraser University (Vancouver, Canadá) e docente da Faculdade de Letras (FALE-UFAL) e do Programa de Pós-graduação em Letras e Linguística da Universidade Federal de Alagoas. Atua principalmente nas seguintes áreas de pesquisa: prosódia, fonética experimental, psicolinguística e documentação linguística.

Ayane Nazarela Santos de Almeida, Universidade Federal do Recôncavo da Bahia

Doutora em Letras e Linguística pela Universidade Federal de Alagoas e docente do curso de Licenciatura em Letras da Universidade Federal do Recôncavo da Bahia. Atua principalmente nas seguintes áreas de pesquisa: língua portuguesa, prosódia, fonética experimental e ensino de língua portuguesa para Surdos.

Oyedeji Musiliyu, Canada Revenue Agency

Doutor em Letras e Linguística pela Universidade Federal de Alagoas e servidor da Canada Revenue Agency. Atua principalmente nas seguintes áreas de pesquisa: prosódia, fonética experimental e psicolinguística.

References

ANVISA. Regulation about pharmaceutical drug advertising. Disponível em: http://portal.anvisa.gov.br/wps/wcm/connect/b12a03004745973d9f9adf3fbc4c6735/rdc_9608_comentada.pdf?MOD=AJPERES. Acesso em: 04 fev. 2015.

ASAKAWA, C. et al. Maximum listening speeds for the blind. Proceedings of the 2003 International Conference on Auditory Display, (ICAD03), Boston, p. 276-279, 2003.

BADDELEY, A. D.; HITCH, G. J. Development of working memory: should the Pascual Leone and the Baddeley and Hitch models be merged? Journal of Experimental Child Psychology, v. 77, n. 2, p. 128-137, 2000.

BEATTY, M. J.; BEHNKE, R. R.; FROELICH, D. L. Effects of achievement incentive and presentation rate on listening comprehension. Quarterly Journal of Speech, v. 66, n. 2, p. 193-200, 1980.

BLAAUW, E. On the perceptual classification of spontaneous and read speech. Research Institute for Language and Speech, Utrecht University, 1995.

BOERSMA, P.; WEENINK, D. Praat: doing phonetics by computer (Versão 6.0.36) [Computer program]. 2017. Disponível em: http://www.praat.org/.

BOROUJENI, F. M. et al. Comparison of auditory stream segregation in sighted and early blind individuals. Neuroscience letters, v. 638, p. 218-221, 2017.

CARNE, E. B. A professional’s guide to data communication in a TCP/IP world. Boston: Artech House, 2004.

CORRETGE, R. Praat vocal toolkit [Computer Software]. 2012. Disponível em: < http://www.praatvocaltoolkit.com/.

DAGENAIS, P.; BROWN, G.; MOORE, R. Speech rate effects upon intelligibility and acceptability of dysarthric speech. Clinical Linguistics & Phonetics, v. 20, n. 2-3, p. 141-148, 2006.

DIETRICH, S.; HERTRICH, I.; ACKERMANN, H. Training of ultra-fast speech comprehension induces functional reorganization of the central-visual system in late-blind humans. Frontiers in Human Neuroscience, v. 7, article 701, 2013a.

DIETRICH, S.; HERTRICH, I.; ACKERMANN, H. Ultra-fast speech comprehension in blind subjects engages primary visual cortex, fusiform gyrus, and pulvinar - a functional magnetic resonance imaging (fMRI) study. BMC Neuroscience, v. 14, article 74, 2013b.

DONZEL, M. van. Prosodic Aspects of Information Structure in Discourse. Thesis (PhD). Faculteit der Geesteswetenschappen, University of Amsterdam, Amsterdam, 1999.

FAIRBANKS, G.; EVERITT, W. L.; JAEGER, R. P. Method for time or frequency compression-expansion of speech. In: DUKER, Sam (Ed.), Time-compressed speech, v. 1, p. 172-180. Metuchen, N.J.: Scarecrow, 1974.

FENG, J. et al. Effect of blindness on mismatch responses to Mandarin lexical tones, consonants, and vowels. Hearing Research, v. 371, p. 87-97, 2019.

FON, J. Speech rate as a reflection of variance and invariance in conceptual planning in storytelling. Proceedings of the 14th International Congress of Phonetic Sciences (ICPhS), San Francisco, v. 14, n. 1, p. 663-666, 1999.

FOULKE, E. The comprehension of rapid speech by the blind: part III, 1969. Disponível em: http://files.eric.ed.gov/fulltext/ED034346.pdf. Acesso em: 25 fev. 2015.

GARVEY, W. D. The intelligibility of speeded speech. Journal of Experimental Psychology, vol. 45, n. 2, p. 102-108, 1953.

GHITZA, O.; GREENBERG, S. On the possible role of brain rhythms in speech perception: intelligibility of time-compressed speech with periodic and aperiodic insertions of silence. Phonetica, v. 66, n. 1-2, p. 113-126, 2009.

JANSE, E. Production and perception of fast speech. Utrecht: LOT, 2003.

JANSE, E.; WERFF, M. van; QUENÉ, H. Listening to fast speech: aging and sentence context. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, p. 681-684, 2007.

KING, P. E.; BEHNKE, R. R. The Effect of Time‐Compressed Speech on Comprehensive, Interpretive, and Short‐Term Listening. Human Communication Research, v. 15, n. 3, p. 428-443, 1989.

KOWAL, S.; WIESE, R.; O’CONNELL, D. The use of time in storytelling. Language and Speech, vol. 26, n. 4, p. 377-392, 1983.

LOIOTILE, R. et al. Enhanced performance on a sentence comprehension task in congenitally blind adults. Language, cognition and neuroscience, v. 35, n. 8, p. 1010-1023, 2020.

MARTINS, V.; ANDRADE, C. R. F. de. Perfil evolutivo da fluência da fala de falantes do português brasileiro. Pró-Fono Revista de Atualização Científica, vol. 20, n. 1, p. 7-12, 2008.

MOOS, A. et al. Perception of Ultra-Fast Speech by a Blind Listener – Does He Use His Visual System? Proceedings of the 8th International Seminar on Speech Production, ISSP, p. 297-300, 2008.

MOOS, A.; TROUVAIN, J. Comprehension of ultra-fast speech-blind vs. “normally hearing” persons. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, p. 677-680, 2007.

O’CONNELL, D.; KOWAL, S. Cross-linguistic pause and rate phenomena in adults and adolescents. Journal of Psycholinguistic Research, v. 1, n. 2, p. 155-164, 1972.

OLIVEIRA JR, M. Prosodic Features in Spontaneous Narratives. Thesis (PhD). Simon Fraser University, Vanvouver, Canada, 2000.

SCHMIDT-NIELSEN, A. Intelligibility and Acceptability testing for Speech Technology. Naval Research Laboratory, 1992. Disponível em: http://www.dtic.mil/dtic/tr/fulltext/u2/a252015.pdf. Acesso em: 24 fev. 2015.

SUTTON, B. et al. Younger and older adults rate performance when listening to synthetic speech. Augmentative and Alternative Communication, v. 11, n. 3, p. 147-153, 1995.

TROUVAIN, J. On the comprehension of extremely fast synthetic speech. Saarland Working Papers in Linguistics (SWPL), v. 1, p. 5-13, 2007.

Published

2023-08-07

Issue

Section

Artigos