Phonetic convergence in the shadowing for natural and synthesized speech in Polish
PDF

Keywords

shadowing
convergence
speech synthesis

How to Cite

Jankowska, K., Kuczmarski, T., & Demenko, G. (2020). Phonetic convergence in the shadowing for natural and synthesized speech in Polish. Lingua Posnaniensis, 62(2), 7–17. https://doi.org/10.2478/linpo-2020-0008

Abstract

The matter of shadowing natural speech has been discussed in many studies and papers. However, there is very little knowledge of human phonetical convergence to synthesized speech. To find out more about this issue an experiment in the Polish language was conducted. Two types of stimuli were used – natural speech and synthesised speech. Five sets of sentences with various phonetic phenomena in Polish were prepared. A group of twenty persons were recorded which gave the total number of 100 samples for each phenomenon. The summary of results shows convergence in both natural and synthesised speech in set number 1, 2, 4 while in group 3 and 5 the convergence was not observed. The baseline production shown that the great majority of participants prefer ɛn/ɛm version of phonetic feature which was reflected in 83 out of 100 sentences. In the shadowing natural speech participants changed ɛn/ɛm to ɛw/ɛ̃ in 26 cases and in 4 ɛw/ɛ̃ to ɛn/ɛm. When shadowing synthesised speech shift from ɛn/ɛm to ɛw/ɛ̃ in 18 sentences and 4 from ɛw/ɛ̃ to ɛn/ɛm. The intonation convergence was also observed in the perceptual analysis, however the analysis of F0 statistics did not show statistically significant differences.

https://doi.org/10.2478/linpo-2020-0008
PDF

References

Breuer, Stefan & Stober, Karlheinz & Wagner, Petra & Abresch, Julia. 2000. Dokumentation zum Bonn Open Synthesis System BOSS II, Unveroffentliches Dokument, IKP. http://www.ikp.uni-bonn.de/. (Accessed 2010--09-19.)

Demenko, Grażyna & Wypych, Mikołaj & Baranowska, Emilia. 2003. Implementation of grapheme-to-phoneme rules and extended SAMPA alphabet in Polish text-to-speech synthesis. Speech and Language Technology 7. 79-97.

Demenko, Grażyna & Klessa, Katarzyna & Szymański, Marcin & Bachan, Jolanta. 2007. The design of Polish speech corpora for speech synthesis in BOSS system (Paper presented at the conference of XII Sympozjum „Podstawowe Problemy Energoelektroniki, Elektromechaniki i Mechatroniki” PPEE m, Wisła 2007).

Gessinger, Iona & Raveh, Eran & Le Maguer, Sébastien & Möbius, Bernd & Steiner, Ingmar. 2017. Shadowing Synthesized Speech – Segmental Analysis of Phonetic Convergence (Paper presented at the conference of 18th Annual Conference of the International Speech Communication Association Stockholm, August 20-24, 2017).

Jassem, Wiktor. 2003. Polish. Journal of the International Phonetic Association. 103-107.

Kuczmarski, Tomasz. 2010. HMM-based Speech Synthesis Applied to Polish. Speech and Language Technology. Ed. Demenko, Grażyna & Wagner, Agnieszka. Poznań: Polish Phonetic Association, 2009/2010. 221-228.

Lison, Pierre & Meena Raveesh. 2014. Spoken dialogue systems: the new frontier in human-computer interaction. Crossroads, The ACM Magazine for Students. 46-51.

Nowakowski, Paweł & Wiatrowski, Przemysław. 2013. Informacje fonetyczno-ortograficzne w podręczniku Kultura języka polskiego. Wymowa, ortografia, interpunkcja. Slavia Occidentalis. 87-100.

Pardo, Jennifer S. 2013. Phonetic convergence in shadowed speech: A comparison of perceptual and acoustic measures (Paper presented at the conference of 14th Annual Conference of the International Speech Communication Association Lyon, August 25-29, 2013).

Rojczyk, Arkadiusz. 2013. Phonetic imitation of L2 vowels in a rapid shadowing task. Proceedings of the 4th Pronunciation in Second Language Learning and Teaching Conference. 66-76.

Sabahi, Shahab. My Voice Analysis. March 2019. https://github.com/Shahabks/my-voice-analysis (Accessed 2019-12-03.)

Wagner, Agnieszka. 2015. Description of vowels in general and of Polish vowels in detail. https://agnieszkawagner.weebly.com/uploads/1/5/4/8/15489492/vowels2015.pdf (Accessed 2020-01-31.)

Zen, Heiga & Nose, Takashi & Yamagishi, Junichi & Sako, Shinji & Masuko, Takashi & Black, Alan & Tokuda, Keiichi. 2007. The HMM-based speech synthesis system (HT S) version 2.0. Proceedings of 6th ISCA Workshop on Speech Synthesis (SSW-6). 294-299.