DOI : https://doi.org/10.15565/jll.2018.03.73.31
Title : A study on the processing of n-insertion in natural TTS implementation - Focusing on a model presentation that harmonizes standard and real pronunciation
Author : 임현열(Hyeon-yeol IM)
Journal : 語文論集
Vol. : 73 No.- [2018]
pages : 31-59
Published by : 중앙어문학회(The Society of Chung-Ang Lang. & Lit.)
Date : 2018.03
Register Information : KCI
etc. :
-----------------------------------------------------------------------
<Abstract>
In this study, we first confirm that n-insertion pronunciation is an optional phonological phenomenon; thereafter, we assess the realization pattern of n-insertion pronunciations by examining the results of past pronunciation studies. In addition, we examine how n-insertion words are pronounced in text-to-speech (TTS) utterances provided by Google Translator and Naver Papago, revealing that the n-insertion pronunciations of TTS utterances are unrealistically realized and that the pronunciation is not properly reflected.
Based on this, we propose an n-insertion method for natural TTS implementation. The main content is to construct a real pronunciation dictionary, reflect it in the TTS algorithm, and adjust the rate of applying real pronunciation according to user preference. In other word, to implement a natural TTS pronunciation, the pronunciation of the words placed in the n-insertion environment is basically calculated according to the standard pronunciation. However, since the degree of preference of the standard pronunciation or the degree of preference of the real pronunciation is different in relation to the n-insertion pronunciation according to the user, we suggest a method that allows the user to use the TTS by adjusting the n-insertion pronunciation preference.
Key words : TTS(Text-to-speech), n-insertion, optional phonological phenomenon, standard pronunciation, realistic pronunciation, pronunciation preference, speech synthesis, speech production
|